Current Project: MCP Server-based LLM-based AI agent for Diabetes Monitoring(2025-present)

Built a Diabetes Monitoring AI agent that connects real patient-generated data from Apple Health to an Amazon RDS MySQL backend and exposes it to AI through the Model Context Protocol. The server provides five callable MCP tools that enable AI clients like Claude Desktop to retrieve glucose, sleep, and exercise records, detect temporal and behavioral patterns, and compute correlations between lifestyle factors and glucose outcomes using techniques such as Pearson’s correlation. It uses Python, SQLAlchemy ORM, PyMySQL, and environment-based configuration to securely query timestamped CGM readings, sleep-stage breakdowns, and workout durations. The system enables end-to-end analytics, Apple Health data ingestion, typed MySQL storage, and AI-driven insight generation—without dashboards or manual preprocessing, accelerating experimentation in HealthTech workflows.
[GitHub]


Health Data Export System (Aug 2025 - Present)
Health Data Export System is a scalable data ingestion and analytics pipeline designed to process real-time health data exported from the iPhone Auto Export (Apple Health) app. The system supports two ingestion modes: a Flask-based REST API server for direct JSON uploads and a FastAPI gateway that forwards requests to a backend gRPC service for high-throughput, low-latency processing. Incoming health metrics are validated, transformed, and written to an AWS RDS MySQL database, with Pandas used for time-series processing of glucose and insulin events.
[GitHub]


AI-powered HackerRank Candidate Screening Agent (Nov 2025 - Dec 2025)
Built an AI-powered HackerRank Candidate Screening Agent using MCP, an end-to-end automation system that streamlines technical hiring by orchestrating real production workflows through Anthropic’s Model Context Protocol. The agent fetches and paginates candidate results via HackerRank’s REST APIs, applies configurable pass thresholds and ranking logic, automatically advances top performers to harder assessments, sends personalized email notifications, and schedules recruiter interviews using Google Calendar. Core screening actions are exposed as MCP tools, enabling assistants like Claude Desktop to safely trigger backend workflows.
[GitHub]


Platform Migration (2024-2025)
This project examines the impact of platform migration on the specific community during their transition from Twitter to alternative platforms like Parler and Dotwin following initial widespread account bans on Twitter. We examine how community dynamics—including user roles and activities—evolved amidst this migration by analyzing their activities on Twitter, Parler, and Dotwin in the months leading up to their ban. We assess user engagement and influence changes, categorizing users into five distinct roles: ’common users,’ ‘broadcasters,’ ’influentials,’ ’hidden influentials,’ and ’lurkers.’ Conducted temporal analyses of weekly fluctuations in user activity, revealing significant trends and patterns in engagement across mainstream and alternative platforms. Introduced lexical analysis to examine the nature of conversations, categorizing content into themes such as violence and conspiracy to uncover underlying narratives within the specific community.
[GitHub]

The findings of this project were selected for a poster presentation at ACM GROUP 2025[PDF], and a full paper was submitted to ACM CSCW 2025[PDF], which is under review.

Machine Learning project (2024)
Achieved a 22nd rank out of 142 students in a Kaggle competition centered on the Old Bailey dataset, a binary classification problem. Implemented various machine learning models, including perceptrons, support vector machines (SVMs), logistic regression, and ensemble methods like AdaBoost. Focused on data preprocessing by integrating multiple datasets (e.g., bag-of-words, TF-IDF, and GloVe embeddings), handling missing values, and applying label encoding to categorical features. Tuned hyperparameters extensively and optimized performance by combining datasets and utilizing advanced ensemble techniques. This project demonstrated my ability to apply machine learning concepts to a complex real-world dataset.
[GitHub]


Machine Learning assignments (2024)
Various machine learning techniques used:

  1. Decision Trees: I have developed expertise in constructing decision trees using features, calculating entropy and information gain, implementing the ID3 algorithm, and managing tree depth to prevent overfitting.

  2. PAC Learnability and VC Dimension: My knowledge includes theoretical aspects of machine learning, such as PAC learnability, concept classes, sample complexity bounds, and analyzing the VC dimension of hypothesis spaces.

  3. Logistic Regression: I have worked on optimizing logistic regression models using stochastic gradient descent (SGD), applying regularization techniques, and fine-tuning parameters for enhanced performance.

  4. Support Vector Machines (SVMs): I have implemented SVMs using SGD, conducted hyperparameter tuning, and evaluated models with metrics like precision, recall, and F1 scores to ensure robust performance.

  5. Ensemble Methods: My experience includes applying ensemble techniques like bagging and combining decision trees with SVMs to improve predictive accuracy and model robustness.

  6. Perceptron Algorithm and Variants: I have extensively worked on the Perceptron algorithm, exploring its variants such as margin Perceptron, averaged Perceptron, and aggressive updates, while incorporating cross-validation and hyperparameter tuning.

  7. Application to Real-world Data: I have applied these machine learning techniques to real-world datasets like the Mushroom Dataset and Diabetic Retinopathy data, showcasing my ability to translate theoretical knowledge into practical problem-solving.
    [GitHub]


Natural Language Processing (2024)
Developed and optimized NLP models, including logistic regression, feedforward neural networks, and transformers, for sentiment classification, language modeling, and text summarization tasks. Implemented advanced NLP techniques, such as feature extraction, deep averaging networks, transformer-based language modeling, and text generation evaluation using BLEU, ROUGE, and BERTScore.”
[GitHub]


The DIY approach in T1 diabetes (2023)
In this project, we aimed to develop a deep insight into how the type-1 diabetic(T1D) community works to solve their issues without institutional experts. This led to our research question: “How do the type-1 diabetic community create their own knowledge and solve their own issues.”
[PDF]


Assessing the Influence of the Covid 19 Pandemic on Indian Pharmaceutical Companies (2022)
Our objective is to study, analyze and draw inferences on the movement of the stock prices of Indian pharmaceutical companies solely based on the COVID-19 pandemic in India. We specifically targeted pharmaceutical stocks because their share price is more directly dependent on the COVID-19 pandemic than companies in other sectors. As the demand for the primary products sold by pharmaceutical companies, i.e., medicines, is directly dependent on the COVID-19 pandemic, a common hypothesis is that the stock prices of pharmaceutical companies at a given time are significantly contingent upon the COVID-19 pandemic situation. We have tested this hypothesis by calculating the correlation between pharmaceutical stock prices and COVID-19 variables that measure the severity and provide an outline of the COVID-19 pandemic. Furthermore, as human emotion plays a significant part in deciding the share prices, we have considered public fear and awareness by considering the frequency by which the terms “Covid 19” and “Covid medicines” are searched on Google. The data we have considered for our study belongs to the period from 15th March 2020 to 17th February 2022.
[GitHub] [PDF]