Projects

// GenAI & NLP

RAG / NLP

UK Visa RAG Chatbot

RAG chatbot that ingests the UK Immigration Rules via the GOV.UK API, embeds them with sentence-transformers (all-MiniLM-L6-v2), stores the vectors in FAISS/ChromaDB, and generates answers with source attribution through a Streamlit chat interface.
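
A minimal sketch of the retrieval step, under stated assumptions: a toy bag-of-words vector stands in for the all-MiniLM-L6-v2 embedding, a brute-force cosine search stands in for FAISS/ChromaDB, and the chunk texts, section labels, and function names are hypothetical rather than taken from the project.

```python
import math
from collections import Counter

def embed(text):
    """Toy bag-of-words vector; a stand-in for a sentence-transformers embedding."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def retrieve(query, chunks, k=2):
    """Return up to k chunks most similar to the query, each keeping its source label."""
    q = embed(query)
    scored = [(cosine(q, embed(c["text"])), c) for c in chunks]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [c for score, c in scored[:k] if score > 0]

def build_prompt(query, hits):
    """Prefix each retrieved chunk with its source so the answer can cite it."""
    context = "\n".join(f"[{c['source']}] {c['text']}" for c in hits)
    return f"Answer using only the context and cite the bracketed sources.\n{context}\nQuestion: {query}"

# Hypothetical chunks; the section labels are illustrative, not real rule references.
chunks = [
    {"source": "Section A", "text": "a student visa requires a confirmation of acceptance for studies"},
    {"source": "Section B", "text": "a skilled worker visa requires a certificate of sponsorship"},
    {"source": "Section C", "text": "visitors may stay for up to six months"},
]

hits = retrieve("what does a student visa require", chunks, k=1)
prompt = build_prompt("what does a student visa require", hits)
```

Carrying the source label alongside each chunk is what makes attribution possible: the generator only sees text that already names where it came from.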

RAG FAISS ChromaDB Streamlit
LLM Engineering

Local Peer Review

Automated academic peer-review generator that verifies citations, runs adversarial probes, and produces structured referee reports. Supports Ollama, OpenAI, Groq, and LM Studio backends with a Gradio web interface.

LLM Ollama Gradio Python
LLM Application

Resume Customizer

Automatically tailors LaTeX resumes to match job descriptions using the Google Gemini API, with structured output parsing and PDF generation.

Gemini API Python LaTeX
LLM Tooling

LaTeX for LLM

Converts LaTeX documents into a typed graph for targeted LLM context retrieval. Graph-based queries cut prompt size by ~54% on average (up to ~80% for focused queries). Features cross-reference resolution, inline math preservation, interactive force-directed graph visualization, and a Claude MCP server.
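
A rough sketch of the idea, assuming a much simpler schema than the project likely uses: sections become typed nodes, `\label`/`\ref` pairs become edges, and a query returns one section plus the sections it cross-references instead of the whole document. The function names and node layout here are hypothetical.

```python
import re

def build_graph(tex):
    """Split LaTeX source into section nodes and link them through label/ref pairs."""
    nodes, label_owner = {}, {}
    # re.split with one capture group yields [preamble, title1, body1, title2, body2, ...]
    parts = re.split(r"\\section\{([^}]*)\}", tex)
    for title, body in zip(parts[1::2], parts[2::2]):
        nodes[title] = {"type": "section", "text": body.strip(), "refs": set()}
        for label in re.findall(r"\\label\{([^}]*)\}", body):
            label_owner[label] = title
    # Resolve each \ref or \eqref to the section that owns the label.
    for title, node in nodes.items():
        for ref in re.findall(r"\\(?:eq)?ref\{([^}]*)\}", node["text"]):
            owner = label_owner.get(ref)
            if owner and owner != title:
                node["refs"].add(owner)
    return nodes

def context_for(nodes, title):
    """Return the requested section plus every section it cross-references."""
    wanted = [title] + sorted(nodes[title]["refs"])
    return "\n\n".join(nodes[t]["text"] for t in wanted)

tex = r"""
\section{Setup}
The loss is $L = x^2$. \label{eq:loss}
\section{Results}
Minimizing the loss from \ref{eq:loss} converges.
\section{Appendix}
Unrelated notes.
"""

nodes = build_graph(tex)
context = context_for(nodes, "Results")
```

Here a question about "Results" pulls in "Setup" through the resolved reference and leaves "Appendix" out, which is the mechanism behind the smaller prompts.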

Python MCP Graph NLP
Web App

arXiv RSS Filter

Streamlit app that filters and curates arXiv preprint RSS feeds, helping researchers stay on top of relevant new papers in their field.

Streamlit RSS Python

// Machine Learning

Classification

UK Accident Severity Classification

Dual-strategy LightGBM classifiers trained on 104K DfT STATS19 collisions merged with 190K vehicle records (42 features). The severe-optimized model reaches 0.527 macro recall at 66.8% accuracy, with threshold tuning for an adjustable recall-accuracy tradeoff. Results are benchmarked against published studies on the same dataset.
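
The threshold-tuning idea can be sketched as follows. This is a simplified binary illustration (severe vs. not severe) with made-up labels and probabilities, not the project's multi-class macro-recall setup or its data.

```python
def recall_and_accuracy(y_true, p_severe, threshold):
    """Recall on the severe class and overall accuracy at a given decision threshold."""
    pred = [1 if p >= threshold else 0 for p in p_severe]
    tp = sum(1 for y, yhat in zip(y_true, pred) if y == 1 and yhat == 1)
    fn = sum(1 for y, yhat in zip(y_true, pred) if y == 1 and yhat == 0)
    correct = sum(1 for y, yhat in zip(y_true, pred) if y == yhat)
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return recall, correct / len(y_true)

# Hypothetical labels (1 = severe) and predicted probabilities of the severe class.
y_true = [1, 0, 0, 1, 0, 0, 1, 0, 0, 0]
p_severe = [0.80, 0.45, 0.10, 0.35, 0.20, 0.55, 0.60, 0.05, 0.40, 0.32]

# Sweep thresholds to expose the tradeoff.
sweep = {t: recall_and_accuracy(y_true, p_severe, t) for t in (0.3, 0.5, 0.7)}
for t, (recall, accuracy) in sweep.items():
    print(f"threshold={t:.1f}  recall={recall:.2f}  accuracy={accuracy:.2f}")
```

On this toy data, lowering the threshold from 0.5 to 0.3 catches every severe case but misclassifies more non-severe ones, so recall rises while accuracy falls.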

LightGBM Feature Engineering Imbalanced Classification Streamlit
Reinforcement Learning

RL Traffic Signal Control

Deep RL agent that learns adaptive signal timing for a single intersection from real traffic flow data from the Transport for London API, replacing fixed-cycle control with a policy trained in PyTorch.

Deep RL PyTorch TfL API
Deep Learning

Deep Reinforcement Learning

Implementations from Deep RL in Action covering policy gradients, Q-learning, and multi-agent environments using PyTorch.

PyTorch Deep RL Neural Nets

// Quantitative Finance

Risk Analytics

Risk Management Toolkit

End-to-end risk analytics pipeline: raw price data is denoised with Kalman filters and wavelet transforms, fed into technical indicators (MACD, Bollinger Bands, PSAR, stochastic oscillators), then evaluated via historical and parametric VaR, CAPM beta, and Sharpe/Treynor/Jensen performance metrics.
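
For the VaR step, a minimal sketch of the two estimators, assuming one-period simple returns and a 5% tail. The return series is invented, the function names are hypothetical, and the empirical quantile uses one simple indexing convention among several in common use.

```python
from statistics import NormalDist, mean, stdev

def historical_var(returns, alpha=0.05):
    """Loss at the empirical alpha-quantile of returns, reported as a positive number."""
    ordered = sorted(returns)
    index = int(alpha * len(ordered))  # simple convention; interpolation schemes differ
    return -ordered[index]

def parametric_var(returns, alpha=0.05):
    """Variance-covariance VaR, assuming returns are normally distributed."""
    z = NormalDist().inv_cdf(alpha)  # about -1.645 for alpha = 0.05
    return -(mean(returns) + z * stdev(returns))

# Hypothetical daily returns.
returns = [-0.05, -0.03, -0.02, -0.01, -0.01, 0.00, 0.00, 0.01, 0.01, 0.01,
           0.01, 0.02, 0.02, 0.02, 0.03, 0.03, 0.03, 0.04, 0.04, 0.05]

hist = historical_var(returns)
para = parametric_var(returns)
```

The historical estimate reads the loss straight off the sorted sample, while the parametric one depends entirely on the normality assumption, so the two diverge when the return distribution has fat tails.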

VaR CAPM Kalman Filter Wavelets
Time Series

Time Series Forecasting

Comparative equity modeling: ARIMA for trend forecasting, GARCH for volatility clustering, CCC-GARCH for multi-asset portfolio-level forecasting, and Prophet for seasonality. Includes rolling & fixed window evaluation, bootstrap simulation, and model diagnostics on Yahoo Finance data.
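
Volatility clustering in a GARCH(1,1) model comes from the recursion sigma2[t] = omega + alpha * eps[t-1]^2 + beta * sigma2[t-1]. The sketch below runs that recursion for fixed, made-up parameters; it does not estimate them, which is the part a fitting library would normally handle, and the function name is hypothetical.

```python
def garch11_variance(residuals, omega, alpha, beta):
    """Conditional variance path for fixed GARCH(1,1) parameters (requires alpha + beta < 1)."""
    # Initialise at the unconditional variance omega / (1 - alpha - beta).
    sigma2 = [omega / (1 - alpha - beta)]
    for eps in residuals[:-1]:
        # Yesterday's squared shock and yesterday's variance drive today's variance.
        sigma2.append(omega + alpha * eps ** 2 + beta * sigma2[-1])
    return sigma2

# Hypothetical return residuals with one large shock in the second period.
residuals = [0.01, -0.04, 0.005, 0.002]
sigma2 = garch11_variance(residuals, omega=1e-5, alpha=0.10, beta=0.85)
```

The large shock at index 1 raises the variance at index 2, and the high beta makes that elevated level decay slowly, which is the clustering effect.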

ARIMA GARCH Prophet Forecasting
Backtesting

Backtesting Framework

Strategy backtesting engine built on VectorBT's vectorized execution, progressing from moving-average crossover strategies to volatility-regime-aware ML signals incorporating GARCH-estimated conditional variance.
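
As an illustration of the starting point, a moving-average crossover signal can be written in plain Python as below. This is a loop-based sketch with hypothetical names and prices; it does not use VectorBT, whose vectorized execution replaces this kind of loop.

```python
def sma(prices, window):
    """Simple moving average; None until a full window is available."""
    return [
        None if i + 1 < window else sum(prices[i + 1 - window : i + 1]) / window
        for i in range(len(prices))
    ]

def crossover_signals(prices, fast=2, slow=4):
    """+1 where the fast SMA crosses above the slow SMA, -1 where it crosses below."""
    f, s = sma(prices, fast), sma(prices, slow)
    signals = [0] * len(prices)
    for i in range(1, len(prices)):
        if s[i - 1] is None:  # slow average not yet defined on the previous bar
            continue
        prev, curr = f[i - 1] - s[i - 1], f[i] - s[i]
        if prev <= 0 < curr:
            signals[i] = 1
        elif prev >= 0 > curr:
            signals[i] = -1
    return signals

# Hypothetical price series: a dip, a recovery, then a sell-off.
prices = [10, 9, 8, 7, 8, 9, 10, 9, 7, 6]
signals = crossover_signals(prices)
```

On this series the sketch emits one entry during the recovery and one exit during the sell-off, with zeros everywhere else.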

VectorBT GARCH ML Signals
Algorithmic Trading

ML Trading Signals

Momentum-based trading signal generation using MA ribbons, distance metrics, correlation-based signals, and physics-inspired momentum measurements.

ML Momentum Signal Processing
Portfolio Theory

AI for Trading

Udacity nanodegree projects: momentum strategy, breakout strategy, and smart beta portfolio optimization with performance evaluation.

Momentum Smart Beta Portfolio
Web App

Mutual Fund Analyzer

Live Streamlit Cloud app that ingests Morningstar fund data to build multi-fund portfolios and analyze them across sectoral distribution, scheme comparison, valuations, and top company exposure.

Streamlit Altair Scikit-learn
Web App

NSE Stocks Dashboard

Streamlit dashboard fetching live data from the NSE API for real-time stock analysis, price monitoring, and technical charting of equities listed on the National Stock Exchange of India.

Streamlit Finance Dashboard

// Astrophysics Research

Astrophysics

AGN & Local Environment in HR5

Investigated the effects of local environment on active galactic nucleus (AGN) activity using the Horizon Run 5 cosmological simulation, with Python-based feature extraction pipelines on terabyte-scale data.

Python HPC Simulations
Astrophysics

Dust Radiative Transfer (HR5)

Modified the Powderday radiative transfer code for Horizon Run 5 AMR data. Built pipelines for galaxy selection, sub-cube extraction from HDF5 snapshots, and SED computation with MPI parallelization.

Python Cython MPI HDF5
Astrophysics

Cosmic Web Analysis at Gigaparsec Scales

Developed a multi-block approach for the DisPerSE cosmic web finder, enabling analysis at unprecedented gigaparsec scales.

Python C HPC Algorithms
Astrophysics

Ram Pressure Stripping Models

Analytical and simulation-based study of how ram pressure stripping depends on the orbital parameters of galaxies.

Python Analytics Simulations