Projects

// GenAI & NLP

RAG / NLP

UK Visa RAG Chatbot

RAG chatbot that ingests the UK Immigration Rules via the GOV.UK API, embeds them with sentence-transformers (all-MiniLM-L6-v2), stores the vectors in FAISS/ChromaDB, and generates answers with source attribution through a Streamlit chat interface.
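
The retrieval-with-attribution step can be illustrated with a small sketch. This is not the project's code: a toy bag-of-words vector stands in for the all-MiniLM-L6-v2 embeddings, a plain list stands in for FAISS/ChromaDB, and the passages, `source` names, and function names are all hypothetical placeholders rather than text from the Immigration Rules.

```python
import math
from collections import Counter

# Hypothetical placeholder passages; "source" stands in for a rule reference.
PASSAGES = [
    {"source": "example-rule-A",
     "text": "Placeholder passage about student visas and proof of study."},
    {"source": "example-rule-B",
     "text": "Placeholder passage about skilled worker visas and employer sponsorship."},
]

def embed(text):
    """Toy bag-of-words vector (assumption: stand-in for a sentence embedding)."""
    return Counter(text.lower().replace(".", "").split())

def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def retrieve(query, passages, k=1):
    """Return the k passages most similar to the query, each keeping its source."""
    q = embed(query)
    ranked = sorted(passages, key=lambda p: cosine(q, embed(p["text"])), reverse=True)
    return ranked[:k]

top = retrieve("skilled worker employer sponsorship", PASSAGES)
print(top[0]["source"])  # the source travels with the passage, so the answer can cite it
```

Keeping the source identifier attached to every stored passage is what makes attribution possible: whatever the generator writes, the retrieved passages already carry the reference to show alongside it.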

RAG FAISS ChromaDB Streamlit
LLM Engineering

Local Peer Review

Automated academic peer-review generator that verifies citations, runs adversarial probes, and produces structured referee reports. Supports Ollama, OpenAI, Groq, and LM Studio backends with a Gradio web interface.

LLM Ollama Gradio Python
LLM Application

Resume Customizer

Automatically tailors LaTeX resumes to match job descriptions using the Google Gemini API, with structured output parsing and PDF generation.

Gemini API Python LaTeX
LLM Tooling

LaTeX for LLM

Converts LaTeX documents into a typed graph for targeted LLM context retrieval. Graph-based queries cut prompt size by ~54% on average (up to ~80% for focused queries). Features cross-reference resolution, inline math preservation, interactive force-directed graph visualization, and a Claude MCP server.
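
To make the idea of a typed graph concrete, here is a minimal sketch under simplifying assumptions: it only recognises `\section`, `equation` environments, `\label`, and `\ref`, and it expects a label on the same line as the thing it labels. The sample document, the regexes, and the function names are illustrative, not the project's actual parser.

```python
import re
from collections import defaultdict

# Hypothetical sample document used only for illustration.
SAMPLE = r"""
\section{Setup}\label{sec:setup}
We define the loss in Eq.~\ref{eq:loss}.
\begin{equation}\label{eq:loss}
L = x^2
\end{equation}
\section{Results}\label{sec:results}
As in Section~\ref{sec:setup}, we minimise Eq.~\ref{eq:loss}.
"""

SECTION_RE = re.compile(r"\\section\{[^}]*\}\\label\{([^}]*)\}")
EQUATION_RE = re.compile(r"\\begin\{equation\}\\label\{([^}]*)\}")
REF_RE = re.compile(r"\\ref\{([^}]*)\}")

def build_graph(tex):
    """Return (nodes, edges): label -> node type, and section label -> referenced labels."""
    nodes = {}
    edges = defaultdict(set)
    current = None  # label of the section we are currently inside
    for line in tex.splitlines():
        sec = SECTION_RE.search(line)
        if sec:
            current = sec.group(1)
            nodes[current] = "section"
        eq = EQUATION_RE.search(line)
        if eq:
            nodes[eq.group(1)] = "equation"
        if current is not None:
            for target in REF_RE.findall(line):
                edges[current].add(target)
    return nodes, edges

def context_for(label, edges):
    """Labels to send to the LLM for a focused query: the node plus what it references."""
    return {label} | edges.get(label, set())

nodes, edges = build_graph(SAMPLE)
print(sorted(context_for("sec:setup", edges)))
```

A query about one section then pulls in only that section and the nodes it references, rather than the whole document, which is the mechanism behind the prompt-size reduction described above.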

Python MCP Graph NLP
Web App

arXiv RSS Filter

Streamlit app that filters and curates arXiv preprint RSS feeds, helping researchers stay on top of relevant new papers in their field.

Streamlit RSS Python

// Machine Learning

Classification

UK Accident Severity Classification

Dual-strategy ML system on 151K UK road accidents: an emergency response model (LightGBM, 92.4% severe recall) and a traffic management model (81% macro recall) using SMOTE+Tomek, ADASYN, and custom threshold optimization.
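
One simple form of threshold optimization is to pick the highest decision threshold that still meets a target recall on the severe class. The sketch below shows that idea only; the probabilities, labels, target value, and function names are made up for illustration, and the project's own optimization criterion may differ.

```python
def recall_at(threshold, probs, labels):
    """Recall of the positive class when predicting positive for p >= threshold."""
    tp = sum(1 for p, y in zip(probs, labels) if y == 1 and p >= threshold)
    positives = sum(labels)
    return tp / positives if positives else 0.0

def pick_threshold(probs, labels, target_recall):
    """Highest candidate threshold whose recall meets the target (0.0 if none does)."""
    for t in sorted(set(probs), reverse=True):
        if recall_at(t, probs, labels) >= target_recall:
            return t
    return 0.0

# Illustrative model scores and ground truth (1 = severe accident).
probs = [0.95, 0.80, 0.65, 0.40, 0.30, 0.10]
labels = [1, 1, 0, 1, 0, 0]

threshold = pick_threshold(probs, labels, target_recall=0.9)
print(threshold, recall_at(threshold, probs, labels))
```

Lowering the threshold trades precision for recall, which suits an emergency-response setting where missing a severe case costs more than a false alarm.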

LightGBM Scikit-learn SMOTE ADASYN
Reinforcement Learning

RL Traffic Signal Control

Trained a reinforcement learning agent to control the traffic light at a single intersection, using real traffic patterns from the TfL API, with the goal of minimizing vehicle waiting time.

Deep RL PyTorch TfL API
Deep Learning

Deep Reinforcement Learning

Implementations from Deep RL in Action covering policy gradients, Q-learning, and multi-agent environments using PyTorch.
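
As a reference point for the Q-learning material, here is the tabular update rule in plain Python. It is a deliberately simplified sketch: the implementations described above use PyTorch networks rather than a lookup table, and the state names, action names, and hyperparameters here are hypothetical.

```python
def q_update(q, state, action, reward, next_state, alpha=0.5, gamma=0.9):
    """One Q-learning step: Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))."""
    next_values = q.get(next_state)
    best_next = max(next_values.values()) if next_values else 0.0  # terminal or unseen state
    q[state][action] += alpha * (reward + gamma * best_next - q[state][action])
    return q[state][action]

# Hypothetical two-state table for illustration.
q = {
    "s0": {"left": 0.0, "right": 0.0},
    "s1": {"left": 1.0, "right": 2.0},
}

updated = q_update(q, "s0", "right", reward=1.0, next_state="s1")
print(updated)
```

Deep Q-learning keeps the same target, `reward + gamma * max Q(next_state, ·)`, but regresses a neural network toward it instead of overwriting a table entry.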

PyTorch Deep RL Neural Nets

// Quantitative Finance

Risk Analytics

Risk Management Toolkit

Comprehensive risk analysis: MACD, Bollinger Bands, PSAR, stochastic oscillators, CAPM, Sharpe/Treynor/Jensen metrics, historical & parametric VaR, plus signal denoising with Kalman filters and wavelet transforms.
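
The difference between historical and parametric VaR is easy to see side by side. The sketch below uses only the standard library; the return series is invented for illustration, the parametric version assumes normally distributed returns, and the function names are not taken from the toolkit.

```python
from statistics import NormalDist, mean, stdev

def historical_var(returns, confidence=0.95):
    """Empirical VaR: the loss at the (1 - confidence) quantile, as a positive number."""
    ordered = sorted(returns)
    # round() guards against floating-point noise in (1 - confidence) * n
    index = int(round((1 - confidence) * len(ordered), 9))
    return -ordered[index]

def parametric_var(returns, confidence=0.95):
    """Normal (variance-covariance) VaR: -(mu + z * sigma) with z the lower-tail quantile."""
    z = NormalDist().inv_cdf(1 - confidence)
    return -(mean(returns) + z * stdev(returns))

# Illustrative daily returns (not real market data).
returns = [-0.040, -0.025, -0.015, -0.010, -0.008, -0.005, -0.003, -0.001, 0.000, 0.001,
           0.002, 0.004, 0.005, 0.007, 0.009, 0.011, 0.013, 0.016, 0.020, 0.030]

print(historical_var(returns), parametric_var(returns))
```

Historical VaR reads the loss straight off the sorted sample, so it reflects fat tails that are actually in the data; parametric VaR summarises the sample by its mean and standard deviation, which is smoother but only as good as the normality assumption.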

VaR CAPM Kalman Filter Wavelets
Time Series

Time Series Forecasting

Equity price modeling with ARIMA, GARCH, CCC-GARCH, and Prophet. Rolling & fixed window forecasting, bootstrap simulation, and volatility analysis using Yahoo Finance data.
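
For readers unfamiliar with GARCH, the core of a GARCH(1,1) model is a one-line variance recursion. The sketch below applies it with fixed, illustrative parameters; it assumes zero-mean returns and does not estimate `omega`, `alpha`, or `beta`, which a fitted model would obtain by maximum likelihood.

```python
def garch11_variance(returns, omega, alpha, beta):
    """Conditional variances: sigma2[t] = omega + alpha * r[t-1]**2 + beta * sigma2[t-1].

    The series is seeded with the unconditional variance omega / (1 - alpha - beta),
    which requires alpha + beta < 1.
    """
    sigma2 = [omega / (1 - alpha - beta)]
    for r in returns[:-1]:
        sigma2.append(omega + alpha * r * r + beta * sigma2[-1])
    return sigma2

# Illustrative returns and parameters (assumptions, not fitted values).
returns = [0.01, -0.02, 0.015]
variances = garch11_variance(returns, omega=1e-5, alpha=0.1, beta=0.8)
print(variances)
```

A large return in one period raises the variance forecast for the next through the `alpha` term, and the `beta` term makes that elevated variance decay gradually, which is how the model captures volatility clustering.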

ARIMA GARCH Prophet Forecasting
Backtesting

Backtesting Framework

Strategy backtesting engine using VectorBT, progressing from simple MA-based strategies to ML-enhanced signals with volatility-adjusted GARCH components.
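
The simplest strategy in that progression, a moving-average crossover, can be written without any library. This sketch is plain Python rather than VectorBT, and the price series, window lengths, and function names are illustrative assumptions.

```python
def sma(prices, window):
    """Simple moving average; None until a full window is available."""
    out = []
    for i in range(len(prices)):
        if i + 1 < window:
            out.append(None)
        else:
            out.append(sum(prices[i + 1 - window:i + 1]) / window)
    return out

def crossover_signals(prices, fast=2, slow=3):
    """+1 when the fast SMA crosses above the slow SMA, -1 when it crosses below, else 0."""
    f, s = sma(prices, fast), sma(prices, slow)
    signals = [0] * len(prices)
    for i in range(1, len(prices)):
        if None in (f[i - 1], s[i - 1], f[i], s[i]):
            continue  # not enough history yet
        prev, curr = f[i - 1] - s[i - 1], f[i] - s[i]
        if prev <= 0 < curr:
            signals[i] = 1
        elif prev >= 0 > curr:
            signals[i] = -1
    return signals

prices = [10, 9, 8, 9, 11, 12, 10, 8]
signals = crossover_signals(prices)
print(signals)
```

A backtesting engine takes entry and exit signals like these and simulates the resulting positions over the price history; richer signals (ML-based or volatility-adjusted) slot into the same place.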

VectorBT GARCH ML Signals
Algorithmic Trading

ML Trading Signals

Momentum-based trading signal generation using MA ribbons, distance metrics, correlation-based signals, and physics-inspired momentum measurements.

ML Momentum Signal Processing
Portfolio Theory

AI for Trading

Udacity nanodegree projects: momentum strategy, breakout strategy, and smart beta portfolio optimization with performance evaluation.

Momentum Smart Beta Portfolio
Web App

Mutual Fund Analyzer

Deployed Streamlit app using Morningstar data for portfolio analysis: sectoral distribution, scheme comparison, valuations, and company exposure tracking.

Streamlit Altair Scikit-learn
Web App

NSE Stocks Dashboard

Streamlit dashboard for real-time stock analysis and monitoring of equities listed on the National Stock Exchange of India.

Streamlit Finance Dashboard

// Astrophysics Research

Astrophysics

AGN & Local Environment in HR5

Investigated the effects of local environment on Active Galactic Nuclei activity using the Horizon Run 5 cosmological simulation, with Python-based feature extraction pipelines running on terabyte-scale data.

Python HPC Simulations
Astrophysics

Dust Radiative Transfer (HR5)

Modified the Powderday radiative transfer code for Horizon Run 5 AMR data. Built pipelines for galaxy selection, sub-cube extraction from HDF5 snapshots, and SED computation with MPI parallelization.

Python Cython MPI HDF5
Astrophysics

Cosmic Web Analysis at Gigaparsec Scales

Developed a multi-block approach for the DisPerSE cosmic web finder enabling analysis at unprecedented gigaparsec scales.

Python C HPC Algorithms
Astrophysics

Ram Pressure Stripping Models

Analytical and simulation-based study of how ram pressure stripping depends on the orbital parameters of galaxies.

Python Analytics Simulations