Projects
// GenAI & NLP
UK Visa RAG Chatbot
RAG chatbot that ingests UK Immigration Rules via GOV.UK API, embeds them with sentence-transformers (all-MiniLM-L6-v2), stores in FAISS/ChromaDB, and generates answers with source attribution through a Streamlit chat interface.
Local Peer Review
Automated academic peer-review generator that verifies citations, runs adversarial probes, and produces structured referee reports. Supports Ollama, OpenAI, Groq, and LM Studio backends with a Gradio web interface.
Resume Customizer
Automatically tailors LaTeX resumes to match job descriptions using the Google Gemini API, with structured output parsing and PDF generation.
LaTeX for LLM
Converts LaTeX documents into a typed graph for targeted LLM context retrieval — graph-based queries cut prompt size by ~54% on average (up to ~80% for focused queries). Features cross-reference resolution, inline math preservation, interactive force-directed graph visualization, and a Claude MCP server.
arXiv RSS Filter
Streamlit app that filters and curates arXiv preprint RSS feeds, helping researchers stay on top of relevant new papers in their field.
// Machine Learning
UK Accident Severity Classification
Dual-strategy LightGBM classifiers on 104K DfT STATS19 collisions merged with 190K vehicle records (42 features). Severe-optimized model (0.527 macro recall, 66.8% accuracy) with threshold tuning for adjustable recall-accuracy tradeoff. Results benchmarked against published studies on the same dataset.
RL Traffic Signal Control
Deep RL agent that learns adaptive signal timing policies for a single intersection using real traffic flow data from the Transport for London API, replacing fixed-cycle control with a learned policy trained via PyTorch.
Deep Reinforcement Learning
Implementations from Deep RL in Action covering policy gradients, Q-learning, and multi-agent environments using PyTorch.
// Quantitative Finance
Risk Management Toolkit
End-to-end risk analytics pipeline: raw price data is denoised with Kalman filters and wavelet transforms, fed into technical indicators (MACD, Bollinger, PSAR, stochastic oscillators), then evaluated via historical & parametric VaR, CAPM beta, and Sharpe/Treynor/Jensen performance metrics.
Time Series Forecasting
Comparative equity modeling: ARIMA for trend forecasting, GARCH for volatility clustering, CCC-GARCH for multi-asset portfolio-level forecasting, and Prophet for seasonality. Includes rolling & fixed window evaluation, bootstrap simulation, and model diagnostics on Yahoo Finance data.
Backtesting Framework
Strategy backtesting engine built on VectorBT's vectorized execution, progressing from moving-average crossover strategies to volatility-regime-aware ML signals incorporating GARCH-estimated conditional variance.
ML Trading Signals
Momentum-based trading signal generation using MA ribbons, distance metrics, correlation-based signals, and physics-inspired momentum measurements.
AI for Trading
Udacity nanodegree projects: momentum strategy, breakout strategy, and smart beta portfolio optimization with performance evaluation.
Mutual Fund Analyzer
Live Streamlit Cloud app that ingests Morningstar fund data to build multi-fund portfolios and analyze them across sectoral distribution, scheme comparison, valuations, and top company exposure.
NSE Stocks Dashboard
Streamlit dashboard fetching live data from the NSE API for real-time stock analysis, price monitoring, and technical charting of equities listed on the National Stock Exchange of India.
// Astrophysics Research
AGN & Local Environment in HR5
Investigated effects of local environment on Active Galactic Nuclei activity using the Horizon Run 5 cosmological simulation with Python-based feature extraction pipelines on terabyte-scale data.
Dust Radiative Transfer (HR5)
Modified the Powderday radiative transfer code for Horizon Run 5 AMR data. Built pipelines for galaxy selection, sub-cube extraction from HDF5 snapshots, and SED computation with MPI parallelization.
Cosmic Web Analysis at Gigaparsec Scales
Developed a multi-block approach for the DisPerSE cosmic web finder enabling analysis at unprecedented gigaparsec scales.
Ram Pressure Stripping Models
Analytical and simulation-based study of ram pressure stripping dependence on orbital parameters of galaxies.
