MTech (Research), Department of Computer Science & Automation (CSA), Indian Institute of Science (Bengaluru)
I am an MTech (Research) student at the Department of Computer Science & Automation (CSA), IISc Bangalore, working on theoretical Reinforcement Learning, including offline RL, regret analysis, and sample complexity. I am also interested in LLM alignment and RLHF and, more recently, GenAI.
Department of CSA, Indian Institute of Science (IISc), Bangalore
Govt. College of Engineering & Ceramic Technology, Kolkata
Working on RL, in particular offline RL.
Conducted tutorials, invigilated examinations, and prepared problem sets, assignments, and exam papers.
Worked at the Centre for Neuroscience (CNS), IISc.
Spaced repetition Telegram bot integrated with Google Calendar to automate daily problem revisions.
GitHub
Built a reproducible framework for efficient LLM fine-tuning and alignment using LoRA, QLoRA, Reward Modeling, and Direct Preference Optimization (DPO).
GitHub
Machine learning platform for predicting employee attrition using ensemble models, explainability, and real-time inference.
GitHub
AI-powered research operating system with LLM councils, experiment orchestration, and automated infrastructure management.
GitHub
Interactive guide mapping 19 core DSA patterns and dependencies across 7 learning tiers.
GitHub
Sample Efficient Active Algorithms for Offline Reinforcement Learning
[arxiv.org]
Mailing Address:
Department of Computer Science & Automation (CSA)
Indian Institute of Science
Bangalore - 560012
Karnataka, India