← Back to Home

Notes & Learning

Soumyadeep Roy

📘 Reinforcement Learning

Understanding the Bellman Equation
Dynamic Programming Methods in RL (Value Iteration & Policy Iteration)
Model Free RL : Monte Carlo & TD learning
Function Approximation in RL
Stochastic Approximation for RL: Tools & Techniques
Policy Gradient & RLHF
Actor–Critic Methods
Offline RL

📊 Machine Learning

Concept Map in ML
Decision Theory
Density Estimation & Probabilistic Modeling

🤖 LLM & Transformers

Interactive Transformer Simulator

🎨 Generative Models

Diffusion Model Simulator