← Back to Home
Notes & Learning
Soumyadeep Roy
📘 Reinforcement Learning
Understanding the Bellman Equation
Dynamic Programming Methods in RL (Value Iteration & Policy Iteration)
Model Free RL : Monte Carlo & TD learning
Function Approximation in RL
Stochastic Approximation for RL: Tools & Techniques
Policy Gradient & RLHF
Actor–Critic Methods
Offline RL
📊 Machine Learning
Concept Map in ML
Decision Theory
Density Estimation & Probabilistic Modeling
🤖 LLM & Transformers
Interactive Transformer Simulator
🎨 Generative Models
Diffusion Model Simulator