≃
Calvin's Notebook
home
archive
notes
☀
🌙
Behavioral Cloning and Imitation Learning
Q-Learning and Value-Based Reinforcement Learning
Rejection Sampling as Reinforcement Learning
The Landscape of Reinforcement Learning
RL for Query-Based Search Ranking
RL Techniques for Value Modeling in Search and Recommendation Ranking