Paper Reading
- <2023-10-29>Woodpecker: Hallucination Correction for Multimodal Large Language Models
- <2023-06-30>AdaPlanner & LLM Weights
- <2023-05-24>Diffusion Models and RL
- <2022-11-18>RL with Causal Reasoning
- <2022-10-14>Factored Adaptation for Non-stationary RL
- <2022-03-25>RL and Language Models
- <2020-07-26>Background and decision-time planning
- <2020-07-06>Model-based RL with uncertainty
- <2018-12-26>MCTS Introduction
Book Reading
- Reinforcement Learning: An Introduction (Richard S. Sutton and Andrew G. Barto)