Reinforcement-Learning

Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

Paper-reading notes: AlphaZero

Paper-reading notes: AlphaGo Zero

Paper-reading notes: DeepSeek-R1 - Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper-reading notes: Mastering the game of Go with MCTS and Deep Neural Networks