Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

Paper-reading notes: AlphaZero
November 24, 2025 · 360 words

Mastering the game of Go without human knowledge

Paper-reading notes: AlphaGo Zero
November 24, 2025 · 342 words

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper-reading notes: DeepSeek-R1 - Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
November 4, 2025 · 2299 words

Mastering the game of Go with MCTS and Deep Neural Networks

Paper-reading notes: Mastering the game of Go with MCTS and Deep Neural Networks
October 24, 2025 · 2246 words