DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper-reading notes: DeepSeek-R1 - Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
November 4, 2025 · 2299 words