Learning Transformer Programs

Paper-reading notes: Learning Transformer Programs
December 15, 2025 · 339 words

ALTA: Compiler-Based Analysis of Transformers

Paper-reading notes: ALTA
December 9, 2025 · 720 words

Tracr: Compiled Transformers as a Laboratory for Interpretability

Paper-reading notes: Tracr
December 8, 2025 · 59 words

Thinking Like Transformers

Paper-reading notes: RASP
December 7, 2025 · 273 words

On the Representational Capacity of Neural Language Models with Chain-of-Thought Reasoning

Paper-reading notes: On the Representational Capacity of Neural Language Models with Chain-of-Thought Reasoning
December 1, 2025 · 462 words

What Formal Languages Can Transformers Express? A Survey

Paper-reading notes: What Formal Languages Can Transformers Express? A Survey
November 30, 2025 · 327 words

Solving olympiad geometry without human demonstrations

Paper-reading notes: AlphaGeometry
November 28, 2025 · 522 words

Formal Mathematical Reasoning A New Frontier in AI

Paper-reading notes: Formal Mathematical Reasoning A New Frontier in AI
November 27, 2025 · 347 words

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper-reading notes: DeepSeek-R1 - Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
November 4, 2025 · 2299 words

Mastering the game of Go with MCTS and Deep Neural Networks

Paper-reading notes: Mastering the game of Go with MCTS and Deep Neural Networks
October 24, 2025 · 2246 words