InSpatio-WorldFM: An Open-Source Real-Time Generative Frame Model

Paper-reading notes: InSpatio-WorldFM
March 24, 2026 | 775 words | Author: Tan Ke

Evolution and Ablation of Robotic World Models

World Models: Paper | Homepage. There is an interactive loop between the agent and the environment: the agent observes the environment and takes an action in response, and the environment then changes accordingly. The agent model can be viewed as the brain of the agent: it is the overall decision-making system that enables the agent to perceive the environment, maintain temporal context, and choose actions. A typical agent model has three components, each a model in its own right: ...

March 22, 2026 | 5346 words | Author: Tan Ke

A Review of Robbyant’s Early-2026 Work

Robbyant is a company under Ant Group, dedicated to building the foundational platform for Embodied AI and bridging the gap between digital intelligence and the physical world. Since the company is still relatively new, I want to quickly review its recent work. In particular, I will study four embodied-intelligence models: a spatial perception model, a VLA model, a world model, and a video action model. The diagram on Robbyant's homepage reflects its vision for embodied intelligence: starting from sensory input, the system first builds spatial intelligence to understand the physical world, then relies on an action model to make decisions and interact with the environment, and finally improves through environmental reward. ...

March 16, 2026 | 3594 words | Author: Tan Ke

ATLAS: Learning to Optimally Memorize the Context at Test Time

Paper-reading notes: ATLAS
November 29, 2025 | 628 words | Author: Tan Ke

Mastering the game of Go without human knowledge

Paper-reading notes: AlphaGo Zero
November 24, 2025 | 342 words | Author: Tan Ke