What Formal Languages Can Transformers Express? A Survey
November 30, 2025 | 327 words | Author: Tan Ke
ATLAS: Learning to Optimally Memorize the Context at Test Time
November 29, 2025 | 628 words | Author: Tan Ke
Solving olympiad geometry without human demonstrations
November 28, 2025 | 522 words | Author: Tan Ke
Formal Mathematical Reasoning: A New Frontier in AI
November 27, 2025 | 347 words | Author: Tan Ke
Titans: Learning to Memorize at Test Time
November 26, 2025 | 916 words | Author: Tan Ke
RoFormer: Enhanced Transformer with Rotary Position Embedding
November 25, 2025 | 348 words | Author: Tan Ke
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
November 24, 2025 | 360 words | Author: Tan Ke
Mastering the game of Go without human knowledge
November 24, 2025 | 342 words | Author: Tan Ke
Disentangling Light Fields for Super-Resolution and Disparity Estimation
November 19, 2025 | 1379 words | Author: Tan Ke
Hyena Hierarchy: Towards Larger Convolutional Language Models
November 18, 2025 | 516 words | Author: Tan Ke
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
November 17, 2025 | 397 words | Author: Tan Ke
A survey for light field super-resolution
November 14, 2025 | 341 words | Author: Tan Ke
Efficiently Modeling Long Sequences with Structured State Spaces
November 11, 2025 | 930 words | Author: Tan Ke
Retentive Network: A Successor to Transformer for Large Language Models
November 11, 2025 | 472 words | Author: Tan Ke
Exploiting Spatial and Angular Correlations With Deep Efficient Transformers for Light Field Image Super-Resolution
November 10, 2025 | 1071 words | Author: Tan Ke
Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation
November 9, 2025 | 1312 words | Author: Tan Ke
Reference-Based Face Super-Resolution Using the Spatial Transformer
November 7, 2025 | 428 words | Author: Tan Ke
LMR: A Large-Scale Multi-Reference Dataset for Reference-based Super-Resolution
November 7, 2025 | 1157 words | Author: Tan Ke
Latent Diffusion Models
November 6, 2025 | 964 words | Author: Tan Ke
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
November 4, 2025 | 2299 words | Author: Tan Ke
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
November 3, 2025 | 1851 words | Author: Tan Ke
A Tutorial on Bayesian Optimization
November 1, 2025 | 3591 words | Author: Tan Ke