Learning Transferable Visual Models From Natural Language Supervision

Paper-reading notes: CLIP
January 1, 2026 | 888 words | Author: Tan Ke

OpenVLA: An Open-Source Vision-Language-Action Model

Paper-reading notes: OpenVLA
December 12, 2025 | 312 words | Author: Tan Ke