Rethinking Attention with Performers

Paper-reading notes: Performers
December 3, 2025 | 499 words | Author: Tan Ke

What Formal Languages Can Transformers Express? A Survey

Paper-reading notes: What Formal Languages Can Transformers Express? A Survey
November 30, 2025 | 327 words | Author: Tan Ke

ATLAS: Learning to Optimally Memorize the Context at Test Time

Paper-reading notes: ATLAS
November 29, 2025 | 628 words | Author: Tan Ke

RoFormer: Enhanced Transformer with Rotary Position Embedding

Paper-reading notes: RoFormer
November 25, 2025 | 348 words | Author: Tan Ke

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

Paper-reading notes: ViT
November 3, 2025 | 1851 words | Author: Tan Ke

A Bridging Model for Parallel Computation

Paper-reading notes: A Bridging Model for Parallel Computation
October 10, 2025 | 201 words | Author: Tan Ke

Attention Is All You Need

Paper-reading notes: Attention is All You Need
October 1, 2025 | 1268 words | Author: Tan Ke