FlashAttention introduced in paper: FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness (Dao et al., 2022).
Predicate
introduced_in_paper
Object
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness (Dao et al., 2022)
Primary source · preprint · 2022-05-27
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness — arXiv (Dao, Fu, Ermon, Rudra, Ré)