Posts tagged Attention Sink

17 November 2025 - Support Learnable Attention Sink

Posts tagged Attention Slice Representation

21 April 2025 - MagiAttention

Posts tagged Benchmark

19 October 2025 - Long-Context Attention Benchmark

Posts tagged Blackwell

07 February 2026 - Support Blackwell with FFA_FA4 Backend

19 October 2025 - Long-Context Attention Benchmark

Posts tagged Computation Load-Balance

21 April 2025 - MagiAttention

Posts tagged DSA

25 January 2026 - Optimize Sparse Attention in FFA (Coming Soon)

Posts tagged DeepEP

24 January 2026 - Support Native Group Collective

Posts tagged Group Collective

24 January 2026 - Support Native Group Collective

21 April 2025 - MagiAttention

Posts tagged Multi-Stage Overlap

21 April 2025 - MagiAttention

Posts tagged Muon

04 February 2026 - Support Muon QK-Clip

Posts tagged NSA

25 January 2026 - Optimize Sparse Attention in FFA (Coming Soon)

Posts tagged QK-Clip

04 February 2026 - Support Muon QK-Clip

Posts tagged Zero-Redundant Communication

21 April 2025 - MagiAttention