Posts tagged HSTU Function Representation

Support Blackwell with FFA_FA4 Backend

07 February 2026

Before the release of MagiAttention-v1.1.0, MagiAttention supported only Hopper GPUs, because its attention kernel backend, Flex-Flash-Attention (FFA), is built on the open-source Flash-Attention 3 (FA3) [Shah et al., 2024], which is tailored to the SM90 compute capability.
