Posts by Zewei Tao

Attention Engine for Inference

The upcoming blog post will be released in the near future. Stay tuned!

Read more ...


Support Blackwell with FFA_FA4 Backend

The upcoming blog post will be released in the near future. Stay tuned!

Read more ...


Optimize Sparse Attention in FFA

The upcoming blog post will be released in the near future. Stay tuned!

Read more ...


Support Native Group Collective Based on DeepEP

The upcoming blog post will be released in the near future. Stay tuned!

Read more ...


Dynamic Attention Solver

The upcoming blog post will be released in the near future. Stay tuned!

Read more ...


Long-Context Attention Benchmark

From Kernel Efficiency to Distributed Scalability

Read more ...


MagiAttention

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Mask Training

Read more ...