This is a Triton implementation of the Flash Attention v2 algorithm from Tri Dao (https://tridao.me/publications/flash2/flash2.pdf) ...
Tri Dao Flash Attention v2 的 Triton 实现 (https://tridao.me/publications/flash2/flash2.pdf) ...