### Feature request See https://pytorch.org/blog/flash-decoding/#:~:text=Flash%2DDecoding%20works%20in%203,exp%20of%20the%20attention%20values. ### Motivation Flash decoding further improves attention mechanism compared to FlashAttention V2 on long context ### Your contribution None