←Backthu-ml/SageAttention0Copy as MarkdownView on GitHub↗3,425 stars·434 forks·Cuda·Apache-2.0·0 viewsarxiv.org/abs/2410.02367↗SageAttentionFeaturesAttention Optimization - Accurate 8-bit attention for plug-and-play acceleration.