←Backgoogle/flaxformerArchived0Copy as MarkdownView on GitHub↗368 stars·30 forks·Python·Apache-2.0·0 viewsFlaxformerFeaturesAttention Optimization - Generalized multi-query attention for transformer models.