←BackHanGuo97/flute0Copy as MarkdownView on GitHub↗390 stars·19 forks·C++·Apache-2.0·0 viewsarxiv.org/abs/2407.10960↗FluteFeaturesTensor Core Optimization - Fast matrix multiplication kernels for lookup table-quantized language models.