awesome-repositories.comBlog

© 2026 Bringes Technology SRL·VAT RO45896025·hello@awesome-repositories.com

MCP Blog Curated searches Sitemap Privacy Terms

FasterTransformer | Awesome Repository

NVIDIAFasterTransformer

0

View on GitHub↗

6,424 stars·935 forks·C++·Apache-2.0·0 views

FasterTransformer

Features

Inference and Serving - NVIDIA framework for accelerated LLM inference.
Mixture of Experts - Optimizes MoE model execution for cloud-scale production.
Model Quantization Tools - Optimized transformer implementation for cloud-scale production.
Transformer Implementations - Optimized transformer implementation for high-performance inference.

AI search

Explore more awesome repositories

Describe what you need in plain English — the AI ranks thousands of curated open-source projects by relevance.

Start searching with AI

Transformer related optimization, including BERT, GPT