awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPPrivacyTerms

1 repo

Awesome GitHub RepositoriesAttention Backends

Optimized computational backends specifically designed to accelerate the attention mechanisms used in transformer models.

Explore 1 awesome GitHub repository matching artificial intelligence & ml · Attention Backends. Refine with filters or upvote what's useful.

Awesome Attention Backends GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • vllm-project/vllm

    vllm-project/vllm

    70,745GitHubView on GitHub↗

    vLLM is a high-throughput inference engine designed for the efficient serving and execution of large language models. It functions as a production-ready distributed model server, providing standard API protocols for online serving while also supporting offline batch processing. The system is built to maximize token gen

    Pythonamdblackwellcuda