awesome-repositories.com

© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io

Compute Kernels — DevOps & Infrastructure

We curate 1 GitHub repository matching DevOps & Infrastructure · Compute Kernels. Refine with filters or upvote what's useful.

vllm-project/vllm
70,745 · GitHub
vLLM is a high-throughput inference engine designed for the efficient serving and execution of large language models. It functions as a production-ready distributed model server, providing standard API protocols for online serving while also supporting offline batch processing. The system is built to maximize token gen
Python · amd · blackwell · cuda