# microsoft/vattention

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/microsoft-vattention).**

495 stars · 42 forks · C · MIT

## Links

- GitHub: https://github.com/microsoft/vattention
- awesome-repositories: https://awesome-repositories.com/repository/microsoft-vattention.md

## Description

Dynamic Memory Management for Serving LLMs without PagedAttention

## Tags

### Part of an Awesome List

- [Inference Serving Engines](https://awesome-repositories.com/f/awesome-lists/ai/inference-serving-engines.md) — Dynamic memory management for serving without paged attention.
