# lmcache/lmcache

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/lmcache-lmcache).**

6,909 stars · 904 forks · Python · apache-2.0

## Links

- GitHub: https://github.com/LMCache/LMCache
- Homepage: https://lmcache.ai/
- awesome-repositories: https://awesome-repositories.com/repository/lmcache-lmcache.md

## Topics

`amd` `cuda` `fast` `inference` `kv-cache` `llm` `pytorch` `rocm` `speed` `vllm`

## Tags

### Part of an Awesome List

- [KV Cache Management](https://awesome-repositories.com/f/awesome-lists/ai/kv-cache-management.md) — Fast context loading and knowledge fusion for LLMs.
- [Memory Management](https://awesome-repositories.com/f/awesome-lists/ai/memory-management.md) — KV cache layer for accelerating LLM inference.
- [Model Serving & Deployment](https://awesome-repositories.com/f/awesome-lists/ai/model-serving-deployment.md) — Accelerates LLM inference via KV cache management.
