# intel/ipex-llm

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/intel-ipex-llm).**

8,836 stars · 1,427 forks · Python · Apache-2.0 · archived

## Links

- GitHub: https://github.com/intel/ipex-llm
- awesome-repositories: https://awesome-repositories.com/repository/intel-ipex-llm.md

## Description

Accelerates local LLM inference and fine-tuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU hardware, for example a local PC with an iGPU and NPU, or a discrete GPU such as Arc, Flex, or Max. Integrates with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, and others.

## Tags

### Part of an Awesome List

- [Model Serving & Deployment](https://awesome-repositories.com/f/awesome-lists/ai/model-serving-deployment.md) — Runs LLMs on Intel hardware with low latency.
