Llm Scaler

LLM Scaler is an GenAI solution for text generation, image generation, video generation etc. running on Intel® Arc™ Pro B60 and B70 GPUs. LLM Scalar leverages standard frameworks such as vLLM, ComfyUI, SGLang Diffusion, Xinference etc and ensures the best performance for State-of-Art GenAI…

ai-dynamo/dynamo

6,112View on GitHub

Dynamo is a distributed inference orchestration platform designed for large language models. It functions as a system to coordinate prefill and decode phases across GPU nodes, utilizing a multi-backend runtime adapter to connect engines like vLLM and TensorRT-LLM through a unified block-oriented memory interface. An OpenAI-compatible API server provides the frontend for integration with existing tools and clients. The project is distinguished by its disaggregated serving architecture, which separates prompt processing and token generation onto independent GPU pools to optimize throughput and

alexrozanski/LlamaChat

1,510View on GitHub

Chat with your favourite LLaMA models in a native macOS app

aphrodite-engine/aphrodite-engine

1,771View on GitHub

abetlen/llama-cpp-python

9,993View on GitHub

llama-cpp-python provides a Python interface for the llama.cpp library, enabling the execution of large language models with hardware acceleration. It functions as a GGUF model loader and a structured text generator capable of running inference servers and multimodal runtimes for processing both text and image inputs. The project distinguishes itself through a local inference server that exposes model capabilities via an OpenAI-compatible web API. It supports advanced execution techniques including speculative decoding, weight quantization, and layer-based GPU offloading to manage memory acro

ai-dynamo/dynamo

6,112View on GitHub

alexrozanski/LlamaChat

1,510View on GitHub

Chat with your favourite LLaMA models in a native macOS app

aphrodite-engine/aphrodite-engine

1,771View on GitHub

abetlen/llama-cpp-python

9,993View on GitHub

intelllm-scaler

Features

Open-source alternatives to Llm Scaler

ai-dynamo/dynamo

alexrozanski/LlamaChat

aphrodite-engine/aphrodite-engine

abetlen/llama-cpp-python

Star history

Open-source alternatives to Llm Scaler

ai-dynamo/dynamo

alexrozanski/LlamaChat

aphrodite-engine/aphrodite-engine

abetlen/llama-cpp-python