What are the best open-source alternatives to Dalai?

30 open-source projects similar to cocktailpeanut/dalai, ranked by shared features. Top picks: pytorch/executorch, mistralai/mistral-inference, llmware-ai/llmware, llmquant/quant-wiki, mozilla-ocho/llamafile, jundot/omlx, getumbrel/llama-gpt, n4ze3m/page-assist, thudm/chatglm-6b, lostruins/koboldcpp.

Is pytorch/executorch a good alternative to Dalai?

ExecuTorch is a lightweight C++ runtime for deploying PyTorch models on mobile, embedded, and edge hardware. It provides an ahead-of-time compilation pipeline that exports, quantizes, and lowers model graphs into compact serialized programs, then executes them through a minimal runtime with hardwar…

Is mistralai/mistral-inference a good alternative to Dalai?

Mistral Inference is a library for running Mistral large language models on a GPU, generating text from prompts with token streaming. It loads pretrained model weights from local disk or a remote registry into GPU memory, then produces output tokens one by one for real-time display in interactive a…

Is llmware-ai/llmware a good alternative to Dalai?

llmware is a Python framework for AI agent orchestration and model management, designed to coordinate multi-model workflows and autonomous agents. It provides a unified model catalog and standardized interface to execute specialized language models for complex research, analysis, and structured dat…

Is llmquant/quant-wiki a good alternative to Dalai?

quant-wiki is a comprehensive knowledge base and structured reference for quantitative finance, financial engineering, and algorithmic trading. It serves as a centralized library of documentation covering mathematical models, financial instruments, and systematic trading strategies. The project in…

Is mozilla-ocho/llamafile a good alternative to Dalai?

llamafile is a model bundler and local runtime that packages large language models and their execution logic into single, portable executable files. It provides a distribution format for zero-installation local execution, allowing users to run models on various operating systems without managing ex…

Is jundot/omlx a good alternative to Dalai?

omlx is a local inference server designed to run large language models, vision models, and embedding models on Apple Silicon. It provides a private alternative to industry-standard AI endpoints by hosting a local API gateway that mirrors OpenAI and Anthropic specifications. The system distinguishe…

Is getumbrel/llama-gpt a good alternative to Dalai?

Llama-GPT is a self-hosted generative AI model runner that provides a private web interface for interacting with large language models. By executing these models directly on local hardware, it ensures that all intelligent assistance remains offline and independent of external cloud service provider…

Is n4ze3m/page-assist a good alternative to Dalai?

Page Assist is a browser-based AI integration tool that provides a sidebar interface for interacting with AI models while browsing the web. It focuses on privacy-focused chatting and web content analysis, allowing users to extract and query information from active webpages to receive context-aware…

Is thudm/chatglm-6b a good alternative to Dalai?

ChatGLM-6B is an open-source bilingual large language model designed for natural dialogue and text generation in both English and Chinese. It is structured as a dialogue model capable of tasks such as role-playing and information extraction. The project provides implementations for quantized langu…

Is lostruins/koboldcpp a good alternative to Dalai?

KoboldCPP is a local large language model inference engine and GGUF model runner designed to execute quantized models on personal hardware. It functions as a multimodal AI server and API gateway, providing OpenAI-compatible endpoints that allow third-party clients to interact with locally hosted mo…

Back to cocktailpeanut/dalai

Open-source alternatives to Dalai

30 open-source projects similar to cocktailpeanut/dalai, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Dalai alternative.

pytorch/executorch
pytorch/executorch
4,296View on GitHub
ExecuTorch is a lightweight C++ runtime for deploying PyTorch models on mobile, embedded, and edge hardware. It provides an ahead-of-time compilation pipeline that exports, quantizes, and lowers model graphs into compact serialized programs, then executes them through a minimal runtime with hardware acceleration and on-device large language model inference capabilities. The project distinguishes itself through a hardware accelerator delegate system that partitions model subgraphs and offloads computation to specialized backends including NPUs, GPUs, and DSPs from Apple, Arm, Intel, MediaTek,
Pythondeep-learningembeddedgpu
View on GitHub4,296
mistralai/mistral-inference
mistralai/mistral-inference
10,819View on GitHub
Mistral Inference is a library for running Mistral large language models on a GPU, generating text from prompts with token streaming. It loads pretrained model weights from local disk or a remote registry into GPU memory, then produces output tokens one by one for real-time display in interactive applications. The library supports multimodal prompts that accept image URLs alongside text, enabling visual description and reasoning. It includes content safety guardrails that scan generated text against predefined policies to block or flag policy violations. For structured interactions, it provid
Jupyter Notebookllmllm-inferencemistralai
View on GitHub10,819
llmware-ai/llmware
llmware-ai/llmware
14,838View on GitHub
llmware is a Python framework for AI agent orchestration and model management, designed to coordinate multi-model workflows and autonomous agents. It provides a unified model catalog and standardized interface to execute specialized language models for complex research, analysis, and structured data generation. The project distinguishes itself through its heavy emphasis on local execution and quantized inference, allowing models to run on private infrastructure using CPU, GPU, and NPU acceleration via runtimes like ONNX and OpenVino. It features a specialized ability to translate natural lang
Python
View on GitHub14,838

Open-source alternatives to Dalai

pytorch/executorch

mistralai/mistral-inference

llmware-ai/llmware

LLMQuant/quant-wiki

Mozilla-Ocho/llamafile

jundot/omlx

getumbrel/llama-gpt

n4ze3m/page-assist

THUDM/ChatGLM-6B

LostRuins/koboldcpp

OpenBMB/MiniCPM

RunanywhereAI/runanywhere-sdks

ggerganov/llama.cpp

kserve/kserve

mlc-ai/web-llm

strands-agents/sdk-python

Kilo-Org/kilocode

PromtEngineer/localGPT

QuentinFuxa/WhisperLiveKit

yakami129/VirtualWife

serge-chat/serge

dusty-nv/jetson-inference

TheR1D/shell_gpt

jina-ai/node-DeepResearch

docker/genai-stack

jaymody/picoGPT

menloresearch/jan

ollama/ollama-python

InternLM/lmdeploy

Arthur-Ficial/apfel