What are the best open-source alternatives to Mistral Inference?

30 open-source projects similar to mistralai/mistral-inference, ranked by shared features. Top picks: google/gemma_pytorch, zai-org/glm-4, mistralai/mistral-src, macpaw/openai, cocktailpeanut/dalai, strands-agents/sdk-python, jaymody/picogpt, pytorch/executorch, eleutherai/gpt-neox, fastai/course22.

Is google/gemma_pytorch a good alternative to Mistral Inference?

The official PyTorch implementation of Google's Gemma models

Is zai-org/glm-4 a good alternative to Mistral Inference?

GLM-4 is a large language model and fine-tuning framework designed for human-like text production, complex reasoning, and multilingual conversation. It functions as a multimodal system capable of processing high-resolution visual content and as a long-context model designed to analyze documents wit…

Is mistralai/mistral-src a good alternative to Mistral Inference?

This project is a large language model inference library and framework designed to run models for text generation, problem solving, and coding assistance. It includes a multimodal framework for processing combined image and text inputs and a tool-use implementation that enables the execution of ext…

Is macpaw/openai a good alternative to Mistral Inference?

This is an asynchronous Swift client library for calling OpenAI’s API across Apple platforms. It provides native access to chat completions, image generation and editing, speech synthesis and transcription, text embeddings, and content moderation through a single interface built on Swift’s async-aw…

Is cocktailpeanut/dalai a good alternative to Mistral Inference?

The simplest way to run LLaMA on your local machine

Is strands-agents/sdk-python a good alternative to Mistral Inference?

This is an open-source Python SDK for building and orchestrating production-grade AI agents. It provides a unified framework for creating conversational agents that can use tools, maintain state, and coordinate across multiple language model providers including OpenAI, Anthropic, Google, Amazon Bed…

Is jaymody/picogpt a good alternative to Mistral Inference?

picoGPT is a lightweight, low-level runtime environment and inference engine designed to load pre-trained checkpoints and execute generative transformer model inference. It provides a minimal implementation of the generative pre-trained transformer architecture to facilitate local language model ex…

Is pytorch/executorch a good alternative to Mistral Inference?

ExecuTorch is a lightweight C++ runtime for deploying PyTorch models on mobile, embedded, and edge hardware. It provides an ahead-of-time compilation pipeline that exports, quantizes, and lowers model graphs into compact serialized programs, then executes them through a minimal runtime with hardwar…

Is eleutherai/gpt-neox a good alternative to Mistral Inference?

gpt-neox is a distributed training system and framework for building large-scale autoregressive language models. It implements the transformer architecture and provides a toolkit for training models with billions of parameters by distributing weights across compute clusters. The framework distingu…

Is fastai/course22 a good alternative to Mistral Inference?

This is a structured deep learning curriculum for programmers, delivered as a collection of Jupyter notebooks. It teaches the fundamentals of training neural networks for computer vision, natural language processing, tabular data analysis, and collaborative filtering using PyTorch and the fastai li…

Back to mistralai/mistral-inference

Open-source alternatives to Mistral Inference

30 open-source projects similar to mistralai/mistral-inference, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Mistral Inference alternative.

google/gemma_pytorch
google/gemma_pytorch
5,697View on GitHub
The official PyTorch implementation of Google's Gemma models
Pythongemmagooglepytorch
View on GitHub5,697
zai-org/glm-4
zai-org/GLM-4
7,058View on GitHub
GLM-4 is a large language model and fine-tuning framework designed for human-like text production, complex reasoning, and multilingual conversation. It functions as a multimodal system capable of processing high-resolution visual content and as a long-context model designed to analyze documents with a context window of up to one million tokens. The project differentiates itself through a function calling interface that enables AI agent development by connecting the model to external APIs and real-time web browsing. It includes specialized capabilities for generating functional programming cod
Pythonchatglmchatglm-6bglm
View on GitHub7,058
mistralai/mistral-src
mistralai/mistral-src
10,821View on GitHub
This project is a large language model inference library and framework designed to run models for text generation, problem solving, and coding assistance. It includes a multimodal framework for processing combined image and text inputs and a tool-use implementation that enables the execution of external functions based on model reasoning. The system features a distributed GPU inference engine that spreads large model workloads across multiple graphics processors to increase processing speed and meet memory requirements. It also provides containerized model deployment through pre-packaged imag
Jupyter Notebook
View on GitHub10,821
macpaw/openai
MacPaw/OpenAI
2,862View on GitHub
This is an asynchronous Swift client library for calling OpenAI’s API across Apple platforms. It provides native access to chat completions, image generation and editing, speech synthesis and transcription, text embeddings, and content moderation through a single interface built on Swift’s async-await concurrency model. The client supports structured output generation by constraining model responses to a provided JSON schema, and enables real-time consumption of generated text through streaming responses delivered as an AsyncSequence. It includes a thread-based conversation model for managing
Swiftaiopenaiopenai-api
View on GitHub2,862

Open-source alternatives to Mistral Inference

google/gemma_pytorch

zai-org/GLM-4

mistralai/mistral-src

MacPaw/OpenAI

cocktailpeanut/dalai

strands-agents/sdk-python

jaymody/picoGPT

pytorch/executorch

EleutherAI/gpt-neox

fastai/course22

OpenNMT/CTranslate2

facebookresearch/llama

NVIDIA-NeMo/Guardrails

ModelTC/LightLLM

guardrails-ai/guardrails

BerriAI/litellm

VoltAgent/voltagent

katanemo/archgw

NVIDIA/Isaac-GR00T

firebase/firebase-ios-sdk

claude-code-best/claude-code

baichuan-inc/Baichuan-7B

meta-llama/llama3

naklecha/llama3-from-scratch

QwenLM/Qwen

InternLM/InternLM

QwenLM/Qwen3

sgl-project/sglang

EleutherAI/lm-evaluation-harness

karpathy/llama2.c