What are the best open-source alternatives to Gemma Pytorch?

30 open-source projects similar to google/gemma_pytorch, ranked by shared features. Top picks: mistralai/mistral-inference, cstankonrad/long_llama, meta-llama/llama-recipes, zai-org/glm-4, ymcui/chinese-llama-alpaca-2, modeltc/lightllm, huggingface/text-embeddings-inference, macpaw/openai, qwenlm/qwen2.5, zihangdai/xlnet.

Is mistralai/mistral-inference a good alternative to Gemma Pytorch?

Mistral Inference is a library for running Mistral large language models on a GPU, generating text from prompts with token streaming. It loads pretrained model weights from local disk or a remote registry into GPU memory, then produces output tokens one by one for real-time display in interactive a…

Is cstankonrad/long_llama a good alternative to Gemma Pytorch?

Long Llama is a transformer-based language model and fine-tuning framework designed to process and maintain logical coherence across input sequences that significantly exceed standard length limits. By utilizing a focused transformer architecture, the project enables models to handle massive docume…

Is meta-llama/llama-recipes a good alternative to Gemma Pytorch?

This project is a collection of reference implementations and recipes for deploying, fine-tuning, and running inference with Llama large language models. It serves as a toolkit and implementation guide for adapting pre-trained models to specific tasks and domain-specific datasets. The repository p…

Is zai-org/glm-4 a good alternative to Gemma Pytorch?

GLM-4 is a large language model and fine-tuning framework designed for human-like text production, complex reasoning, and multilingual conversation. It functions as a multimodal system capable of processing high-resolution visual content and as a long-context model designed to analyze documents wit…

Is ymcui/chinese-llama-alpaca-2 a good alternative to Gemma Pytorch?

This project provides a Chinese large language model based on the LLaMA architecture. It is an instruction-tuned model optimized for natural language processing and multi-turn conversations in Chinese. The system includes a framework for parameter-efficient fine-tuning using low-rank adaptation an…

Is modeltc/lightllm a good alternative to Gemma Pytorch?

LightLLM is a high-performance serving framework for deploying and executing large language models. It functions as a multi-GPU inference engine and server capable of handling dense architectures, mixture-of-experts designs, and multimodal models that process both text and images. The system is di…

Is huggingface/text-embeddings-inference a good alternative to Gemma Pytorch?

Text Embeddings Inference is a high-performance inference server designed to host text embedding and sequence classification models as scalable API endpoints. It provides a vector embedding API to convert text into dense representations and a cross-encoder reranking server for scoring the relevance…

Is macpaw/openai a good alternative to Gemma Pytorch?

This is an asynchronous Swift client library for calling OpenAI’s API across Apple platforms. It provides native access to chat completions, image generation and editing, speech synthesis and transcription, text embeddings, and content moderation through a single interface built on Swift’s async-aw…

Is qwenlm/qwen2.5 a good alternative to Gemma Pytorch?

Qwen2.5 is a suite of large language model foundation models designed for natural language generation, code production, and complex mathematical reasoning. The project encompasses a multilingual language model capable of processing dozens of languages and a specialized code generation model for tec…

Is zihangdai/xlnet a good alternative to Gemma Pytorch?

This project is a natural language processing framework focused on a generalized autoregressive pretrainer designed for unsupervised language representation. It implements a language model that combines permutation-based training with a Transformer-XL backbone to function as a long-context text pro…

Back to google/gemma_pytorch

Open-source alternatives to Gemma Pytorch

30 open-source projects similar to google/gemma_pytorch, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Gemma Pytorch alternative.

mistralai/mistral-inference
mistralai/mistral-inference
10,819View on GitHub
Mistral Inference is a library for running Mistral large language models on a GPU, generating text from prompts with token streaming. It loads pretrained model weights from local disk or a remote registry into GPU memory, then produces output tokens one by one for real-time display in interactive applications. The library supports multimodal prompts that accept image URLs alongside text, enabling visual description and reasoning. It includes content safety guardrails that scan generated text against predefined policies to block or flag policy violations. For structured interactions, it provid
Jupyter Notebookllmllm-inferencemistralai
View on GitHub10,819
cstankonrad/long_llama
CStanKonrad/long_llama
1,465View on GitHub
Long Llama is a transformer-based language model and fine-tuning framework designed to process and maintain logical coherence across input sequences that significantly exceed standard length limits. By utilizing a focused transformer architecture, the project enables models to handle massive documents or entire books by training attention layers to track distant tokens. The framework distinguishes itself through specialized attention mechanisms that allow for the processing of hundreds of thousands of tokens. It incorporates memory-efficient inference techniques, such as key-value caching and
Python
View on GitHub1,465
meta-llama/llama-recipes
meta-llama/llama-recipes
18,379View on GitHub
This project is a collection of reference implementations and recipes for deploying, fine-tuning, and running inference with Llama large language models. It serves as a toolkit and implementation guide for adapting pre-trained models to specific tasks and domain-specific datasets. The repository provides frameworks for developing retrieval augmented generation pipelines to ground model responses in external data. It includes guides for executing quantized inference to reduce memory usage and increase processing speed. The toolkit covers a broad range of capabilities including parameter-effic
Jupyter Notebook
View on GitHub18,379

Open-source alternatives to Gemma Pytorch

mistralai/mistral-inference

CStanKonrad/long_llama

meta-llama/llama-recipes

zai-org/GLM-4

ymcui/Chinese-LLaMA-Alpaca-2

ModelTC/LightLLM

huggingface/text-embeddings-inference

MacPaw/OpenAI

QwenLM/Qwen2.5

zihangdai/xlnet

01-ai/Yi

shibing624/text2vec

THUDM/ChatGLM2-6B

InternLM/InternLM

NVIDIA-NeMo/Guardrails

facebookresearch/ParlAI

QwenLM/CodeQwen1.5

EricLBuehler/mistral.rs

openai-php/client

spring-projects/spring-ai

zai-org/ChatGLM2-6B

NVIDIA/Isaac-GR00T

QwenLM/Qwen-7B

VoltAgent/voltagent

meta-llama/llama-models

katanemo/archgw

crmne/ruby_llm

pytorch/executorch

intel/ipex-llm

zai-org/GLM-4.5