What are the best open-source alternatives to Gemma?

30 open-source projects similar to google-deepmind/gemma, ranked by shared features. Top picks: nndl/llm-beginner, qwenlm/qwen-7b, thudm/chatglm3, thudm/glm-4, thudm/chatglm2-6b, pytorch/torchtune, nvidia/isaac-gr00t, openai/gpt-oss, bitsandbytes-foundation/bitsandbytes, deep-floyd/if.

Is nndl/llm-beginner a good alternative to Gemma?

This project is a collection of educational resources and technical guides focused on the development and implementation of large language models. It provides a comprehensive curriculum covering transformer architectures, training methods, and deployment strategies. The materials provide detailed…

Is qwenlm/qwen-7b a good alternative to Gemma?

Qwen-7B is a pretrained causal language model designed for natural language generation, text processing, and complex reasoning tasks. It is available as an instruction-tuned model optimized for conversational interactions and a tool-use model capable of executing function calls and interacting with…

Is thudm/chatglm3 a good alternative to Gemma?

ChatGLM3 is an open-weights large language model designed for bilingual conversational interactions in English and Chinese. It functions as a tool-augmented system capable of calling external functions and executing internal code to resolve complex tasks. The model utilizes four-bit quantization t…

Is thudm/glm-4 a good alternative to Gemma?

GLM-4 is an open-weights large language model designed as a multimodal chat system. It functions as a reasoning-focused and multilingual model capable of processing and generating responses across text and visual data types. The model is distinguished by its function-calling capabilities, allowing…

Is thudm/chatglm2-6b a good alternative to Gemma?

ChatGLM2-6B is an open-weight large language model designed for natural language conversations and text generation in both English and Chinese. It functions as a bilingual chat model capable of processing and maintaining coherence across text sequences up to 32K tokens. The model is optimized for…

Is pytorch/torchtune a good alternative to Gemma?

Torchtune is a PyTorch-native library for fine-tuning, aligning, and quantizing large language models. It provides a configurable training pipeline orchestrated through YAML recipes, with CLI overrides and component swapping, distributed training via FSDP2, memory optimizations, and parameter-effic…

Is nvidia/isaac-gr00t a good alternative to Gemma?

nvidia/isaac-gr00t is an open-source alternative to Gemma.

Is openai/gpt-oss a good alternative to Gemma?

gpt-oss is an open-weight large language model and reasoning engine designed for complex reasoning and agentic workflows. It functions as an AI agent framework and model serving API, allowing for local deployment and the hosting of standardized interfaces to expose model completions and internal re…

Is bitsandbytes-foundation/bitsandbytes a good alternative to Gemma?

bitsandbytes is a deep learning quantization tool and library designed to reduce the memory footprint of large language models. It serves as a GPU memory optimizer and quantization framework, compressing model weights and features to 8-bit and 4-bit precision to enable inference and training on har…

Is deep-floyd/if a good alternative to Gemma?

IF is a text-to-image diffusion system that translates natural language descriptions into visual imagery. The project provides a generative pipeline for creating images, an inpainting tool for modifying specific image sections, and a super-resolution upscaler to increase pixel density and clarity.…


Open-source alternatives to Gemma

30 open-source projects similar to google-deepmind/gemma, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Gemma alternative.
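As a rough illustration of what "ranked by how many features they have in common" could mean, the sketch below counts the feature tags a candidate shares with a reference project and sorts by that count. The feature tags, project names, and the function `rank_by_shared_features` are hypothetical and invented for this example; they are not the site's actual data or ranking method.

```python
def rank_by_shared_features(reference, candidates):
    """Return (name, shared_count) pairs sorted by overlap with `reference`, highest first."""
    scored = [(name, len(reference & features)) for name, features in candidates.items()]
    # Sort by descending overlap; break ties alphabetically for a deterministic order.
    return sorted(scored, key=lambda pair: (-pair[1], pair[0]))


# Hypothetical feature tags, invented purely for illustration.
gemma_features = {"open-weights", "text-generation", "fine-tuning", "quantization"}
candidates = {
    "example/chat-model": {"open-weights", "text-generation", "fine-tuning"},
    "example/quantizer": {"quantization"},
    "example/image-model": {"image-generation"},
}

ranking = rank_by_shared_features(gemma_features, candidates)
for name, shared in ranking:
    print(f"{name}: {shared} shared features")
```

A plain overlap count favors projects that list many features; a normalized measure such as Jaccard similarity would be one alternative if that bias matters.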

nndl/llm-beginner
Stars: 6,421
This project is a collection of educational resources and technical guides focused on the development and implementation of large language models. It provides a comprehensive curriculum covering transformer architectures, training methods, and deployment strategies. The materials provide detailed instructions for building autonomous agents using reasoning loops and tool integration, as well as guides for fine-tuning models through supervised learning and preference optimization. It also includes tutorials for constructing retrieval augmented generation pipelines and implementing transformer m
Language: Python. Topics: agent, fudannlp, llm.

QwenLM/Qwen-7B
Stars: 21,343
Qwen-7B is a pretrained causal language model designed for natural language generation, text processing, and complex reasoning tasks. It is available as an instruction-tuned model optimized for conversational interactions and a tool-use model capable of executing function calls and interacting with external APIs. The project provides a quantized version of the model to reduce GPU memory usage and supports the development of autonomous agents that can execute code and perform functions to complete complex goals. The system covers a wide range of capabilities including model fine-tuning throug
Language: Python.

THUDM/ChatGLM3
Stars: 13,676
ChatGLM3 is an open-weights large language model designed for bilingual conversational interactions in English and Chinese. It functions as a tool-augmented system capable of calling external functions and executing internal code to resolve complex tasks. The model utilizes four-bit quantization to reduce memory requirements, enabling inference on consumer hardware and diverse processing units including GPUs and CPUs. It features an expanded context window for processing and summarizing long documents and includes a supervised fine-tuning pipeline for adapting the model to specialized domains
Language: Python.

Full list of the 30 projects

nndl/llm-beginner

QwenLM/Qwen-7B

THUDM/ChatGLM3

THUDM/GLM-4

THUDM/ChatGLM2-6B

pytorch/torchtune

NVIDIA/Isaac-GR00T

openai/gpt-oss

bitsandbytes-foundation/bitsandbytes

deep-floyd/IF

OpenBMB/MiniCPM

meta-pytorch/torchtune

ml-explore/mlx-examples

NVlabs/Sana

datawhalechina/tiny-universe

OpenNMT/OpenNMT-py

h2oai/h2o-llmstudio

zai-org/CogVLM

TingsongYu/PyTorch_Tutorial

facebookresearch/fairseq

meta-llama/llama-models

microsoft/vscode-copilot-chat

microsoft/unilm

LostRuins/koboldcpp

skyzh/tiny-llm

google-research/big_vision

deepseek-ai/DeepSeek-VL2

decodingai-magazine/llm-twin-course

facebookresearch/metaseq

imoneoi/openchat