What are the best open-source alternatives to TinyLlama?

30 open-source projects similar to jzhang38/tinyllama, ranked by shared features. Top picks: lightning-ai/litgpt, karpathy/llm.c, deepseek-ai/deepseek-llm, d2l-ai/d2l-en, karpathy/nanogpt, artidoro/qlora, microsoft/deepspeed, qwenlm/qwen-7b, stability-ai/stablelm, lm-sys/fastchat.

Is lightning-ai/litgpt a good alternative to TinyLlama?

LitGPT is a training and deployment framework for large language models, providing a suite of tools for pretraining, finetuning, quantizing, evaluating, and serving models within a production environment. It includes a dedicated training pipeline for adapting pretrained models to specific tasks, a…

Is karpathy/llm.c a good alternative to TinyLlama?

This project is a low-dependency engine designed for training large language models using native C and CUDA. It provides a bare-metal environment for tensor computation, allowing for the execution of neural network operations directly on hardware accelerators without the overhead of high-level soft…

Is deepseek-ai/deepseek-llm a good alternative to TinyLlama?

DeepSeek-LLM is a large language model and causal language model designed for natural language generation. It functions as a multi-lingual system capable of predicting the next token in a sequence to perform text completion and conversational generation. The model is specialized for logical reason…

Is d2l-ai/d2l-en a good alternative to TinyLlama?

This project is an educational platform and research toolkit designed to teach deep learning through a combination of mathematical theory, visual diagrams, and executable code. It provides a comprehensive environment for building, training, and evaluating neural networks, grounding complex concepts…

Is karpathy/nanogpt a good alternative to TinyLlama?

nanoGPT is a lightweight engine for training and fine-tuning transformer-based language models from scratch. It provides a minimalist codebase designed for educational exploration and rapid experimentation with neural network architectures, utilizing self-attention and feed-forward layers to proces…

Is artidoro/qlora a good alternative to TinyLlama?

This project is a quantized fine-tuning framework for large language models. It implements a low-rank adaptation library and a four-bit quantizer to reduce the GPU memory requirements needed to train large models. The framework utilizes four-bit quantization and low-rank adapters to enable model t…

Is microsoft/deepspeed a good alternative to TinyLlama?

DeepSpeed is a distributed deep learning optimization library and framework designed for the training and inference of massive AI models. It serves as a model parallelism orchestrator and a toolkit for scaling large language models across multiple GPUs and compute nodes. The project distinguishes…

Is qwenlm/qwen-7b a good alternative to TinyLlama?

Qwen-7B is a pretrained causal language model designed for natural language generation, text processing, and complex reasoning tasks. It is available as an instruction-tuned model optimized for conversational interactions and a tool-use model capable of executing function calls and interacting with…

Is stability-ai/stablelm a good alternative to TinyLlama?

StableLM is a pre-trained transformer-based large language model designed for natural language generation and zero-shot inference. It functions as a causal language model that predicts the next token in a sequence to produce human-like text for conversational and creative writing tasks. The model…

Is lm-sys/fastchat a good alternative to TinyLlama?

FastChat is a training and serving platform for large language models that provides an integrated toolkit for fine-tuning, hosting, and benchmarking chatbots. It functions as an inference server capable of hosting multiple models and exposing them via a standardized API for chat applications. The…

Back to jzhang38/tinyllama

Open-source alternatives to TinyLlama

30 open-source projects similar to jzhang38/tinyllama, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best TinyLlama alternative.

lightning-ai/litgpt
Lightning-AI/litgpt
13,431View on GitHub
LitGPT is a training and deployment framework for large language models, providing a suite of tools for pretraining, finetuning, quantizing, evaluating, and serving models within a production environment. It includes a dedicated training pipeline for adapting pretrained models to specific tasks, a quantization tool for reducing weight precision, and an inference server for hosting models via web interfaces. The framework supports high-performance model development through custom architecture implementation and the use of predefined recipes to standardize pretraining and finetuning. It enables
Python
View on GitHub13,431
karpathy/llm.c
karpathy/llm.c
30,230View on GitHub
This project is a low-dependency engine designed for training large language models using native C and CUDA. It provides a bare-metal environment for tensor computation, allowing for the execution of neural network operations directly on hardware accelerators without the overhead of high-level software abstractions. The framework distinguishes itself by implementing manual gradient backpropagation and custom hardware-specific kernels, providing granular control over memory mapping and computational precision. It supports distributed training across multiple graphics processors and compute nod
Cuda
View on GitHub30,230
deepseek-ai/deepseek-llm
deepseek-ai/deepseek-LLM
7,100View on GitHub
DeepSeek-LLM is a large language model and causal language model designed for natural language generation. It functions as a multi-lingual system capable of predicting the next token in a sequence to perform text completion and conversational generation. The model is specialized for logical reasoning, specifically as a code and math LLM. This enables it to perform complex problem solving, which includes generating executable code and solving mathematical equations through step-by-step analysis. The system's broader capabilities cover conversational AI, including the generation of chat comple
Makefile
View on GitHub7,100

Open-source alternatives to TinyLlama

Lightning-AI/litgpt

karpathy/llm.c

deepseek-ai/deepseek-LLM

d2l-ai/d2l-en

karpathy/nanoGPT

artidoro/qlora

microsoft/DeepSpeed

QwenLM/Qwen-7B

Stability-AI/StableLM

lm-sys/FastChat

databrickslabs/dolly

zyds/transformers-code

PaddlePaddle/ERNIE

liguodongiot/llm-action

lightning-AI/lightning

apple/corenet

d2l-ai/d2l-zh

NVIDIA/NeMo

EleutherAI/gpt-neox

allenai/OLMo

jingyaogong/minimind

nndl/llm-beginner

NVIDIA/Megatron-LM

facebookresearch/metaseq

fastai/course22

datawhalechina/tiny-universe

lucidrains/x-transformers

xlite-dev/LeetCUDA

nomic-ai/gpt4all

MoonshotAI/Kimi-K2