What are the best open-source alternatives to Starcoder?

30 open-source projects similar to bigcode-project/starcoder, ranked by shared features. Top picks: qwenlm/qwen-7b, meta-llama/llama-models, scir-hi/huatuo-llama-med-chinese, databrickslabs/dolly, huggingface/smollm, meta-pytorch/torchtune, opengvlab/llama-adapter, paddlepaddle/ernie, ymcui/chinese-llama-alpaca, microsoft/nlp-recipes.

Is qwenlm/qwen-7b a good alternative to Starcoder?

Qwen-7B is a pretrained causal language model designed for natural language generation, text processing, and complex reasoning tasks. It is available as an instruction-tuned model optimized for conversational interactions and a tool-use model capable of executing function calls and interacting with…

Is meta-llama/llama-models a good alternative to Starcoder?

This project provides a foundational framework and reference implementation for executing causal language modeling and multimodal reasoning on local systems. It includes a set of core components for managing model assets, a fine-tuning framework, and structural definitions required to instantiate t…

Is scir-hi/huatuo-llama-med-chinese a good alternative to Starcoder?

Huatuo-Llama-Med-Chinese is a medical large language model specialized in processing and generating natural language text in Chinese. It is an instruction-tuned system designed to answer professional healthcare questions by leveraging a dedicated medical knowledge base. The model integrates struct…

Is databrickslabs/dolly a good alternative to Starcoder?

Dolly is an instruction-tuned large language model designed to follow complex natural language directions. It operates as a causal language model that predicts the next token in a sequence to generate coherent conversational responses and perform tasks such as brainstorming, classification, and que…

Is huggingface/smollm a good alternative to Starcoder?

SmolLM is a project dedicated to the development of small language models. It focuses on training and fine-tuning compact models that maintain high performance while utilizing fewer parameters. The project emphasizes efficient AI inference and on-device text generation, aiming to enable the deploy…

Is meta-pytorch/torchtune a good alternative to Starcoder?

Torchtune is a PyTorch-native library for fine-tuning, aligning, and quantizing large language models. It provides a config-driven system for instantiating components, orchestrating distributed training, and managing parameter-efficient fine-tuning with quantization support, all through YAML-based…

Is opengvlab/llama-adapter a good alternative to Starcoder?

LLaMA-Adapter is a parameter-efficient fine-tuning framework designed to adapt large language models using a minimal set of trainable parameters. It functions as an instruction tuning tool and a multimodal adapter, allowing pre-trained models to follow human instructions and process non-textual dat…

Is paddlepaddle/ernie a good alternative to Starcoder?

ERNIE is a development toolkit for training, fine-tuning, and deploying large language models built on the PaddlePaddle deep learning platform. It provides a comprehensive suite of core components, including an inference server for vision and language models, a training and fine-tuning toolkit, and…

Is ymcui/chinese-llama-alpaca a good alternative to Starcoder?

This project is a comprehensive toolkit for adapting large language models to the Chinese language, providing a specialized framework for fine-tuning, inference, and local deployment. It serves as a coordinated suite for language-specific adaptation, including tools for expanding tokenizers and imp…

Is microsoft/nlp-recipes a good alternative to Starcoder?

nlp-recipes is a collection of implementation guides and reference templates for applying natural language processing techniques to real-world tasks. It provides standardized workflows and code examples for developing NLP pipelines, from dataset preparation and model training to performance evaluat…

Back to bigcode-project/starcoder

Open-source alternatives to Starcoder

30 open-source projects similar to bigcode-project/starcoder, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Starcoder alternative.

qwenlm/qwen-7b
QwenLM/Qwen-7B
21,343View on GitHub
Qwen-7B is a pretrained causal language model designed for natural language generation, text processing, and complex reasoning tasks. It is available as an instruction-tuned model optimized for conversational interactions and a tool-use model capable of executing function calls and interacting with external APIs. The project provides a quantized version of the model to reduce GPU memory usage and supports the development of autonomous agents that can execute code and perform functions to complete complex goals. The system covers a wide range of capabilities including model fine-tuning throug
Python
View on GitHub21,343
meta-llama/llama-models
meta-llama/llama-models
7,643View on GitHub
This project provides a foundational framework and reference implementation for executing causal language modeling and multimodal reasoning on local systems. It includes a set of core components for managing model assets, a fine-tuning framework, and structural definitions required to instantiate transformer-based architectures. The system is distinguished by its ability to process combined text and image inputs through multimodal transformer models for visual reasoning and document analysis. It also supports the deployment of quantized models, reducing memory footprints through low-precision
Python
View on GitHub7,643
scir-hi/huatuo-llama-med-chinese
SCIR-HI/Huatuo-Llama-Med-Chinese
4,971View on GitHub
Huatuo-Llama-Med-Chinese is a medical large language model specialized in processing and generating natural language text in Chinese. It is an instruction-tuned system designed to answer professional healthcare questions by leveraging a dedicated medical knowledge base. The model integrates structured medical literature and knowledge graphs to ensure clinical accuracy during response generation. It employs knowledge-graph augmented inference to combine structured entity relationships with neural network outputs. The system is developed through domain-specific weight adaptation, cross-lingual
Pythonaidoctorbloomchinese
View on GitHub4,971

Open-source alternatives to Starcoder

QwenLM/Qwen-7B

meta-llama/llama-models

SCIR-HI/Huatuo-Llama-Med-Chinese

databrickslabs/dolly

huggingface/smollm

meta-pytorch/torchtune

OpenGVLab/LLaMA-Adapter

PaddlePaddle/ERNIE

ymcui/Chinese-LLaMA-Alpaca

microsoft/nlp-recipes

Facico/Chinese-Vicuna

ymcui/Chinese-LLaMA-Alpaca-2

thunlp/UltraChat

meta-llama/codellama

OpenRLHF/OpenRLHF

yangjianxin1/Firefly

OpenBMB/MiniCPM

espnet/espnet

OptimalScale/LMFlow

pengxiao-song/LaWGPT

intel/ipex-llm

facebookresearch/llama-recipes

modelscope/swift

deepseek-ai/DeepSeek-Coder

oumi-ai/oumi

thinking-machines-lab/tinker-cookbook

philschmid/deep-learning-pytorch-huggingface

intel-analytics/BigDL

huggingface/notebooks

huggingface/course