What are the best open-source alternatives to Starcoder2?

30 open-source projects similar to bigcode-project/starcoder2, ranked by shared features. Top picks: salesforce/codet5, mistralai/mistral-src, hiyouga/llama-factory, intel-analytics/bigdl, masa3141/japanese-alpaca-lora, microsoft/lora, hpprc/llm-lora-classification, huggingface/accelerate, huggingface/peft, artidoro/qlora.

Is salesforce/codet5 a good alternative to Starcoder2?

Home of CodeT5: Open Code LLMs for Code Understanding and Generation

Is mistralai/mistral-src a good alternative to Starcoder2?

This project is a large language model inference library and framework designed to run models for text generation, problem solving, and coding assistance. It includes a multimodal framework for processing combined image and text inputs and a tool-use implementation that enables the execution of ext…

Is hiyouga/llama-factory a good alternative to Starcoder2?

LLaMA-Factory is a comprehensive suite for dataset preparation, model fine-tuning, memory optimization, and standardized API deployment. It provides a unified platform for the supervised and reward-based fine-tuning of large language models and vision-language models. The framework includes a spec…

Is intel-analytics/bigdl a good alternative to Starcoder2?

BigDL is a PyTorch acceleration framework and distributed inference engine designed for large language models. It provides a toolkit for running models on Intel hardware, integrating quantization tools and libraries for parameter-efficient fine-tuning. The project distinguishes itself through the…

Is masa3141/japanese-alpaca-lora a good alternative to Starcoder2?

A japanese finetuned instruction LLaMA

Is microsoft/lora a good alternative to Starcoder2?

LoRA is a framework for parameter-efficient fine-tuning of large-scale neural networks. It functions by injecting trainable low-rank decomposition matrices into frozen model layers, allowing for task-specific adaptation while preserving the integrity of the original base model weights. The project…

Is hpprc/llm-lora-classification a good alternative to Starcoder2?

LLMとLoRAを用いたテキスト分類

Is huggingface/accelerate a good alternative to Starcoder2?

Accelerate is a PyTorch distributed training library that abstracts the boilerplate required to run models across multiple GPUs, TPUs, and CPUs. It functions as a deep learning model scaler and distributed hardware orchestrator, allowing the same training script to run on different hardware backend…

Is huggingface/peft a good alternative to Starcoder2?

This library provides a framework for parameter-efficient fine-tuning, enabling the adaptation of large pretrained models by training only a small subset of parameters. It functions as a distributed model training system and optimization toolkit, designed to reduce the computational and memory requ…

Is artidoro/qlora a good alternative to Starcoder2?

This project is a quantized fine-tuning framework for large language models. It implements a low-rank adaptation library and a four-bit quantizer to reduce the GPU memory requirements needed to train large models. The framework utilizes four-bit quantization and low-rank adapters to enable model t…

Back to bigcode-project/starcoder2

Open-source alternatives to Starcoder2

Q: Is hpprc/llm-lora-classification a good alternative to Starcoder2?

LLMとLoRAを用いたテキスト分類

30 open-source projects similar to bigcode-project/starcoder2, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Starcoder2 alternative.

salesforce/codet5
salesforce/CodeT5
3,098View on GitHub
Home of CodeT5: Open Code LLMs for Code Understanding and Generation
Python
View on GitHub3,098
mistralai/mistral-src
mistralai/mistral-src
10,821View on GitHub
This project is a large language model inference library and framework designed to run models for text generation, problem solving, and coding assistance. It includes a multimodal framework for processing combined image and text inputs and a tool-use implementation that enables the execution of external functions based on model reasoning. The system features a distributed GPU inference engine that spreads large model workloads across multiple graphics processors to increase processing speed and meet memory requirements. It also provides containerized model deployment through pre-packaged imag
Jupyter Notebook
View on GitHub10,821
hiyouga/llama-factory
hiyouga/LLaMA-Factory
72,241View on GitHub
LLaMA-Factory is a comprehensive suite for dataset preparation, model fine-tuning, memory optimization, and standardized API deployment. It provides a unified platform for the supervised and reward-based fine-tuning of large language models and vision-language models. The framework includes a specialized toolkit for training vision-language models and a model serving interface that deploys trained models through high-performance APIs. It utilizes precision tuning and quantization techniques to reduce the hardware requirements and memory footprint of large models. The system covers data pipel
Python
View on GitHub72,241
intel-analytics/bigdl
intel-analytics/BigDL
8,845View on GitHub
BigDL is a PyTorch acceleration framework and distributed inference engine designed for large language models. It provides a toolkit for running models on Intel hardware, integrating quantization tools and libraries for parameter-efficient fine-tuning. The project distinguishes itself through the use of pipeline parallelism to distribute model workloads across multiple hardware accelerators. It utilizes low-bit integer quantization and speculative decoding to reduce memory footprints and decrease text generation latency. The system covers broad capabilities in model optimization, including w
Python
View on GitHub8,845

Open-source alternatives to Starcoder2

salesforce/CodeT5

mistralai/mistral-src

hiyouga/LLaMA-Factory

intel-analytics/BigDL

masa3141/japanese-alpaca-lora

microsoft/LoRA

hppRC/llm-lora-classification

huggingface/accelerate

huggingface/peft

artidoro/qlora

langchain-ai/langsmith-sdk

lxe/simple-llm-finetuner

microsoft/guidance

BerriAI/litellm

huggingface/optimum

hpcaitech/ColossalAI

dvmazur/mixtral-offloading

bigscience-workshop/petals

chroma-core/chroma

facebookresearch/fairscale

facebookresearch/llama

facebookresearch/llama-recipes

langchain-ai/langchain

Facico/Chinese-Vicuna

LinkSoul-AI/Chinese-Llama-2-7b

lm-sys/FastChat

databrickslabs/dolly

microsoft/DeepSpeed

higgsfield-ai/higgsfield

jzhang38/TinyLlama