What are the best open-source alternatives to Bitsandbytes?

30 open-source projects similar to bitsandbytes-foundation/bitsandbytes, ranked by shared features. Top picks: timdettmers/bitsandbytes, pytorch/torchtune, nvlabs/sana, infrasys-ai/aiinfra, artidoro/qlora, afshinea/stanford-cme-295-transformers-large-language-models, openaccess-ai-collective/axolotl, deepspeedai/deepspeedexamples, opennmt/opennmt-py, google-deepmind/gemma.

Is timdettmers/bitsandbytes a good alternative to Bitsandbytes?

bitsandbytes is a quantization library for large language models that reduces memory footprints using k-bit quantization. It provides a framework for 4-bit low-rank adaptation, tools for 8-bit model compression, and memory-efficient optimizer extensions for PyTorch. The project enables the trainin…

Is pytorch/torchtune a good alternative to Bitsandbytes?

Torchtune is a PyTorch-native library for fine-tuning, aligning, and quantizing large language models. It provides a configurable training pipeline orchestrated through YAML recipes, with CLI overrides and component swapping, distributed training via FSDP2, memory optimizations, and parameter-effic…

Is nvlabs/sana a good alternative to Bitsandbytes?

Sana is a framework for high-resolution image and video synthesis based on a linear diffusion transformer. It provides a toolkit for the training, fine-tuning, and execution of text-to-image and text-to-video models, as well as a video generative world model capable of simulating physical environme…

Is infrasys-ai/aiinfra a good alternative to Bitsandbytes?

infrasys-ai/aiinfra is an open-source alternative to Bitsandbytes.

Is artidoro/qlora a good alternative to Bitsandbytes?

This project is a quantized fine-tuning framework for large language models. It implements a low-rank adaptation library and a four-bit quantizer to reduce the GPU memory requirements needed to train large models. The framework utilizes four-bit quantization and low-rank adapters to enable model t…

Is afshinea/stanford-cme-295-transformers-large-language-models a good alternative to Bitsandbytes?

This project is a comprehensive technical course study guide and reference for learning the architectures and training methods of Transformers and large language models. It serves as a technical overview for understanding how neural networks process data and how to align model behavior with specifi…

Is openaccess-ai-collective/axolotl a good alternative to Bitsandbytes?

Axolotl is a distributed training orchestrator and fine-tuning framework for large language models, multimodal systems, and quantized models. It provides a structured environment for specializing pre-trained models through full parameter updates or low-rank adaptation, as well as aligning model out…

Is deepspeedai/deepspeedexamples a good alternative to Bitsandbytes?

DeepSpeedExamples is a collection of reference implementations and scripts for training, fine-tuning, and executing inference on large-scale AI models using DeepSpeed optimization. It provides a distributed model training guide and practical workflows for adapting large language models through memo…

Is opennmt/opennmt-py a good alternative to Bitsandbytes?

OpenNMT-py is a PyTorch neural machine translation framework used for training and deploying neural machine translation and large language models. It functions as a distributed model training system, an inference engine, and a toolkit for fine-tuning large language models. The framework distinguis…

Is google-deepmind/gemma a good alternative to Bitsandbytes?

Gemma is a family of open-weights large language models based on a decoder-only transformer architecture. These models are designed for text generation and multi-modal conversations, capable of processing and generating responses based on both textual and visual input sequences. The project provid…


Open-source alternatives to Bitsandbytes

30 open-source projects similar to bitsandbytes-foundation/bitsandbytes, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Bitsandbytes alternative.

timdettmers/bitsandbytes
Language: Python
Stars: 8,277
bitsandbytes is a quantization library for large language models that reduces memory footprints using k-bit quantization. It provides a framework for 4-bit low-rank adaptation, tools for 8-bit model compression, and memory-efficient optimizer extensions for PyTorch. The project enables the training of large models on limited hardware through 4-bit quantization and low-rank adaptation weights. It also facilitates faster inference by compressing models to 8-bit precision using vector-wise quantization. The library covers a range of memory optimization capabilities, including optimizer memory r…
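The description above mentions compressing models to 8-bit precision with vector-wise quantization. As a rough illustration of that idea only, the following is a minimal sketch of per-row absmax quantization in plain Python. The function names `quantize_rowwise_int8` and `dequantize_rowwise_int8` are hypothetical and are not part of the bitsandbytes API; the library's actual kernels are more involved and are not reproduced here.

```python
# Illustrative sketch only: per-row ("vector-wise") absmax int8 quantization.
# Function names are hypothetical; this is not the bitsandbytes implementation.

def quantize_rowwise_int8(matrix):
    """Quantize each row to integers in [-127, 127] using that row's own scale."""
    quantized, scales = [], []
    for row in matrix:
        absmax = max(abs(x) for x in row)
        # One scale per row; fall back to 1.0 for an all-zero row.
        scale = absmax / 127.0 if absmax > 0 else 1.0
        quantized.append([max(-127, min(127, round(x / scale))) for x in row])
        scales.append(scale)
    return quantized, scales


def dequantize_rowwise_int8(quantized, scales):
    """Map the int8 codes back to approximate floating-point values."""
    return [[q * s for q in row] for row, s in zip(quantized, scales)]


weights = [[0.5, -1.0, 0.25], [10.0, 2.0, -4.0]]
codes, scales = quantize_rowwise_int8(weights)
restored = dequantize_rowwise_int8(codes, scales)
```

Because each row carries its own scale, a row with large values (the second row here) does not reduce the resolution available to a row with small values. The reconstruction error per element is bounded by half of that row's scale.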
pytorch/torchtune
Language: Python
Stars: 5,774
Torchtune is a PyTorch-native library for fine-tuning, aligning, and quantizing large language models. It provides a configurable training pipeline orchestrated through YAML recipes, with CLI overrides and component swapping, distributed training via FSDP2, memory optimizations, and parameter-efficient fine-tuning methods like LoRA, DoRA, and QLoRA. The library distinguishes itself through its YAML-driven configuration system that defines all training parameters and instantiates components from config files, with full CLI override capability for any field or component at launch time. It suppo…
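Several entries on this page refer to low-rank adaptation (LoRA). As a rough illustration of why it counts as parameter-efficient, the sketch below applies a low-rank update to a frozen weight matrix in plain Python. It assumes the commonly used formulation W' = W + (alpha / r) * B * A; the helper names `matmul`, `lora_effective_weight`, and `trainable_params` are hypothetical and do not come from torchtune or any other library listed here.

```python
# Illustrative sketch only: a LoRA-style low-rank update on a frozen weight.
# Helper names are hypothetical; this is not torchtune's implementation.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def lora_effective_weight(w, a, b, alpha):
    """Return W + (alpha / r) * B @ A, where A is r x in and B is out x r."""
    rank = len(a)
    scale = alpha / rank
    delta = matmul(b, a)
    return [[w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
            for w_row, d_row in zip(w, delta)]


def trainable_params(out_dim, in_dim, rank):
    """Count adapter parameters: A has rank * in_dim, B has out_dim * rank."""
    return rank * (in_dim + out_dim)


frozen = [[1.0, 0.0], [0.0, 1.0]]   # frozen base weight (2 x 2)
a = [[1.0, 2.0]]                    # A: rank 1 x in_dim 2
b = [[0.5], [0.0]]                  # B: out_dim 2 x rank 1
adapted = lora_effective_weight(frozen, a, b, alpha=2.0)
```

Only A and B are trained while the base weight stays frozen. For a 4096 x 4096 layer at rank 8, `trainable_params(4096, 4096, 8)` gives 65,536 adapter parameters against 16,777,216 in the full matrix.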
NVlabs/Sana
Language: Python
Stars: 8,310
Sana is a framework for high-resolution image and video synthesis based on a linear diffusion transformer. It provides a toolkit for the training, fine-tuning, and execution of text-to-image and text-to-video models, as well as a video generative world model capable of simulating physical environments with precise spatial control. The project is distinguished by its use of linear complexity layers to handle high resolutions and its support for long-form, minute-length video generation in real time. It implements a two-stage inference paradigm that separates structural generation from visual t…

Open-source alternatives to Bitsandbytes

timdettmers/bitsandbytes

pytorch/torchtune

NVlabs/Sana

Infrasys-AI/AIInfra

artidoro/qlora

afshinea/stanford-cme-295-transformers-large-language-models

OpenAccess-AI-Collective/axolotl

deepspeedai/DeepSpeedExamples

OpenNMT/OpenNMT-py

google-deepmind/gemma

meta-pytorch/torchtune

zai-org/CogVLM

OpenRLHF/OpenRLHF

intel/ipex-llm

intel/neural-compressor

ml-explore/mlx-examples

meta-llama/llama-models

Infrasys-AI/AISystem

ymcui/Chinese-LLaMA-Alpaca-2

OpenBMB/MiniCPM

modelscope/swift

dusty-nv/jetson-inference

h2oai/h2o-llmstudio

mosaicml/llm-foundry

meta-llama/llama-recipes

Lightning-AI/lit-llama

zhaochenyang20/Awesome-ML-SYS-Tutorial

hiyouga/EasyR1

apachecn/pytorch-doc-zh

predibase/lorax