Long Llama

Long Llama - process massive document sequenc… | Awesome Repos

Open-source alternatives to Long Llama

Similar open-source projects, ranked by how many features they share with Long Llama.

google/gemma_pytorch
google/gemma_pytorch
5,697View on GitHub
The official PyTorch implementation of Google's Gemma models
Pythongemmagooglepytorch
View on GitHub5,697
01-ai/yi
01-ai/Yi
7,822View on GitHub
Yi is a bilingual language model and foundation model designed for natural language processing, reasoning, and reading comprehension in both English and Chinese. It is built as a transformer-based architecture capable of general purpose text generation and conversational tasks. The model is distinguished by its ability to function as a long context system, processing and analyzing extended input sequences up to 200k tokens. It also supports quantized versions that use low-bit precision to reduce memory footprints, enabling execution on consumer-grade hardware. The project covers a broad rang
Jupyter Notebooklarge-language-models
View on GitHub7,822
thinking-machines-lab/tinker-cookbook
thinking-machines-lab/tinker-cookbook
2,856View on GitHub
Tinker Cookbook is an open-source framework for fine-tuning large language models, supporting supervised learning, reinforcement learning, and parameter-efficient techniques like LoRA adapters. It provides a complete pipeline for aligning models with human preferences through multi-stage RLHF workflows, from supervised fine-tuning through preference optimization to reinforcement learning. The framework distinguishes itself through recipe-based training orchestration, where fine-tuning workflows are defined as composable recipe files that chain data loading, model configuration, and training l
Python
View on GitHub2,856
qwenlm/qwen2.5
QwenLM/Qwen2.5
27,307View on GitHub
Qwen2.5 is a suite of large language model foundation models designed for natural language generation, code production, and complex mathematical reasoning. The project encompasses a multilingual language model capable of processing dozens of languages and a specialized code generation model for technical problem solving and debugging. The framework is distinguished by its long context capabilities, enabling the analysis of massive inputs ranging from 256K up to 1 million tokens. It further functions as an agentic framework, utilizing standardized templates and parsers to execute autonomous wo
Python
View on GitHub27,307

See all 30 alternatives to Long Llama

CStanKonradlong_llama

Features

Open-source alternatives to Long Llama

google/gemma_pytorch

01-ai/Yi

thinking-machines-lab/tinker-cookbook

QwenLM/Qwen2.5

Star history

Open-source alternatives to Long Llama

google/gemma_pytorch

01-ai/Yi

thinking-machines-lab/tinker-cookbook

QwenLM/Qwen2.5