What are the best open-source alternatives to Build Nanogpt?

30 open-source projects similar to karpathy/build-nanogpt, ranked by shared features. Top picks: datawhalechina/so-large-lm, morizeyao/gpt2-chinese, openai/gpt-2, facebookresearch/fairseq, cs231n/cs231n.github.io, datawhalechina/tiny-universe, nndl/llm-beginner, mymusise/chatglm-tuning, 649453932/bert-chinese-text-classification-pytorch, rasbt/machine-learning-book.

Is datawhalechina/so-large-lm a good alternative to Build Nanogpt?

This project is a comprehensive educational curriculum and structured learning path covering the full lifecycle of large language models. It provides a guided progression through the theory, architecture, training, and deployment of these models. The curriculum includes specialized guides on trans…

Is morizeyao/gpt2-chinese a good alternative to Build Nanogpt?

GPT2-Chinese is a Chinese language model implementation based on the GPT-2 architecture. It provides a causal language model trainer and a natural language generation tool designed for training and generating human-like Chinese text sequences. The system integrates a BERT tokenizer to process Chin…

Is openai/gpt-2 a good alternative to Build Nanogpt?

This project is a transformer-based language model and autoregressive text generator designed to predict the next token in a sequence to produce human-like prose and synthetic text. It functions as a large language model that utilizes a transformer architecture to learn linguistic patterns from lar…

Is facebookresearch/fairseq a good alternative to Build Nanogpt?

Fairseq is a PyTorch toolkit for sequence-to-sequence modeling, specializing in neural machine translation, automatic speech recognition, and large-scale language model training. It provides a framework for processing and aligning diverse data sources, including text, audio, and video, to support t…

Is cs231n/cs231n.github.io a good alternative to Build Nanogpt?

This project is a static educational website and comprehensive curriculum focused on computer vision and deep learning. It serves as a public repository of instructional materials, lecture notes, and technical guides specifically detailing convolutional neural networks and visual recognition. The…

Is datawhalechina/tiny-universe a good alternative to Build Nanogpt?

Tiny Universe is an educational monorepo that delivers multiple independent implementations of core AI subsystems as self-contained Jupyter notebooks. It provides from-scratch constructions of foundational architectures including a complete Transformer model built from the original paper specificat…

Is nndl/llm-beginner a good alternative to Build Nanogpt?

This project is a collection of educational resources and technical guides focused on the development and implementation of large language models. It provides a comprehensive curriculum covering transformer architectures, training methods, and deployment strategies. The materials provide detailed…

Is mymusise/chatglm-tuning a good alternative to Build Nanogpt?

This project is a framework for fine-tuning large language models using parameter-efficient training techniques. It provides a structured pipeline for adapting pre-trained transformer models to specific tasks while minimizing the computational resources and memory required during the training proce…

Is 649453932/bert-chinese-text-classification-pytorch a good alternative to Build Nanogpt?

This project is a PyTorch-based Chinese text classification framework. It provides a transformer-based pipeline designed to categorize Chinese language sequences into predefined labels using deep learning models. The implementation supports both BERT and ERNIE language models for processing and ta…

Is rasbt/machine-learning-book a good alternative to Build Nanogpt?

This project is a comprehensive machine learning educational resource and tutorial series delivered as a collection of interactive Jupyter Notebooks. It provides practical Python implementations for the end-to-end machine learning lifecycle, covering supervised and unsupervised learning, deep learn…

Back to karpathy/build-nanogpt

Open-source alternatives to Build Nanogpt

30 open-source projects similar to karpathy/build-nanogpt, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Build Nanogpt alternative.

datawhalechina/so-large-lm
datawhalechina/so-large-lm
7,400View on GitHub
This project is a comprehensive educational curriculum and structured learning path covering the full lifecycle of large language models. It provides a guided progression through the theory, architecture, training, and deployment of these models. The curriculum includes specialized guides on transformer architecture, model training tutorials, and frameworks for designing autonomous agents. It also provides dedicated resources for studying model safety and ethics. The material covers a wide range of technical capabilities, including distributed training strategies, parameter-efficient fine-tu
View on GitHub7,400
morizeyao/gpt2-chinese
Morizeyao/GPT2-Chinese
7,596View on GitHub
GPT2-Chinese is a Chinese language model implementation based on the GPT-2 architecture. It provides a causal language model trainer and a natural language generation tool designed for training and generating human-like Chinese text sequences. The system integrates a BERT tokenizer to process Chinese corpora into manageable units for machine learning. It enables the development of predictive text models that can generate specific patterns, such as news or poetry, through prompt-based text completion. The project covers a full workflow including text tokenization, model training using a trans
Python
View on GitHub7,596
openai/gpt-2
openai/gpt-2
24,967View on GitHub
This project is a transformer-based language model and autoregressive text generator designed to predict the next token in a sequence to produce human-like prose and synthetic text. It functions as a large language model that utilizes a transformer architecture to learn linguistic patterns from large datasets for unsupervised multitask learning. The repository provides a distribution of pre-trained weights, enabling natural language processing tasks without requiring additional training. This allows the model to perform zero-shot task generalization by applying learned patterns to new tasks.
Python
View on GitHub24,967

Open-source alternatives to Build Nanogpt

datawhalechina/so-large-lm

Morizeyao/GPT2-Chinese

openai/gpt-2

facebookresearch/fairseq

cs231n/cs231n.github.io

datawhalechina/tiny-universe

nndl/llm-beginner

mymusise/ChatGLM-Tuning

649453932/Bert-Chinese-Text-Classification-Pytorch

rasbt/machine-learning-book

yunjey/pytorch-tutorial

liguodongiot/llm-action

mistralai/cookbook

microsoft/generative-ai-for-beginners

iusztinpaul/hands-on-llms

datawhalechina/llms-from-scratch-cn

merveenoyan/smol-vision

changyeyu/LLM-RL-Visualized

FareedKhan-dev/all-rag-techniques

karpathy/LLM101n

langgptai/LangGPT

karpathy/nanochat

openai/openai-cookbook

mlabonne/llm-course

andysingal/llm-course

karminski/one-small-step

KellerJordan/modded-nanogpt

facebookresearch/deit

QwenLM/Qwen-7B

jingyaogong/minimind