ML frameworks and MLOps

Explore libraries for building machine learning models and tools for managing their entire production lifecycle.


keras-team/keras (64,094 stars)
Keras is a high-level deep learning framework designed for constructing and training neural networks through the composition of modular, functional layers. It serves as a comprehensive modeling toolkit that provides standardized procedures for defining, evaluating, and deploying complex architectures. By utilizing a directed acyclic graph approach, the framework allows users to build intricate models with multiple inputs, outputs, and shared layers, ensuring consistent numerical execution through functional state management. The project distinguishes itself as a multi-backend machine learning
Python, Frameworks, Model Definition, Architectures

eugeneyan/applied-ml (29,783 stars)
This project is a comprehensive, curated knowledge base designed to support the development and maintenance of production-grade machine learning systems. It serves as a centralized repository of industry-standard technical literature, engineering case studies, and research papers, providing a structured reference for practitioners navigating the complexities of modern data science and machine learning engineering. The resource distinguishes itself through a cross-domain approach that bridges the gap between academic research and practical implementation. By synthesizing proven industry archit
Lifecycle Management, Data Pipelines, Machine Learning Operations Platforms

ageron/handson-ml2 (29,938 stars)
This project provides a collection of practical machine learning code examples, including implementations for supervised, unsupervised, and reinforcement learning algorithms. It features deep learning model implementations for convolutional, recurrent, and generative architectures, alongside specific examples of reinforcement learning agents that maximize rewards in simulated environments. The repository includes dedicated data preprocessing pipelines for sanitization, feature scaling, and dimensionality reduction. It also provides implementations for a wide range of specific models, such as
Jupyter Notebook, Machine Learning Implementations, Convolutional Neural Networks, Data Preparation

Lightning-AI/pytorch-lightning (31,201 stars)
PyTorch Lightning is a deep learning research framework that provides a structured environment for organizing machine learning code. It functions as a unified trainer orchestrator, centralizing the execution flow by managing the interaction between hardware resources, data loaders, and model components. By decoupling model architecture from training logic, the framework enables researchers to maintain clean, modular codebases that remain portable across different environments. The framework distinguishes itself through a hardware-agnostic abstraction layer that scales deep learning workloads
Python, Deep Learning Frameworks, Modular Training Orchestrators, Training Orchestrators

VoltAgent/awesome-claude-code-subagents (21,906 stars)
This project provides a framework for managing multi-agent systems, designed to automate complex software development, infrastructure, and business workflows. It functions as a multi-agent workflow orchestrator that routes tasks to domain-specific workers while maintaining state persistence and infrastructure automation. By leveraging large language models, the system decomposes high-level objectives into actionable plans, ensuring that complex operations are executed with consistency and reliability. The framework distinguishes itself through its hierarchical agent registry and policy-driven
Shell, Agent Discovery Interfaces, Agentic Task Automation, Agentic Task Orchestrators

hiyouga/LlamaFactory (72,213 stars)
LlamaFactory is a unified framework for fine-tuning and adapting large language models. It provides a comprehensive platform that standardizes training workflows across diverse machine learning architectures, allowing users to execute both full-tuning and parameter-efficient methods through a single interface. The project distinguishes itself by offering a low-code visual dashboard that enables users to configure experiments and monitor performance metrics in real time without writing extensive custom scripts. It also features a configuration-driven orchestration system that decouples experim
Python, Experiment Tracking, Language Model Fine-Tuning, Large Language Model Fine-Tuning Frameworks

pycaret/pycaret (9,811 stars)
PyCaret is a Python AutoML platform and MLOps lifecycle manager designed to automate machine learning workflows. It functions as a low-code environment that leverages a scikit-learn native engine to execute preprocessing, training, and evaluation for tabular data. The platform distinguishes itself as an LLM-powered ML copilot, using large language model agents to analyze datasets, design experiment configurations, and explain model results. It also serves as a Kubernetes ML orchestrator and model registry, enabling the versioning of trained pipelines and their promotion to production API endp
Python, Automated Machine Learning, Machine Learning Workflow Libraries, AI Agent Integrations

tensorflow/models (77,663 stars)
This repository serves as a centralized collection of state-of-the-art deep learning architectures and reference implementations designed for research and application development. It provides a comprehensive toolkit for computer vision and natural language processing, offering pre-built models and training pipelines for tasks ranging from image classification and object detection to complex sequence modeling. The project distinguishes itself by providing a flexible execution harness that manages the entire training lifecycle, including data ingestion and backpropagation. It supports scalable
Python, Computer Vision Models, Development and Orchestration Tools, Distributed Parameter Synchronisation

chiphuyen/dmls-book (4,395 stars)
This is a reference guide for designing, deploying, and maintaining production-ready machine learning systems, grounded in MLOps best practices. It covers the complete machine learning lifecycle, from system design and workflow planning through to deployment and ongoing maintenance, with a focus on reliability, scalability, and maintainability as business requirements evolve. The guide provides an architecture reference for establishing shared ML infrastructure, including model registries and feature stores that standardize asset reuse across teams. It details pipeline automation through conf
Production Machine Learning Guides, DAG-Based Orchestration, Feature Stores

hpcaitech/ColossalAI (41,395 stars)
ColossalAI is a distributed deep learning framework designed for training and deploying massive artificial intelligence models across clusters of hardware accelerators. It functions as a parallel computing engine that partitions model workloads and data across multiple processors to maximize memory efficiency and throughput. The platform distinguishes itself through a comprehensive suite of parallelization strategies, including multi-dimensional tensor parallelism and pipeline-based model parallelism, which segment neural network layers and stages across devices. To support large-scale genera
Python, Distributed Deep Learning Frameworks, Distributed Training Orchestrators, Large-Scale Model Training

karpathy/nanochat (55,103 stars)
Nanochat is a lightweight execution environment designed for training and running language models on standard consumer hardware. It functions as both a neural network training framework and an inference engine, enabling users to perform backpropagation-based training and model execution directly on general-purpose processors without the need for dedicated graphics hardware. The project distinguishes itself through a suite of optimization tools that prioritize efficiency on local machines. By utilizing memory-mapped weight loading and CPU-optimized vector math, it maximizes throughput for inte
Python, Local Inference Runtimes, Transformer Inference Engines, Training Frameworks

EthicalML/awesome-production-machine-learning (20,638 stars)
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
Awesome List, Applied Machine Learning, Curated Research Lists

tensorflow/tensorflow (195,697 stars)
TensorFlow is a comprehensive machine learning framework designed for the construction, training, and deployment of complex mathematical models. It utilizes a graph-based execution model that represents operations as directed acyclic graphs, enabling automatic differentiation and efficient parallel processing. The system provides high-level interfaces for defining neural network architectures, alongside a robust engine for managing multidimensional array structures and tensor mathematics. The framework distinguishes itself through a scalable distributed runtime that orchestrates workloads acr
C++, Frameworks, Deferred-Execution Symbolic Graphs, Distributed Training Frameworks

RapidAI/RapidOCR (5,968 stars)
RapidOCR is an offline deep-learning OCR engine that detects and recognizes text in images using ONNX Runtime, operating entirely without an internet connection. It provides a unified inference pipeline that runs across multiple platforms including Windows, Linux, macOS, Android, and Raspberry Pi, with programming language bindings for Python, C++, Java, and C#. The engine separates text detection and recognition into independent modules that can be swapped or fine-tuned individually, and abstracts the inference backend behind a unified interface allowing seamless switching between ONNX Runti
Python, OCR Pipelines, Cross-Platform Offline OCR, Multi-Language Recognition Models

zsdonghao/tensorlayer (7,384 stars)
Tensorlayer is a deep learning framework and cross-backend AI library used to construct and execute neural network models. It serves as a scientific neural network toolkit providing customizable layers and architectures designed for research applications in science and engineering. The library enables multi-backend model execution, allowing the same model code to run across different deep learning frameworks, GPUs, and specialized AI accelerators. It includes a reinforcement learning library that provides both low-level and high-level tools for developing intelligent agents.
Python, Multi-Backend Abstractions, Backend-Agnostic Engines, Cross-Framework API Wrappers

google-ai-edge/mediapipe (35,660 stars)
MediaPipe is a cross-platform machine learning framework designed for deploying vision, audio, and text processing models across mobile, desktop, and web environments. It functions as an on-device inference engine that executes complex models locally on edge hardware, ensuring low latency and privacy without requiring a constant internet connection. The framework utilizes a graph-based pipeline orchestration system where data flows through a directed network of modular calculators to ensure synchronized and deterministic processing. It distinguishes itself through a unified runtime that provi
C++, Machine Learning Frameworks, Cross-Platform Inference Frameworks, Model Deployment Frameworks

huggingface/open-r1 (26,326 stars)
Open-r1 is a framework designed for the large-scale training, distillation, and optimization of language models focused on complex reasoning and programming tasks. It provides a comprehensive suite of tools for managing distributed training jobs across multi-node clusters, enabling the development of high-performance models through reinforcement learning and supervised fine-tuning. The project distinguishes itself by integrating secure, containerized code execution environments directly into the training and evaluation lifecycle. By allowing models to run and verify code snippets against test
Python, Code-Integrated Training Frameworks, Large Scale Training Suites, Reasoning Model Training Suites

visenger/awesome-mlops (13,942 stars)
An awesome list of references for MLOps (Machine Learning Operations) 👉 ml-ops.org
Awesome List, Awesome Lists, Community Resources

aymericdamien/TensorFlow-Examples (43,749 stars)
This repository serves as a structured educational resource for machine learning and deep learning, providing a library of executable scripts and notebooks. It is designed to help users master the practical application of data processing, model evaluation, and neural network construction through annotated code samples and guided tutorials. The collection focuses on translating theoretical mathematical concepts into functional code, offering proven patterns for common tasks such as classification and regression. By providing curated examples of layer construction and training loops, the reposi
Jupyter Notebook, Automatic Differentiation Engines, Deep Learning Code Libraries, Tensor Processing Libraries

NousResearch/hermes-agent (195,049 stars)
Hermes-agent is an autonomous AI agent framework and runtime designed to execute complex tasks and synthesize new skills from execution traces. It includes a provider-agnostic gateway for routing requests across multiple model backends and a serverless runtime that suspends idle agent instances and resumes them on demand across containers and virtual machines. The project provides a desktop automation toolset that controls native GUI workflows on Linux by querying accessibility APIs and injecting input events. It further distinguishes itself with the ability to generate procedural skills from
Python, Autonomous Agent Frameworks, Autonomous Task Execution, Accessibility Tree Automation

deepseek-ai/DeepSeek-V3 (103,753 stars)
DeepSeek-V3 is a large language model that provides comprehensive resources for model utilization, including technical specifications, pre-trained weights, and evaluation benchmarks. The project details the core transformer architecture, including parameter counts and multi-token prediction modules, while supporting native 8-bit floating-point quantization. The repository offers extensive support for local and distributed inference through integration with multiple frameworks and engines. It includes documentation for deploying the model across various hardware configurations, such as GPUs an
Python, Model Weights, Inference Frameworks, Frontier Models

replicate/cog (9,424 stars)
Cog is a machine learning packaging tool and containerized model wrapper that bundles models and their dependencies into standardized Docker containers. It functions as an environment manager and inference server, ensuring consistent model execution across different hardware systems by resolving GPU drivers, system libraries, and Python dependencies. The project distinguishes itself by automatically generating RESTful HTTP servers and OpenAPI schemas based on defined model input and output types. It manages large model weights as external fixtures to optimize image size and utilizes a slot-ba
Go, Declarative Image Synthesis, Machine Learning Model Portability, Model Containerization Tools

ultralytics/ultralytics (58,468 stars)
Ultralytics is a comprehensive computer vision framework designed for training, validating, and deploying deep learning models across a wide range of visual recognition tasks. It provides a unified interface for core operations including object detection, instance segmentation, pose estimation, and image classification. By utilizing a modular architecture, the platform allows users to swap model components to balance inference speed and accuracy requirements for diverse applications. The framework distinguishes itself through its support for real-time processing and flexible deployment. It in
Python, Computer Vision, Model Training and Inference Engines, Computer Vision Training Frameworks

tracel-ai/burn (15,474 stars)
Burn is a deep learning framework designed for building, training, and deploying neural networks using a modular architecture. As a machine learning library built in Rust, it provides a backend-agnostic computational engine that enables the execution of models across diverse hardware, including central processors, graphics processors, and web runtimes. The framework distinguishes itself through a highly portable design that allows developers to maintain a single workflow for both training and inference across heterogeneous environments. It incorporates advanced optimization techniques such as
Rust, Deep Learning Frameworks, Backend-Agnostic Engines, Deep Learning Libraries

tensorflow/tfjs-examples (6,783 stars)
This repository provides a collection of practical demonstrations and implementation guides for machine learning tasks using TensorFlow.js. It serves as a resource for developers to explore model architectures, training workflows, and data manipulation techniques across domains such as computer vision, natural language processing, and reinforcement learning. The project covers the full lifecycle of machine learning development, including tensor-based mathematical operations, model construction via high-level layer APIs or low-level tensor logic, and model serialization for various storage med
JavaScript, Manual Memory Management, Core Model APIs, Model Execution APIs

logicalclocks/hopsworks (1,302 stars)
Hopsworks - Data-Intensive AI platform with a Feature Store
Java, Data Science Tools, Deep Learning Frameworks, Feature Stores

karpathy/nanoGPT (59,730 stars)
nanoGPT is a lightweight engine for training and fine-tuning transformer-based language models from scratch. It provides a minimalist codebase designed for educational exploration and rapid experimentation with neural network architectures, utilizing self-attention and feed-forward layers to process sequences and predict subsequent elements. The project distinguishes itself through a focus on high-speed data ingestion and hardware-accelerated performance. It includes a dedicated pipeline for transforming raw text into memory-mapped binary files, which enables efficient streaming during traini
Python, Transformer, Generative Text Inference, Large Language Model Training Frameworks

pypl0/Ombre (6 stars)
Open source AI infrastructure layer. Eight agents run on every request: security, caching, memory, hallucination detection, tamper-proof audit trail. One line of code. Your data never leaves your server.
Python, MLOps and Infrastructure

meta-llama/llama (59,464 stars)
Llama is a computational framework and runtime environment designed for executing transformer-based neural networks locally. It functions as a generative AI inference engine, enabling the processing of input sequences through pre-trained model weights to produce text completions and structured data outputs directly on your own hardware. The system distinguishes itself through specialized memory and computation management techniques, including memory-mapped weight loading and quantization-aware inference, which allow for efficient execution on standard consumer hardware. It utilizes a stateles
Python, Inference Engines, Large Language Model Runtimes, Local Inference Engines

gojek/feast (7,095 stars)
Feast is a machine learning feature store and MLOps data infrastructure layer. It provides a centralized system for managing and serving features across offline training and online production environments, utilizing an online feature serving layer for low-latency retrieval. The project centers on a feature registry that acts as a central catalog for defining, governing, and discovering feature services. It employs a unified data access layer to decouple feature retrieval from physical storage and includes a point-in-time data generator to create historically accurate training datasets that pr
Python, Feature Stores, Feature Definition Consistency, Feature Management

QwenLM/Qwen3 (27,324 stars)
Qwen3 is a transformer-based large language model designed as a generative AI foundation for understanding, reasoning, and generating human language. It functions as a comprehensive ecosystem for model training, fine-tuning, and production-ready inference, providing the underlying architecture and weights necessary to build diverse artificial intelligence applications. The project distinguishes itself through extensive support for model quantization and distributed inference, enabling efficient execution across a wide range of hardware from consumer-grade devices to scalable cloud infrastruct
Python, Generative AI Foundations, Large Language Models, Model Training Frameworks

polyaxon/polyaxon (3,707 stars)
AI Infra / AI Orchestration / AI Control Plane
MDX, Deep Learning Ecosystems, Deep Learning Frameworks, Distributed Frameworks

ultralytics/yolov5 (57,528 stars)
YOLOv5 is a comprehensive computer vision framework designed for end-to-end deep learning, specializing in real-time object detection, image classification, and instance segmentation. It provides a unified toolkit that manages the entire lifecycle of a model, from initial dataset configuration and hyperparameter tuning to high-speed inference and deployment. The framework utilizes a modular neural architecture, allowing users to swap backbone and head components to tailor models for specific visual tasks. What distinguishes this project is its focus on production-ready deployment and model ef
Python, Computer Vision, Object Detection, Real-Time

feast-dev/feast (6,727 stars)
Feast is an open-source feature store for machine learning that provides a central platform for defining, storing, and serving features across both training and inference workflows. It operates as a declarative system where feature definitions are written as code in Python files, synchronized to a central registry, and made available for low-latency online retrieval or point-in-time correct historical joins for training datasets. The project abstracts storage behind a pluggable architecture, allowing offline and online backends to be swapped without changing retrieval logic, and coordinates ma
Python, Feature Stores, Access Control, Access Control and RBAC

huggingface/pytorch-image-models (36,893 stars)
This project is a comprehensive library of state-of-the-art neural network architectures designed for image classification and feature extraction. It provides a complete deep learning training framework that supports distributed execution, allowing users to build, train, and fine-tune vision models using optimized schedulers and pre-configured training recipes. The library distinguishes itself through a modular backbone architecture that treats neural networks as decoupled feature extractors, enabling the retrieval of multi-scale outputs for downstream tasks like object detection and segmenta
Python, Computer Vision Models, Computer Vision Training, Distributed Training Frameworks

ml-tooling/ml-workspace (3,540 stars)
🛠 All-in-one web-based IDE specialized for machine learning and data science.
Jupyter Notebook, AI and Machine Learning, Data Science Tooling, Data Science Tools

unslothai/unsloth (66,628 stars)
Unsloth is a high-performance training and inference platform designed to optimize the lifecycle of large language and multimodal models. It provides a comprehensive engine for fine-tuning, executing, and managing models locally, with a focus on reducing memory consumption and increasing compute speed on consumer-grade hardware. The platform distinguishes itself through hand-optimized kernels and automated computational graph techniques that maximize hardware throughput. It supports advanced training methodologies, including reinforcement learning for reasoning and efficient adapter-based fin
Python, Language Model Training, Custom Kernel Accelerators, Efficient Training Pipelines

iterative/dvc (15,680 stars)
DVC is a data versioning tool and pipeline orchestrator designed to track large datasets and machine learning models. It functions as a system for managing large data artifacts by storing lightweight metadata in version control while keeping the actual binaries in a separate cache. The project serves as an experiment tracker and remote storage synchronizer, enabling the execution and comparison of machine learning iterations based on hyperparameters and performance metrics. It provides a bridge for pushing and pulling these large data artifacts between local environments and cloud or on-premi
Python, Dataset Versioning Systems, Pointer-Based Tracking, Content-Addressable Storage

vllm-project/vllm (83,048 stars)
vLLM is a high-throughput inference engine designed for the efficient serving and execution of large language models. It functions as a production-ready distributed model server, providing standard API protocols for online serving while also supporting offline batch processing. The system is built to maximize token generation speed and memory efficiency, enabling both large-scale cloud deployments and local execution on personal hardware. The project distinguishes itself through advanced memory management and request scheduling techniques, most notably its use of non-contiguous key-value cach
Python, Continuous Batching Strategies, Custom Model Execution Engines, Distributed Model Servers

minerva-ml/steppy (136 stars)
Lightweight Python library for fast and reproducible experimentation 🔬
Python, Data Science Tooling, Data Science Tools, MLOps and Pipelines

jingyaogong/minimind (51,834 stars)
This project is a comprehensive framework for the entire lifecycle of transformer-based language models, supporting everything from foundational pretraining to specialized deployment. It provides a modular toolkit for defining neural network architectures, managing data preparation pipelines, and executing training routines across various scales. The framework is designed to handle the full model development process, including supervised fine-tuning, behavioral alignment, and the integration of agentic capabilities. What distinguishes this framework is its focus on efficient training and adva
Python, Model Training Toolkits, Agentic Frameworks, Agentic Training Frameworks

minerva-ml/steppy-toolkit (23 stars)
Curated set of transformers that make your work with steppy faster and more effective 🔭
Python, Data Science Tooling, Data Science Tools, MLOps and Pipelines

deepspeedai/DeepSpeed (42,528 stars)
DeepSpeed is a high-performance library designed to scale deep learning model training and inference across massive clusters of GPUs and compute nodes. It provides a comprehensive suite of tools for distributed training, enabling the execution of models that exceed the memory capacity of single devices through advanced parameter partitioning, pipeline-based model parallelism, and memory-efficient state offloading. The framework distinguishes itself through specialized communication-efficient optimizers and hardware-aware acceleration techniques. By utilizing gradient compression, quantization
Python, Distributed Memory Optimizers, Distributed Training Frameworks, Distributed Training Optimizers

asavinov/lambdo (26 stars)
Feature engineering and machine learning: together at last!
Python, Data Science Tooling, Data Science Tools, MLOps and Pipelines

scikit-learn/scikit-learn (66,344 stars)
Scikit-learn is a machine learning library for predictive data analysis that provides a collection of algorithms for supervised and unsupervised learning. It functions as a comprehensive toolkit for data preprocessing, dimensionality reduction, and model selection, allowing users to classify data objects, predict continuous values, and cluster similar items based on historical patterns. The project is defined by a unified interface design where objects either learn from data, transform data, or chain these operations into sequential workflows. To ensure performance on large or high-dimensiona
Python, Dimensionality Reduction Engines, Frameworks, Pipeline Patterns

dslp/dslp (527 stars)
The Data Science Lifecycle Process is a process for taking data science teams from Idea to Value repeatedly and sustainably. The process is documented in this repo.
Data Science Tooling, Data Science Tools, MLOps and Pipelines

mlabonne/llm-course (80,178 stars)
This project is a comprehensive educational curriculum and engineering handbook focused on the lifecycle of large language models. It serves as a structured knowledge base for machine learning practitioners, covering the fundamental mathematical and architectural principles of transformer-based sequence modeling, as well as the practical implementation of supervised instruction fine-tuning and preference-based model alignment. The repository distinguishes itself by providing a deep dive into advanced model composition and optimization techniques. It details methodologies for weight-space mode
AI Research Repositories, Awesome List, Fine-Tuning Strategies

modular/modular (26,357 stars)
Modular is a unified machine learning development platform designed for building, compiling, and deploying high-performance neural network models. It provides a comprehensive execution engine that supports both local and production-grade inference, enabling developers to manage the entire model lifecycle from initial architecture definition to scalable, containerized service deployment. The platform distinguishes itself through a hardware-agnostic runtime that abstracts diverse silicon architectures, allowing models to execute efficiently across varied compute environments. It includes a spec
Mojo, Generative AI Frameworks, Inference Runtimes, Local Model Servers
