What are the best open-source alternatives to Cog?

30 open-source projects similar to replicate/cog, ranked by shared features. Top picks: huggingface/text-generation-inference, microsoft/onnxruntime, tensorflow/serving, tingsongyu/pytorch-tutorial-2nd, triton-inference-server/server, mlflow/mlflow, ludwig-ai/ludwig, sgl-project/sglang, pytorch/serve, paddlepaddle/paddledetection.

Is huggingface/text-generation-inference a good alternative to Cog?

Text Generation Inference is a production-ready engine designed for the deployment and serving of large language models. It functions as a containerized runtime environment that manages model execution, scales across distributed hardware, and provides high-performance inference capabilities for dem…

Is microsoft/onnxruntime a good alternative to Cog?

This project is a cross-platform machine learning inference engine designed to execute pre-trained models across diverse operating systems and hardware environments. It functions as a standardized execution framework that manages the entire lifecycle of model inference, from loading and graph optim…

Is tensorflow/serving a good alternative to Cog?

TensorFlow Serving is a high-performance machine learning inference server designed to deploy TensorFlow models to production environments. It functions as a complete serving system that executes predictions on input data through a graph executor, providing network endpoints that eliminate the need…

Is tingsongyu/pytorch-tutorial-2nd a good alternative to Cog?

This project is a comprehensive instructional resource and course for building neural networks using PyTorch. It covers the fundamental building blocks of deep learning, including tensor manipulation, automatic differentiation, and the construction of modular neural network components. The reposit…

Is triton-inference-server/server a good alternative to Cog?

Triton Inference Server is a high-performance server designed to deploy machine learning models from multiple frameworks across GPUs and CPUs. It functions as a hardware-accelerated inference engine and a gRPC inference gateway, providing a standardized communication layer for transmitting binary t…

Is mlflow/mlflow a good alternative to Cog?

mlflow/mlflow is an open-source alternative to Cog.

Is ludwig-ai/ludwig a good alternative to Cog?

Ludwig is a multimodal machine learning platform and low-code framework designed for building, training, and deploying neural networks. It enables the construction of models that process text, images, audio, and tabular data through a unified interface using declarative configuration files rather t…

Is sgl-project/sglang a good alternative to Cog?

Sglang is a high-performance inference engine and serving system designed for large language and multimodal models. It provides a programmable interface for orchestrating complex generation workflows, enabling developers to coordinate multi-turn dialogues, tool invocations, and reasoning chains thr…

Is pytorch/serve a good alternative to Cog?

This project is a PyTorch model serving framework designed to deploy and scale machine learning models in production via scalable network endpoints. It functions as a high-performance inference server, optimizer, and model lifecycle manager that handles model loading, request batching, and hardware…

Is paddlepaddle/paddledetection a good alternative to Cog?

PaddleDetection is an object detection framework designed for the end-to-end development, training, and deployment of computer vision models. It provides a comprehensive library of modular neural network architectures and pipelines that support object detection, instance segmentation, and multi-obj…

Back to replicate/cog

Open-source alternatives to Cog

30 open-source projects similar to replicate/cog, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Cog alternative.

huggingface/text-generation-inference
huggingface/text-generation-inference
10,775View on GitHub
Text Generation Inference is a production-ready engine designed for the deployment and serving of large language models. It functions as a containerized runtime environment that manages model execution, scales across distributed hardware, and provides high-performance inference capabilities for demanding production environments. The project distinguishes itself through advanced optimization techniques, including continuous batching to maximize hardware utilization and tensor parallelism to shard large models across multiple accelerator cards. It supports efficient inference through custom com
Pythonbloomdeep-learningfalcon
View on GitHub10,775
microsoft/onnxruntime
microsoft/onnxruntime
19,347View on GitHub
This project is a cross-platform machine learning inference engine designed to execute pre-trained models across diverse operating systems and hardware environments. It functions as a standardized execution framework that manages the entire lifecycle of model inference, from loading and graph optimization to hardware-accelerated execution and generative sequence management. The runtime distinguishes itself through a highly modular architecture that decouples model logic from hardware-specific kernels. By utilizing an execution provider abstraction, it enables developers to offload computation
C++ai-frameworkdeep-learninghardware-acceleration
View on GitHub19,347
tensorflow/serving
tensorflow/serving
6,351View on GitHub
TensorFlow Serving is a high-performance machine learning inference server designed to deploy TensorFlow models to production environments. It functions as a complete serving system that executes predictions on input data through a graph executor, providing network endpoints that eliminate the need for a separate runtime environment for client applications. The system is distinguished by its model version manager, which organizes and selects specific model versions within a directory hierarchy. It uses a filesystem watcher to detect new model versions and trigger automatic updates without int
C++
View on GitHub6,351

Open-source alternatives to Cog

huggingface/text-generation-inference

microsoft/onnxruntime

tensorflow/serving

TingsongYu/PyTorch-Tutorial-2nd

triton-inference-server/server

mlflow/mlflow

ludwig-ai/ludwig

sgl-project/sglang

pytorch/serve

PaddlePaddle/PaddleDetection

openvinotoolkit/openvino

d2l-ai/d2l-en

meta-llama/llama-models

aws-powertools/powertools-lambda-python

riffusion/riffusion-hobby

graviraja/MLOps-Basics

NixOS/nix.dev

deepjavalibrary/djl

bentoml/OpenLLM

naklecha/llama3-from-scratch

llmware-ai/llmware

RunanywhereAI/runanywhere-sdks

google-ai-edge/LiteRT-LM

bentoml/BentoML

modelscope/ms-swift

crmne/ruby_llm

mlc-ai/web-llm

OpenBMB/MiniCPM

maiot-io/zenml

dusty-nv/jetson-inference