What are the best open-source alternatives to MMLSpark?

30 open-source projects similar to azure/mmlspark, ranked by shared features. Top picks: microsoft/synapseml, catboost/catboost, rasbt/python-machine-learning-book-2nd-edition, lightgbm-org/lightgbm, modelscope/modelscope, open-mmlab/mmagic, hiyouga/easyr1, pytorch/torchtune, internlm/xtuner, intel-analytics/bigdl.

Is microsoft/synapseml a good alternative to MMLSpark?

SynapseML is an Apache Spark machine learning library designed for building and scaling machine learning workflows and data pipelines across distributed clusters. It serves as a distributed machine learning pipeline framework and a distributed inference engine for executing hardware-accelerated pre…

Is catboost/catboost a good alternative to MMLSpark?

CatBoost is a gradient boosting machine learning library used to train decision tree ensembles for regression, classification, and ranking tasks. It functions as a high-performance framework that provides a categorical data processor for transforming non-numeric features, a distributed trainer for…

Is rasbt/python-machine-learning-book-2nd-edition a good alternative to MMLSpark?

This project is a machine learning educational resource and implementation guide for Python. It provides a collection of executable code and notebooks that demonstrate predictive modeling, data analysis workflows, and the implementation of various machine learning algorithms. The repository featur…

Is lightgbm-org/lightgbm a good alternative to MMLSpark?

LightGBM is a gradient boosting framework used to train decision tree ensembles for classification, regression, and ranking tasks. It functions as a distributed machine learning library and a decision tree ensemble implementation that utilizes leaf-wise growth and histogram-based feature binning.…

Is modelscope/modelscope a good alternative to MMLSpark?

ModelScope is a comprehensive machine learning platform that functions as a model hub, training framework, inference engine, and cloud development environment. It provides a centralized repository for discovering, downloading, and managing pre-trained models and datasets across multiple modalities,…

Is open-mmlab/mmagic a good alternative to MMLSpark?

mmagic is a multimodal training pipeline and framework for generative AI, focusing on visual synthesis and restoration. It provides the infrastructure to build and train models for tasks such as text-to-image and text-to-video generation, 3D-aware content synthesis, and high-fidelity image translat…

Is hiyouga/easyr1 a good alternative to MMLSpark?

EasyR1 is a distributed model training system and reinforcement learning framework for large language and vision-language models. It functions as a multimodal trainer and an implementation of a Proximal Policy Optimization pipeline designed to refine the reasoning and perception capabilities of mod…

Is pytorch/torchtune a good alternative to MMLSpark?

Torchtune is a PyTorch-native library for fine-tuning, aligning, and quantizing large language models. It provides a configurable training pipeline orchestrated through YAML recipes, with CLI overrides and component swapping, distributed training via FSDP2, memory optimizations, and parameter-effic…

Is internlm/xtuner a good alternative to MMLSpark?

xtuner is a comprehensive training engine for large language models, offering a toolkit for pre-training, supervised fine-tuning, and the optimization of vision-language multimodal models. It serves as a distributed training accelerator and a specialized framework for scaling Mixture-of-Experts mod…

Is intel-analytics/bigdl a good alternative to MMLSpark?

BigDL is a PyTorch acceleration framework and distributed inference engine designed for large language models. It provides a toolkit for running models on Intel hardware, integrating quantization tools and libraries for parameter-efficient fine-tuning. The project distinguishes itself through the…


Open-source alternatives to MMLSpark

30 open-source projects similar to azure/mmlspark, ranked by how many features they have in common. Compare stars, activity, and what each one does to find the best MMLSpark alternative.

microsoft/SynapseML
5,230 stars
SynapseML is an Apache Spark machine learning library designed for building and scaling machine learning workflows and data pipelines across distributed clusters. It serves as a distributed machine learning pipeline framework and a distributed inference engine for executing hardware-accelerated predictions and deep learning tasks on large-scale datasets. The project functions as a cloud AI integration layer, allowing users to apply pretrained artificial intelligence services for text, vision, and speech within distributed pipelines. It also includes a dedicated suite of tools for distributed
Scala, ai, apache-spark, azure
catboost/catboost
8,808 stars
CatBoost is a gradient boosting machine learning library used to train decision tree ensembles for regression, classification, and ranking tasks. It functions as a high-performance framework that provides a categorical data processor for transforming non-numeric features, a distributed trainer for large-scale datasets, and GPU acceleration to speed up model construction. The library distinguishes itself through native handling of categorical data and text features, removing the need for manual encoding. It includes a specialized model interpretability tool that leverages SHAP values and featu
C++, big-data, catboost, categorical-features
rasbt/python-machine-learning-book-2nd-edition
7,194 stars
This project is a machine learning educational resource and implementation guide for Python. It provides a collection of executable code and notebooks that demonstrate predictive modeling, data analysis workflows, and the implementation of various machine learning algorithms. The repository features practical examples of classification, regression, and clustering tasks using Scikit-Learn, alongside tutorials for building and training deep learning architectures with TensorFlow. These include implementations of convolutional and recurrent networks. The content covers a broad range of capabili
Jupyter Notebook, data-science, deep-learning, machine-learning

Open-source alternatives to MMLSpark

microsoft/SynapseML

catboost/catboost

rasbt/python-machine-learning-book-2nd-edition

lightgbm-org/LightGBM

modelscope/modelscope

open-mmlab/mmagic

hiyouga/EasyR1

pytorch/torchtune

InternLM/xtuner

intel-analytics/BigDL

Angel-ML/angel

fastai/fastai

jakevdp/PythonDataScienceHandbook

verl-project/verl

baidu/paddle

deepjavalibrary/djl

NVIDIA/Isaac-GR00T

PaddlePaddle/Serving

lyhue1991/eat_tensorflow2_in_30_days

mosaicml/composer

pycaret/pycaret

facebookresearch/flashlight

NangoHQ/nango

pytorch/vision

datawhalechina/so-large-lm

VowpalWabbit/vowpal_wabbit

microsoft/ai-edu

zihangdai/xlnet

amznlabs/amazon-dsstne

zai-org/ChatGLM3