2 repos
Utilities that transform raw information into structured, normalized formats suitable for machine learning workflows.
Explore 2 awesome GitHub repositories matching data & databases · Data Preprocessing Utilities. Refine with filters or upvote what's useful.
Scikit-learn is a machine learning library for predictive data analysis that provides a collection of algorithms for supervised and unsupervised learning. It functions as a comprehensive toolkit for data preprocessing, dimensionality reduction, and model selection, allowing users to classify data objects, predict conti
Unsloth is a high-performance training and inference platform designed to optimize the lifecycle of large language and multimodal models. It provides a comprehensive engine for fine-tuning, executing, and managing models locally, with a focus on reducing memory consumption and increasing compute speed on consumer-grade