# cleanlab/cleanlab

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/cleanlab-cleanlab).**

11,513 stars · 902 forks · Python · Apache-2.0

## Links

- GitHub: https://github.com/cleanlab/cleanlab
- Homepage: https://cleanlab.ai
- awesome-repositories: https://awesome-repositories.com/repository/cleanlab-cleanlab.md

## Topics

`active-learning` `annotation` `anomaly-detection` `data-annotation` `data-centric-ai` `data-cleaning` `data-curation` `data-labeling` `data-profiling` `data-quality` `data-science` `data-validation` `datasets` `exploratory-data-analysis` `labeling` `machine-learning` `noisy-labels` `out-of-distribution-detection` `outlier-detection` `weak-supervision`

## Description

Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

## Tags

### Part of an Awesome List

- [Data Centric AI](https://awesome-repositories.com/f/awesome-lists/ai/data-centric-ai.md) — Identifies and cleans noisy labels in real-world datasets.
- [General Machine Learning](https://awesome-repositories.com/f/awesome-lists/ai/general-machine-learning.md) — Package for data quality and learning with noisy labels.
- [Machine Learning Libraries](https://awesome-repositories.com/f/awesome-lists/ai/machine-learning-libraries.md) — Data-centric AI for identifying label errors.
- [Data Annotation and Synthesis](https://awesome-repositories.com/f/awesome-lists/data/data-annotation-and-synthesis.md) — Library for detecting mislabeled data and data-centric AI.
- [Data Processing and Analysis](https://awesome-repositories.com/f/awesome-lists/data/data-processing-and-analysis.md) — Data-centric AI for detecting dataset issues.
- [Data Validation](https://awesome-repositories.com/f/awesome-lists/data/data-validation.md) — Library for identifying issues in real-world datasets.
