# awslabs/deequ

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/awslabs-deequ).**

3,621 stars · 586 forks · Scala · Apache-2.0

## Links

- GitHub: https://github.com/awslabs/deequ
- awesome-repositories: https://awesome-repositories.com/repository/awslabs-deequ.md

## Description

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
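To make the "unit tests for data" idea concrete, here is a minimal sketch modelled on the `VerificationSuite` and `Check` usage shown in the Deequ README. The object name `DeequSketch`, the helper `verify`, the column names, and the sample rows are hypothetical and chosen only for illustration. It assumes Spark and a Deequ release matching your Spark version are on the classpath.

```scala
import com.amazon.deequ.{VerificationResult, VerificationSuite}
import com.amazon.deequ.checks.{Check, CheckLevel, CheckStatus}
import org.apache.spark.sql.{DataFrame, SparkSession}

object DeequSketch {

  // Hypothetical helper: declares a few constraints and evaluates them on `df`.
  def verify(df: DataFrame): VerificationResult =
    VerificationSuite()
      .onData(df)
      .addCheck(
        Check(CheckLevel.Error, "basic item checks")
          .hasSize(_ == 3)            // expect exactly three rows
          .isComplete("id")           // no NULLs in `id`
          .isUnique("id")             // no duplicate ids
          .isComplete("productName")) // no NULLs in `productName`
      .run()

  def main(args: Array[String]): Unit = {
    // Local Spark session, for illustration only.
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("deequ-sketch")
      .getOrCreate()
    import spark.implicits._

    // Hypothetical sample data.
    val items = Seq(
      (1L, "Thingy A"),
      (2L, "Thingy B"),
      (3L, "Thingy C")
    ).toDF("id", "productName")

    val result = verify(items)

    if (result.status == CheckStatus.Success) println("All checks passed")
    else println("Some checks failed")

    spark.stop()
  }
}
```

Each constraint is declared up front and evaluated as Spark jobs over the whole dataset, so a failing check points to a property of the data rather than of the code that produced it.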

## Tags

### Part of an Awesome List

- [Data Interfaces](https://awesome-repositories.com/f/awesome-lists/data/data-interfaces.md) — Library for defining unit tests for data.
- [Data Validation and Anomaly Detection](https://awesome-repositories.com/f/awesome-lists/data/data-validation-and-anomaly-detection.md) — Data quality testing library for large Apache Spark datasets.
