# datacharmer/test_db

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/datacharmer-test-db).**

4,388 stars · 2,693 forks · Shell

## Links

- GitHub: https://github.com/datacharmer/test_db
- awesome-repositories: https://awesome-repositories.com/repository/datacharmer-test-db.md

## Description

test_db is a collection of tools for validating database integrity, benchmarking system throughput, and generating synthetic schemas and datasets. It includes a sample corporate employee database for MySQL, a SQL dataset generator for creating representative records, and an integrity validator that uses checksums and record counts to verify data consistency across different database engines.

The project provides a database performance benchmark consisting of complex queries and stored procedures designed to measure system response times and throughput. These tools simulate real-world workloads through multi-table joins and aggregations.

The toolset covers broad capability areas including database schema simulation, synthetic data generation, and data load testing. It specifically measures data load duration by calculating the elapsed time from the initial table creation to the final update.

## Tags

### Data & Databases

- [Database Integrity Verification](https://awesome-repositories.com/f/data-databases/database-integrity-verification.md) — Provides a tool that uses checksums and record counts to verify data consistency across different database engines. ([source](https://github.com/datacharmer/test_db/blob/master/test_employees_sha.sql))
- [Environment Simulation Schemas](https://awesome-repositories.com/f/data-databases/database-table-generation/environment-simulation-schemas.md) — Creates tables, views, and stored procedures to simulate corporate database environments for testing. ([source](https://github.com/datacharmer/test_db/blob/master/objects.sql))
- [Database Population Tools](https://awesome-repositories.com/f/data-databases/population-overrides/database-population-tools.md) — Provides tools for populating database tables with large, pre-configured datasets from external dumps.
- [Dataset Generation Scripts](https://awesome-repositories.com/f/data-databases/sql-generators/dataset-generation-scripts.md) — Provides a series of scripts that populate database tables with representative records to simulate real-world workloads.
- [Join Performance Benchmarking](https://awesome-repositories.com/f/data-databases/table-data-processing/table-joining-operations/join-performance-benchmarking.md) — Evaluates system throughput by executing complex aggregation queries that force heavy resource usage across multiple tables.
- [Logic Deployment](https://awesome-repositories.com/f/data-databases/database-deployment-tools/logic-deployment.md) — Enables the deployment of stored procedures and functions to simulate complex server-side business logic. ([source](https://github.com/datacharmer/test_db#readme))
- [Sample Databases](https://awesome-repositories.com/f/data-databases/sample-databases.md) — Ships a sample corporate employee database for MySQL used for testing and benchmarking.
- [SQL Script Execution](https://awesome-repositories.com/f/data-databases/sql-script-execution.md) — Executes predefined SQL scripts from external files to provision database tables and views.
- [Stored Procedures](https://awesome-repositories.com/f/data-databases/stored-procedures.md) — Deploys complex functions and stored procedures to simulate server-side business logic.

### Artificial Intelligence & ML

- [General Synthetic Data Generators](https://awesome-repositories.com/f/artificial-intelligence-ml/general-synthetic-data-generators.md) — Populates database tables with representative records to simulate real-world workloads for software testing.

### Part of an Awesome List

- [Sample Datasets](https://awesome-repositories.com/f/awesome-lists/data/sample-datasets.md) — Provides scripts and pre-defined datasets for populating database tables for performance benchmarking. ([source](https://github.com/datacharmer/test_db/blob/master/load_dept_manager.dump))
- [Load Testing](https://awesome-repositories.com/f/awesome-lists/devops/load-testing.md) — Evaluates the efficiency of data ingestion by measuring the time required to create schemas and populate tables.

### Software Engineering & Architecture

- [Environment Simulation](https://awesome-repositories.com/f/software-engineering-architecture/data-schema-validation/database-schema-validations/environment-simulation.md) — Creates tables, views, and stored procedures to build consistent environments for testing application logic.

### System Administration & Monitoring

- [Data Integrity Verification](https://awesome-repositories.com/f/system-administration-monitoring/data-integrity-verification.md) — Uses checksums and record counts to ensure datasets remain consistent across different database engines.

### Testing & Quality Assurance

- [Data Integrity Checksums](https://awesome-repositories.com/f/testing-quality-assurance/data-integrity-checksums.md) — Implements checksum-based verification to ensure data consistency across different database engines.
- [Query Performance Benchmarks](https://awesome-repositories.com/f/testing-quality-assurance/performance-testing-analysis/performance-diagnostics/performance-measurement/query-performance-benchmarks.md) — Measures system response times and throughput using complex queries and aggregations on standardized datasets.
- [Data Load Duration Measurement](https://awesome-repositories.com/f/testing-quality-assurance/data-load-duration-measurement.md) — Calculates the total elapsed time from initial table creation to final update to evaluate load efficiency.
- [Data Load Durations](https://awesome-repositories.com/f/testing-quality-assurance/performance-testing-analysis/performance-diagnostics/performance-measurement/query-performance-benchmarks/data-load-durations.md) — Calculates the total elapsed time from initial table creation to the final update to evaluate data ingestion efficiency. ([source](https://github.com/datacharmer/test_db/blob/master/show_elapsed.sql))
