1 repo
High-performance utilities for importing large volumes of data into storage systems.
Distinguishing note: Focuses on high-throughput bulk operations rather than individual row insertion or schema management.
Explore 1 awesome GitHub repository matching data & databases · Bulk Data Ingestion. Refine with filters or upvote what's useful.
DuckDB is an in-process analytical database engine designed to run directly within an application process. As a zero-dependency, embedded system, it provides enterprise-grade SQL data processing capabilities without the overhead of managing a dedicated database server. It is built to handle complex analytical and aggregation tasks by storing and retrieving information in columns, allowing for high-performance relational data manipulation. The engine distinguishes itself through a columnar vectorized execution model that maximizes CPU cache efficiency during query operations. It employs adapti
Supports efficient batch operations to load large volumes of data while bypassing row-level overhead.