2 مستودعات
Applies per-column options such as date format parsing, null-value substitution, and whitespace trimming as data is read.
Distinct from Field Transformations: Distinct from Field Transformations: focuses on transformations applied specifically during CSV loading, not general field transformations.
Explore 2 awesome GitHub repositories matching data & databases · CSV. Refine with filters or upvote what's useful.
csvkit is a composable Unix-style command-line toolkit for converting, filtering, and analyzing CSV files directly from the terminal. It provides a suite of focused single-purpose commands that can be combined via pipes to build complex data processing workflows, with a modular architecture that includes a column-type inference engine for automatically detecting data types and a streaming-pipeline design for efficient handling of tabular data. The toolkit distinguishes itself through its SQL-engine abstraction layer, which allows users to run SQL queries directly against CSV files without req
Displays column names, data types, and summary statistics of a CSV file to understand its contents.
pgloader is a command-line tool that automates the migration of data and schema from various source databases and file formats into PostgreSQL. It combines schema discovery, parallel data pipelines, and type casting into a single, declarative workflow, using PostgreSQL's COPY protocol for high-throughput bulk loading. The tool distinguishes itself by compiling a dedicated command language into concurrent reader-writer pipelines that handle schema introspection, data transformation, and error-resilient batch processing. It supports migrating entire databases from MySQL, MS SQL, SQLite, and Pos
Applies per-column options such as date format parsing, null-value substitution, and whitespace trimming during CSV loading.