2 repositorios
Capabilities to run shell commands for each row in a dataset and ingest the resulting output as new columns.
Distinct from Shell Command Interpolations: This is a data wrangling feature using the shell as a function, distinct from general shell automation or parameter interpolation.
Explore 2 awesome GitHub repositories matching data & databases · Shell-Augmented Data Processing. Refine with filters or upvote what's useful.
VisiData is a terminal-based interactive data analysis tool and browser designed for exploring, filtering, and sorting large tabular datasets. It functions as a structured data inspector that loads and flattens complex formats like JSON, XML, and PCAP into interactive sheets, as well as a terminal file manager for navigating directories and performing staged filesystem operations. The project distinguishes itself by rendering data visualizations, such as scatter plots and histograms, directly in the terminal using Unicode Braille characters. It provides a Python-based data wrangling environme
Augments tabular data by executing shell commands for each row and capturing the output as new columns.
This project is a high-performance tabular data processing framework for R, designed to handle massive datasets with memory efficiency and speed. It provides an enhanced data structure that utilizes reference semantics and in-place modification to perform complex transformations without the overhead of unnecessary object copying. The library distinguishes itself through its low-level architectural optimizations, including multi-threaded parallel processing, radix-based sorting, and memory-mapped file parsing. By offloading critical data manipulation and aggregation routines to compiled C code
Executes command-line utilities on input files to filter or transform data before loading it into the environment.