2 Repos
Generation of tables that count occurrences of distinct values to identify data distribution.
Distinct from Frequency Table Generation: Focuses on exploratory data analysis rather than counting for lossless data compression.
Explore 2 awesome GitHub repositories matching data & databases · Frequency Table Analysis. Refine with filters or upvote what's useful.
VisiData is a terminal-based interactive data analysis tool and browser designed for exploring, filtering, and sorting large tabular datasets. It functions as a structured data inspector that loads and flattens complex formats like JSON, XML, and PCAP into interactive sheets, as well as a terminal file manager for navigating directories and performing staged filesystem operations. The project distinguishes itself by rendering data visualizations, such as scatter plots and histograms, directly in the terminal using Unicode Braille characters. It provides a Python-based data wrangling environme
Generates tables showing the count of distinct values for a column to identify distribution.
Xan is a command-line tool and data transformation engine for processing CSV, TSV, and JSONL datasets. It functions as a processor for compressed files, enabling random access and seeking within gzipped and Zstd files, and serves as a converter for specialized bioinformatics data formats. The tool handles large datasets without requiring full memory loads by utilizing stream-based processing. It provides capabilities for merging, sorting, and deduplicating massive files, as well as converting data between various tabular formats. The project covers a broad range of data wrangling and analysi
Generates tables that count occurrences of distinct values to identify data distributions within columns.