10 repositorios
Custom implementations of grid-like structures for storing and retrieving tabular data.
Distinct from Data Grids: Shortlist candidates focus on UI data grids or high-level CSV processing, not the underlying data structure for ML
Explore 10 awesome GitHub repositories matching data & databases · Tabular Data Structures. Refine with filters or upvote what's useful.
GoLearn is a machine learning library for the Go programming language. It provides a supervised learning framework and a toolkit for building, training, and evaluating predictive models through a standardized interface. The project implements a data frame system that loads CSV files into structured grids for matrix operations. It includes a preprocessing library for discretizing continuous variables and a model evaluation toolkit that utilizes confusion matrices and cross-validation to measure precision and recall. The library covers data engineering and management, including the ability to
Implements specialized data grid structures to handle unique requirements for storing and retrieving information.
This project is an educational resource and a collection of instructional materials for performing data manipulation and statistical analysis using Python. It provides a comprehensive set of guides and code examples for using the Pandas, NumPy, and Matplotlib libraries to analyze structured data. The resource includes a dedicated guide for reshaping, cleaning, and aggregating tabular data and time series via Pandas, alongside a reference for high-performance vectorized operations and linear algebra using NumPy. It also features tutorials for creating publication-quality charts, distribution p
Supports the definition of arrays that store heterogeneous data with named fields and specific types.
VisiData is a terminal-based interactive data analysis tool and browser designed for exploring, filtering, and sorting large tabular datasets. It functions as a structured data inspector that loads and flattens complex formats like JSON, XML, and PCAP into interactive sheets, as well as a terminal file manager for navigating directories and performing staged filesystem operations. The project distinguishes itself by rendering data visualizations, such as scatter plots and histograms, directly in the terminal using Unicode Braille characters. It provides a Python-based data wrangling environme
Initializes empty tabular structures for manual data organization and collection.
This project is a collection of educational notes and tutorials focused on Python programming, scientific computing, and data analysis. It serves as a reference for learning language basics, advanced techniques, and object-oriented design. The materials include implementation guides for building linear, logistic, and convolutional neural networks using symbolic graph frameworks. It also provides instruction on manipulating and visualizing structured data frames and performing complex mathematical operations through numerical libraries. The repository includes a system for converting interact
Covers the organization of complex datasets into row-and-column formats for efficient analysis.
csvkit is a composable Unix-style command-line toolkit for converting, filtering, and analyzing CSV files directly from the terminal. It provides a suite of focused single-purpose commands that can be combined via pipes to build complex data processing workflows, with a modular architecture that includes a column-type inference engine for automatically detecting data types and a streaming-pipeline design for efficient handling of tabular data. The toolkit distinguishes itself through its SQL-engine abstraction layer, which allows users to run SQL queries directly against CSV files without req
Converts between CSV and other tabular formats such as Excel, JSON, and SQL, preserving column types and structure.
WorkerVless2sub is a serverless proxy subscription manager that runs on Cloudflare's edge network, generating optimized VMess, VLESS, and Trojan proxy links. It operates as a subscription generator that combines a default node pool with IP-replacement optimization to produce high-performance proxy configurations. The project distinguishes itself through its environment-variable-driven configuration system, allowing node parameters like host, UUID, and password to be set without code changes. It provides multiple subscription delivery methods, including a token-based quick endpoint for pre-con
Exporting proxy subscriptions in Clash or Sing-box configuration format for direct use in compatible proxy clients.
ACL4SSR is a rule set generation and conversion system that produces ready-to-use proxy configuration files for multiple client formats, including Clash, Quantumult, Surge, Surfboard, and Trojan. It combines modular rule fragments from a curated library, applies user-defined overrides, and filters traffic by domain category or geo-IP to produce optimized routing configurations. The project distinguishes itself through its format-agnostic conversion pipeline, which transforms proxy rule sets between different client formats using external open-source converters as a middleware layer. It provid
Transforms proxy rule sets between Clash, Quantumult, Surge, Surfboard, and Trojan formats using external converters.
Danfo.js es una biblioteca de análisis y preprocesamiento de datos para JavaScript que proporciona estructuras de datos etiquetadas de alto rendimiento. Implementa dataframes y series para permitir análisis de datos complejos, computación estadística y la manipulación de datos tabulares estructurados. El proyecto sirve como una biblioteca de preprocesamiento para machine learning, ofreciendo utilidades para codificación de etiquetas categóricas, one-hot encoding y escalado y estandarización de características numéricas. Facilita específicamente la conversión de estructuras de datos etiquetadas en tensores para el entrenamiento y evaluación de modelos. La biblioteca cubre un amplio conjunto de capacidades, incluyendo estadísticas descriptivas, operaciones relacionales como merge y join, y procesamiento de series temporales. Incluye herramientas para limpieza, filtrado y agrupación de datos, así como una interfaz de visualización para generar gráficos interactivos directamente desde los dataframes. El sistema soporta la importación y exportación de datos mediante formatos CSV, JSON y Excel.
Implements high-performance labeled data structures for managing relational data with column mutation and label alignment.
clashN is a cross-platform proxy manager and graphical user interface client for the Clash proxy core. It serves as a visual wrapper that enables the configuration of network tunnels and the routing of internet traffic to bypass network restrictions on desktop operating systems. The application functions by decoupling the management interface from the network engine, using a local API to dispatch configuration updates and control the proxy core. It includes a subscription converter that translates external provider profiles into standardized configuration schemas compatible with the underlyin
Accepts proxy subscription data in multiple input formats and outputs a unified Clash-compatible configuration.
Script-Hub is a system for translating proxy rule sets and converting configuration modules between different network proxy environments. It serves as a translator and converter that enables cross-platform compatibility for rewrite rules and script-based logic. The project includes a script interop layer that normalizes header casing and handles execution errors to ensure scripts run consistently across different tools. It also provides a remote configuration manager for importing, updating, and modifying proxy modules and parameters via web URLs. The software covers a range of capabilities
Transforms proxy rule sets from various formats into a single compatible version for a target application.