1 dépôt
Applications specifically designed to process, clean, and manipulate large datasets.
Distinct from Java: Candidates focus on Java as a language or specific JVM implementations, not the application domain of data processing.
Explore 1 awesome GitHub repository matching data & databases · Data Processing Applications. Refine with filters or upvote what's useful.
OpenRefine is a data cleaning tool and wrangling platform used to transform raw, messy datasets into consistent and structured formats. It operates as a Java-based data processor that runs a local server and provides a web browser interface for managing and manipulating data. The platform includes a data reconciliation engine for matching local entries against external knowledge bases to standardize entities. It also functions as a web data augmentation tool, allowing users to fetch and integrate information from external web sources to enrich their datasets. The system provides a transforma
Operates as a Java-based data processor that manages and manipulates data through a web browser.