awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Data Transformation · Awesome GitHub Repositories

24 repos

Awesome GitHub RepositoriesData Transformation

Tools and utilities for modifying, restructuring, or converting raw data into desired formats and schemas.

Explore 24 awesome GitHub repositories matching data & databases · Data Transformation. Refine with filters or upvote what's useful.

  1. Home
  2. Data & Databases
  3. Data Processing Pipelines
  4. Data Transformation

Awesome Data Transformation GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • d2l-ai/d2l-zh

    d2l-ai/d2l-zh

    75,708GitHubView on GitHub↗

    This project is an open-source, interactive educational platform designed to teach deep learning through a comprehensive, code-first curriculum. It provides a structured learning path that covers foundational mathematics, modern neural network architectures, and practical optimization techniques, enabling practitioners

    Pythonbookchinesecomputer-vision
  • tesseract-ocr/tesseract

    tesseract-ocr/tesseract

    72,460GitHubView on GitHub↗

    Tesseract is a neural network-based optical character recognition engine designed to convert scanned images and digital documents into machine-readable, searchable text. It functions as both a command-line utility for automating large-scale digitization workflows and a cross-platform library that can be embedded into d

    C++hacktoberfestlstmmachine-learning
  • angular/angular.js

    angular/angular.js

    58,970GitHubView on GitHub↗

    AngularJS is a structural framework for building dynamic web applications by extending standard HTML with custom tags and attributes. It operates as a client-side template engine that transforms declarative markup into interactive components, organizing application logic through a model-view-controller pattern. By util

    JavaScript
  • vuejs/core

    vuejs/core

    53,019GitHubView on GitHub↗

    Vue is a progressive JavaScript framework designed for building modular, reactive user interfaces. It utilizes a component-based architecture that allows developers to encapsulate logic, templates, and styles into reusable units. At its core, the framework employs a virtual DOM renderer and a proxy-based reactivity sys

    TypeScript
Prev12Next

Explore sub-tags

  • Array and Tensor Manipulation3 sub-tagsMathematical and programmatic operations for reshaping, filtering, and transforming multi-dimensional data structures.
  • Data Aggregation Tools3 sub-tagsUtilities designed to collect, merge, and unify data from multiple disparate sources or endpoints.
  • Data Archive UtilitiesUtilities for extracting, combining, or modifying internal data archive structures.
  • Data Encoding and Serialization2 sub-tags
Libraries for converting data between binary, text, and portable interchange formats for storage or transmission.
  • Data ManipulationMethods for storing, indexing, and performing algebraic operations on structured datasets.
  • Data Parsing and Extraction5 sub-tagsTools focused on identifying, isolating, and converting raw or unstructured input into structured, schema-validated formats.
  • Filtering and Deduplication3 sub-tagsAlgorithmic methods for identifying, ranking, or removing redundant or irrelevant data entries from a collection.
  • Multimodal Data HandlersInterfaces for processing and storing binary, image, and non-textual data within AI pipelines.
  • Output Template EnginesUtilities that transform data into specific output formats using templates and field expression logic.
  • Query LanguagesImplementations and parsers for domain-specific query languages.
  • Search Result FormattersUtilities that transform raw search engine output into structured, readable formats.
  • Stream and Pipeline Orchestration5 sub-tagsFrameworks and engines for managing the flow, transformation, and distributed processing of continuous data streams.
  • Text and NLP Preprocessing2 sub-tagsSpecialized utilities for cleaning, tokenizing, and formatting text strings specifically for natural language processing or UI presentation.
  • Variant Data TypesNative engine-specific types for high-performance mathematical and structural data operations.