38 Repos
Utilities for converting structured data between different serialization formats like CSV, JSON, and YAML.
Distinct from Financial Data Format Translation: the closest neighboring categories focus on errors, subscriptions, or financial data, whereas this category covers general-purpose tabular format translation.
Explore 38 GitHub repositories matching Data & Databases · Data Format Translators.
Agent-Reach is an AI agent web gateway and search tool that provides language models with the ability to search and read content from the open web, social media, and community forums without using official APIs. It functions as a routing layer that connects large language models to various internet backends while managing content parsing and connection health. The system enables API-free information retrieval by using open-source backends to extract text and metadata from platforms such as Twitter, Reddit, and YouTube. It converts unstructured website content, RSS feeds, and video transcripts
Converts diverse platform data formats into a standardized schema for consistent consumption by language models.
Vegeta is an HTTP load testing tool and library designed to measure the performance and stability of web services. It functions as a command-line utility, a programmable package for integration into other applications, and a distributed load generator capable of splitting request rates across multiple machines. The tool is distinguished by its constant-rate request scheduler, which dispatches requests at a fixed frequency regardless of target response times. It employs lazy target streaming to maintain low memory usage during large tests and uses a binary-encoded storage format to minimize di
Translates result files between binary, CSV, and JSON formats for compatibility with analysis tools.
Labelme is a Python-based image annotation tool used to create datasets for computer vision. It serves as a visual editor for semantic segmentation and lets users define object boundaries with polygons, rectangles, points, and circles. The application also functions as an annotator for multispectral images and supports high-bit-depth TIFF files used in satellite and scientific imaging. The tool integrates AI-assisted labeling features to automate the creation of masks and polygons. These features enable shape generation from text prompts or interactive point selections, which propose boundaries based on positive and negative points placed by the user. The software covers a broad range of data management and annotation tasks, including the creation of dense pixel masks, rotated bounding boxes, and video frame sequencing. It includes a pipeline for translating its internal JSON state persistence into standard dataset formats such as COCO and Pascal VOC. Additional features include image-level classification flags, geometry refinement tools, and batch import of images.
Converts internal JSON annotation representations into standard computer vision dataset formats like COCO and Pascal VOC.
Jimp is a zero-dependency JavaScript image processing library and programmatic editor designed for manipulating, resizing, and filtering images in Node.js. It functions as a multi-format image encoder and extensible pipeline that operates entirely in JavaScript to ensure portable deployment across different environments without requiring native system dependencies. The engine features a modular architecture that allows for custom image processor builds and the registration of custom processing plugins. This extensibility enables the addition of specific visual effects and custom file-type enc
Includes dedicated translation modules to encode and decode raw pixel data across multiple image formats.
Tiled is a visual editor for creating two-dimensional game environments, functioning as a tile map editor and game level design tool. It allows for the construction of game worlds by arranging reusable tile sets and assets into maps. The system includes a tile set manager to organize and modify collections of tiles that can be shared across multiple project maps. It supports the creation of complex environments through layered map composition and the placement of objects at precise coordinates. The editor provides capabilities for grid alignment, coordinate-based layouts, and the attachment
Converts internal map data into structured formats like JSON or XML for integration with external game engines.
Coil is an image loading and caching pipeline designed for Android and Compose Multiplatform applications. It functions as a comprehensive loader, caching engine, and rendering utility that asynchronously fetches and displays images from network URLs, local storage, and multiplatform resource systems. The library distinguishes itself through a flexible fetcher-decoder pipeline and an interface-driven component registry, allowing for the integration of custom networking clients and decoders. It provides specialized support for rendering scalable vector graphics, animated formats such as GIF an
Transforms image data between bitmaps, drawables, and painters to ensure compatibility with different rendering engines.
fq is a command-line binary data processor used to decode, transform, and analyze raw byte streams and bit-level data into structured formats. It functions as a functional binary query engine that enables filtering and mapping of binary structures, and as a converter that translates complex binary blobs and proprietary file formats into standard JSON, YAML, or XML. The tool stands out as a low-level bit manipulator that can perform bit-level slicing, bitwise operations, and cryptographic hashing on raw files. It also serves as a network protocol analyzer with the ability to reassemble fragmented TCP streams and decrypt TLS traffic for application-level inspection. The project covers broad functionality in binary analysis and data transformation, including support for custom decoder definitions and a wide range of specialized formats such as Mach-O, ASN1 BER, and Avro OCF. It provides utilities for binary tree search, structured text decoding, and bidirectional serialization between binary and text formats. Users can interact with the system through a command-line interface and an interactive REPL for real-time query testing.
Translates structured data between binary, text-based serialization formats, and various number bases.
RestKit is an iOS REST API integration framework used to connect Apple platform applications to web services. It centers on RESTful resource modeling, transforming remote API data structures into typed local objects and database entities. The framework provides capabilities for remote data persistence by caching web service resources in a local database to enable offline access. It also handles network data serialization, using pluggable parsers to convert JSON or XML responses into application data models. Additional functionality includes multipart file uploading for sending binary data an
Converts incoming data streams from formats like JSON or XML into usable objects using a pluggable interface.
MAME is a vintage hardware emulation platform designed to recreate the circuitry of arcade games, computers, and consoles to run original software on modern devices. It functions as a retro gaming preservation framework for managing, verifying, and archiving ROM sets and disk images to ensure long-term software accessibility. The project features a system debugging tool for inspecting emulated memory, CPU registers, and execution flow via breakpoints and disassembly. It also includes a Lua-based automation layer that exposes core system state and hardware controls for custom behavior and anal
Translates between various on-disk image file formats and internal magnetic state representations.
Miller is a command-line data processor used for filtering, transforming, and aggregating name-indexed tabular data. It functions as a tool for querying and reshaping records across multiple file formats, serving as a converter between CSV, JSON, and YAML. The tool distinguishes itself by using a name-indexed data model, allowing users to manipulate fields by name rather than numeric position. It utilizes single-pass streaming algorithms to compute statistics and summaries on large datasets that exceed available system memory. Its capabilities cover data transformation and analysis, includin
Provides a common internal representation to convert seamlessly between CSV, JSON, and YAML formats.
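The idea of a common, name-indexed internal representation can be illustrated with a minimal Python sketch. This is not Miller's implementation or command syntax; the function names (`csv_to_records`, `records_to_json`, `records_to_csv`) are hypothetical, and the sketch assumes every record shares the fields of the first one.

```python
import csv
import io
import json


def csv_to_records(text):
    """Parse CSV text into a list of name-indexed records (plain dicts)."""
    return list(csv.DictReader(io.StringIO(text)))


def records_to_json(records):
    """Serialize the name-indexed records as a JSON array of objects."""
    return json.dumps(records, indent=2)


def records_to_csv(records):
    """Serialize records back to CSV, taking the header from the first record."""
    if not records:
        return ""
    out = io.StringIO()
    writer = csv.DictWriter(
        out, fieldnames=list(records[0].keys()), lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(records)
    return out.getvalue()


# Every format is read into, and written from, the same list of dicts.
sample = "name,qty\napple,3\npear,5\n"
records = csv_to_records(sample)
as_json = records_to_json(records)
```

Because each reader and writer only talks to the shared record list, supporting an additional format in this sketch would mean adding one parser and one serializer rather than a converter for every pair of formats.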
Jackson is a Java data binding framework and multi-format data serializer used to translate data structures into native language objects. It functions as a JSON data binding library and a streaming parser that reads and writes data as discrete tokens to process large datasets with minimal memory. The project distinguishes itself through a bytecode serialization accelerator that replaces standard reflection with generated bytecode to increase data binding speed. It employs a module-based extensibility model to support a wide range of formats beyond JSON, including XML, YAML, CSV, TOML, and bin
Provides pluggable interfaces to convert incoming data streams from multiple formats like XML and YAML into typed objects.
Delta is a lakehouse table format that brings ACID transactions and data warehouse consistency to large scale data lakes on cloud object storage. It serves as an ACID transaction manager, coordinating atomic commits and serializable isolation for concurrent reads and writes across distributed compute engines. The project provides a multi-engine interoperability layer that uses format translation to allow diverse SQL engines and processing frameworks to read and write the same tables. It functions as a data versioning system, utilizing a transaction log to enable time travel, historical snapsh
Generates compatible metadata for external open table formats to allow cross-engine interoperability without data duplication.
Syft is a software bill of materials generator, container image scanner, and software dependency catalog. It analyzes container images and filesystems to produce comprehensive inventories of installed packages and dependencies in standard formats. Additionally, it serves as a software attestation tool and an SBOM format converter. The project distinguishes itself through the ability to create cryptographically signed attestations for software inventories to ensure provenance and integrity. It also provides the capability to transform software bills of materials between different industry sche
Transforms internal software inventory data between different industry-standard SBOM serialization formats.
Dream Textures is a Stable Diffusion integration for Blender that provides tools for text-to-image generation, depth projection, and node-based processing within a 3D environment. It functions as an AI texture generator capable of producing image textures and concept art from text prompts and scene renders. The system features a depth-to-image projection tool that maps generated imagery onto 3D models using depth data for spatial alignment. It also includes a node-based AI image processor for creating procedural visual effects and a dedicated toolset for AI-assisted inpainting and outpainting
Converts raw pixel data between grayscale, RGB, and RGBA formats for software compatibility.
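As a rough illustration of converting pixel data between grayscale, RGB, and RGBA, the following Python sketch works on plain lists of integer channel values. It is not taken from Dream Textures; the function names are hypothetical, and the grayscale conversion uses an unweighted integer mean purely as a simplifying assumption.

```python
def gray_to_rgb(pixels):
    """Expand each grayscale value into an (r, g, b) triple with equal channels."""
    return [(g, g, g) for g in pixels]


def rgb_to_rgba(pixels, alpha=255):
    """Append a constant alpha channel (fully opaque by default)."""
    return [(r, g, b, alpha) for (r, g, b) in pixels]


def rgba_to_rgb(pixels):
    """Drop the alpha channel without blending against a background."""
    return [(r, g, b) for (r, g, b, _alpha) in pixels]


def rgb_to_gray(pixels):
    """Collapse RGB to one channel using an unweighted integer mean (assumption)."""
    return [(r + g + b) // 3 for (r, g, b) in pixels]
```

A production converter would more likely operate on array buffers and use perceptual channel weights; the list-based form here only shows the shape of each conversion.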
X-AnyLabeling is an AI-assisted annotation platform and computer vision labeling tool. It provides an interface for annotating images and videos using polygons and rectangles to create training sets for machine learning models. The project distinguishes itself through the integration of external AI models via a plugin-based inference backend, allowing for automated generation of candidate labels and the execution of specialized tasks like pose estimation and object detection. It also functions as an optical character recognition tool for extracting text and layout information from document im
Maps annotation data between various industry standards to ensure compatibility across different machine learning tools.
DevOps-Bash-tools is a collection of shell scripts and aliases designed to automate cloud infrastructure, container orchestration, and CI/CD pipelines. It provides a comprehensive toolset for managing operational workflows through the command line. The project specializes in automating tasks across multiple platforms, including managing namespaces and secrets in Kubernetes, auditing resources in AWS and GCP, and triggering builds or managing environment variables in GitHub Actions, GitLab CI, and CircleCI. It also includes a toolkit for interacting with container registries to query manifests
Converts between JSON and YAML and validates data files.
QOI is a lossless image codec and encoding standard designed for high-speed compression and decompression of raw pixel data. It provides a toolkit for translating raw image buffers into a compact format and back into pixel representations without any loss of quality. The implementation focuses on fast image encoding and decoding, enabling the rapid conversion of compressed image data back into raw pixels. It also supports image format conversion to ensure compatibility across different software systems and hardware.
Translates image data between different formats to ensure cross-system and hardware compatibility.
MMPose is a PyTorch-based pose estimation toolbox and deep learning training pipeline designed for detecting 2D and 3D keypoints on humans, animals, and faces. It serves as a computer vision model zoo and a framework for both 2D pose estimation and 3D pose lifting. The project is distinguished by its modular architecture and extensibility, employing a registry-based system and hierarchical configurations to allow for custom algorithm integration and model pipeline customization. It supports diverse estimation paradigms, including top-down, bottom-up, and two-stage pose lifting workflows. The
Includes utilities to translate raw dataset annotations into standardized formats like COCO.
This project is an administrative GIS toolset that provides a comprehensive dataset of China's administrative divisions, including provinces, cities, districts, and townships. It functions as a coordinate system transformer and a boundary converter for transforming geographic data into standard formats. The toolset distinguishes itself through the ability to convert administrative boundary data between CSV, GeoJSON, Shapefiles, and SQL. It includes specialized utilities for coordinate system transformation between GCJ-02, BD-09, WGS-84, and CGCS2000 standards to ensure accuracy across differe
Translates administrative data between specialized spatial formats like GeoJSON and SHP and tabular structures like CSV.
Microsandbox is a runtime for creating and managing lightweight, hardware-isolated virtual machines — called sandboxes — that boot directly from standard OCI container images. Each sandbox runs as its own host process with a separate kernel, filesystem, and network stack, providing process-per-sandbox isolation. The project includes a command-line tool and multi-language SDKs (Rust, TypeScript, Python, Go) for programmatic lifecycle control, and it communicates with sandbox agents over Unix sockets using a CBOR-encoded protocol. What distinguishes Microsandbox is its combination of host-manag
Automatically detects disk image format from file extension or accepts explicit type hints for booting.
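Detection by file extension with an optional explicit hint could look roughly like the Python sketch below. It is not Microsandbox's API: the function `detect_disk_format`, the `hint` parameter, and the extension table are all hypothetical, and the listed formats are illustrative assumptions rather than the formats the project actually supports.

```python
from pathlib import Path

# Hypothetical extension table; the real set of supported formats is not
# stated in the listing above.
EXTENSION_FORMATS = {
    ".qcow2": "qcow2",
    ".raw": "raw",
    ".img": "raw",
    ".vmdk": "vmdk",
}


def detect_disk_format(path, hint=None):
    """Return the disk image format, preferring an explicit hint if given."""
    if hint is not None:
        # An explicit type hint overrides whatever the file name suggests.
        return hint.lower()
    suffix = Path(path).suffix.lower()
    if suffix not in EXTENSION_FORMATS:
        raise ValueError(
            f"cannot infer disk format from {path!r}; pass an explicit hint"
        )
    return EXTENSION_FORMATS[suffix]
```

For example, `detect_disk_format("disk.QCOW2")` resolves through the table, while `detect_disk_format("blob.bin", hint="raw")` skips detection entirely.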