30 open-source projects similar to tensorflow/data-validation, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Data Validation alternative.
Algorithms for outlier, adversarial and drift detection
Pydantic is a data validation and serialization library that enforces schema constraints and performs type conversion on complex data structures. It utilizes standard Python type annotations to define data models, allowing developers to establish structured schemas that automatically enforce business rules and constraints without the need for custom domain-specific languages. The library distinguishes itself by transforming high-level model definitions into optimized code during initialization to minimize runtime overhead. It supports recursive validation for nested data structures and employ
This project is a public health dataset providing historical and real-time COVID-19 case and death counts across the United States. It consists of a collection of CSV files containing time-series pandemic data organized by date, state, and county. The dataset includes specialized records for institutional outbreaks, tracking infection and death rates within correctional facilities, colleges, and universities. It also provides statistics on excess mortality to estimate total pandemic impact and survey-based data on mask usage prevalence across different counties. To facilitate geographic anal
All about the Erlang programming language, powered by the community
This project is a developer knowledge base and a curated library of programming code snippets. It provides short technical articles and reusable code samples in multiple languages to help developers implement common programming patterns. As a multi-language syntax reference, it allows logic and functionality to be compared across different programming environments. It focuses on learning programming patterns and improving coding skills through a collection of optimized patterns and best practices. The platform includes capabilities for
Magic MCP is a Model Context Protocol server and AI component generator that translates natural language descriptions into functional user interface code. It acts as an LLM design orchestrator, producing responsive web elements and layouts anchored on utility-first CSS styling patterns. The system features a side-by-side variation engine that generates multiple stylistic interpretations of a single prompt for comparative selection. It incorporates SVG-based asset integration for branding and iconography and utilizes template-based assembly to combine pre-defined style patterns with user-speci
commandOK is a Spotlight-like command generator for your terminal. It pops up when you need it and gets out of the way when you don't.
Postman GPUI is a simple graphical user interface application for making HTTP requests, inspired by Postman. This application allows users to create, manage, and send HTTP requests and view the responses in a user-friendly manner.
ANEE is an experimental dynamic inference wrapper for pretrained Transformer language models (currently GPT-2). Instead of always running all layers, ANEE exposes an energy_budget and performs early exit inside the model’s forward pass.
Abseil is a common utility library for C++ that provides foundational building blocks for applications. It serves as a collection of optimized utility functions and data structures that augment the C++ standard library across different compiler versions. The library is distinguished by its high-performance containers, including SIMD-accelerated hash maps and sets for efficient key-value lookups. It also provides a comprehensive framework for computing absolute time points, durations, and timestamps across global time zones. The project covers a broad range of capability areas, including conc
High-level, batteries-included neural network training library for PyTorch
Hub is a multimodal AI data lake and vector database designed for storing and querying embeddings, text, audio, and images. It functions as a dataset version control system and a machine learning data streaming engine to support large-scale model training. The system utilizes a serverless PostgreSQL vector store to index high-dimensional embeddings for semantic search. It provides a visual interface for inspecting multimodal datasets and viewing annotations such as bounding boxes and masks. The platform handles cloud-agnostic storage synchronization and implements lazy, compressed data strea
A curated list of awesome CircuitPython guides, videos, libraries, frameworks, software and resources.
High-performance HTTP proxy originally written by your friends at Lantern and now maintained by a stellar group of volunteer open-source programmers.
Pulse is a generative model image upscaler and latent space image processor. It functions as a self-supervised photo upsampling tool that increases image resolution by exploring the latent space of pre-trained generative models to synthesize high-quality details. The system includes a face image alignment tool designed to standardize the scale and orientation of raw facial photos. This preprocessing utility prepares images for higher resolution processing by aligning and downscaling faces to a standard orientation. The project covers AI image super-resolution and generative photo upscaling,
Claude and OpenClaw AI skill for hotel search and booking via the MoodTrip.ai MCP server
The autonomous, self-improving AI agent. Single Rust binary. Every channel.
Track and manage build artifacts from multiple programming languages.
This project is a curated directory of command line applications and utilities designed to enhance developer productivity and streamline technical workflows. It serves as a comprehensive index of open-source software, categorizing tools that assist with system administration, development automation, and personal task management. The repository distinguishes itself by providing a structured collection of terminal-based software that spans diverse functional domains. It includes resources for managing infrastructure and cloud resources, performing code maintenance, and customizing terminal envi
A Model Context Protocol (MCP) server implementation for 4EVERLAND Hosting enabling instant deployment of AI-generated code to decentralized storage networks like Greenfield, IPFS, and Arweave.