505 repos
Explore 505 awesome GitHub repositories matching category · Data & Databases. Refine with filters or upvote what's useful.
Fiber is a high-performance web framework designed for building scalable HTTP services with minimal memory overhead. It provides a comprehensive runtime environment for managing the full request lifecycle, utilizing an optimized radix tree for high-speed route matching and an object pooling system to reduce garbage collection pressure during traffic processing. The framework distinguishes itself through its multi-process architecture, which supports prefork socket reuse to distribute incoming traffic across all available CPU cores. It offers a modular approach to application development, feat
Fabric is a command-line orchestrator designed to automate complex data processing and content generation tasks by chaining artificial intelligence models with modular prompt templates. It functions as a terminal-based tool that utilizes standard input and output streams, allowing users to pipe data directly into predefined reasoning strategies. By providing a model-agnostic abstraction layer, the system decouples execution logic from specific artificial intelligence vendors, normalizing requests and responses across different service providers. The platform distinguishes itself through its p
This project is a high-performance library designed for the similarity search and clustering of dense vectors across massive datasets. It functions as a vector similarity search engine, providing the necessary tools to organize complex numerical data into specialized structures that facilitate rapid retrieval and efficient querying of millions of records. The library distinguishes itself through a variety of advanced indexing and compression techniques, including hierarchical navigable small worlds for logarithmic time complexity and inverted file indexing to partition vector spaces into mana
Appsmith is a low-code visual development platform designed for building internal business tools, administrative panels, and custom dashboards. It provides a unified editor that allows developers to compose interactive user interfaces using drag-and-drop widgets and bind them directly to backend logic, databases, and external APIs. The platform operates on a reactive model, where interface components automatically update in response to changes in data expressions or application state. The platform distinguishes itself through a modular architecture that supports the creation of reusable UI co
This project is a community-driven repository that serves as a comprehensive directory for the design industry. It provides a structured index of software, plugins, and digital assets, helping creative professionals discover and evaluate tools tailored to specific stages of the design process. The collection is maintained through a decentralized, community-driven model where external contributors submit and verify entries to ensure the information remains current. To assist users in navigating the complex ecosystem of design technology, the repository employs a hierarchical taxonomy that orga
Phaser is a comprehensive 2D game engine designed for building high-performance, interactive content that runs directly in web browsers. At its core, the engine utilizes a fixed-timestep simulation loop that decouples game logic from variable browser frame rates, ensuring consistent behavior across diverse hardware. It provides a robust framework for managing asset loading, physics, input, and audio, enabling the creation of complex, responsive visual experiences for both desktop and mobile devices. The engine distinguishes itself through a high-performance graphics pipeline that automaticall
Bark is a generative audio engine and machine learning inference library designed to convert written text into high-fidelity speech and sound effects. It functions as a text-to-audio transformer, utilizing multi-stage neural network architectures to map semantic input tokens into detailed audio codebooks for synthesis. The system distinguishes itself through a hierarchical transformer stacking approach that separates semantic understanding from acoustic realization. By employing autoregressive token prediction and vector quantized codebook mapping, the engine bridges linguistic and sonic doma
Quivr is a retrieval-augmented generation platform designed to transform raw documents into searchable knowledge bases. It functions as a centralized environment where users can ingest files, index them into vector databases, and interact with language models to receive contextually relevant, data-backed responses. The platform distinguishes itself through an agentic workflow orchestrator that sequences retrieval tasks, tool execution, and model interactions to resolve complex, multi-step queries. This engine is entirely configuration-driven, allowing users to define document ingestion, chunk
Cobalt is a cross-platform web application designed as a distributed service platform for managing media content downloading. It functions as a full-stack monorepo that integrates a backend API with a responsive frontend, providing a unified interface for users to fetch and save media files from various online platforms. The project utilizes a modular architecture where backend services, frontend interfaces, and shared logic are organized into decoupled packages within a single repository. This monorepo structure employs centralized workspace orchestration to manage dependencies and cross-pac
This project is a centralized notification infrastructure platform designed to manage multi-channel messaging workflows, delivery routing, and user preference settings through a unified integration layer. It provides a code-first workflow engine that allows engineers to define complex messaging sequences and notification logic as version-controlled code, ensuring consistency across development and deployment pipelines. The platform distinguishes itself by decoupling notification content from application logic, enabling non-technical teams to design and update templates through a visual interf
Impress.js is a browser-based presentation engine that transforms standard document elements into interactive, three-dimensional slide decks. It functions as a declarative layout framework, allowing users to define spatial relationships and visual perspectives for content within a coordinate-based canvas. By leveraging the browser's native rendering capabilities, it enables the creation of non-linear slide decks that utilize hardware-accelerated transformations for movement and perspective shifts. The framework distinguishes itself through a state-driven controller that manages active content
This project is an open-source educational curriculum designed to provide comprehensive training in data engineering. It focuses on building scalable data pipelines and managing cloud-native infrastructure through a structured, self-paced program that combines technical explanations with hands-on practical exercises. The curriculum distinguishes itself by emphasizing industry-standard methodologies, specifically teaching students how to implement infrastructure as code and manage data workflows through orchestration tools. By utilizing container-based environment isolation and declarative con
Chatbox is a cross-platform desktop application that provides a unified interface for interacting with a wide range of artificial intelligence models. It functions as a model-agnostic client, allowing users to connect to various third-party AI providers or execute open-source models directly on their own hardware. By centralizing these diverse services into a single workspace, the application enables users to manage multiple chat sessions, adjust model parameters, and switch between different AI backends with ease. The project distinguishes itself through a local-first architecture that prior
DevDocs is an offline-first web application that functions as a unified documentation aggregator. It collects multiple technical API references into a single, searchable interface, allowing users to access complex software documentation without an active internet connection. The platform distinguishes itself through a high-performance, client-side architecture that enables instant filtering and retrieval of documentation. By utilizing in-memory search indexing and local browser storage, it eliminates the need for server-side round trips, ensuring that syntax and library details are available
Nushell is a cross-platform shell and programming language designed to treat all input and output as structured data rather than raw text streams. By enforcing data types and command signatures, it provides a consistent environment for building robust, pipeline-oriented workflows. The shell allows users to chain commands that pass structured objects between stages, enabling complex data processing and automation tasks that remain predictable across different operating systems. What distinguishes the project is its focus on interactive data exploration and modular extensibility. Users can quer
LosslessCut is a desktop application designed for the precise editing of video and audio files without re-encoding the underlying media streams. By performing stream copying and container remuxing, the software allows users to cut, merge, and rearrange media segments while maintaining the original bit-perfect quality of the source content. The application distinguishes itself by utilizing a stream-copying data pipeline that transfers raw media packets directly from source to destination, significantly reducing processing time compared to traditional transcoding workflows. It also functions as
Istio is a service mesh infrastructure that provides a centralized control plane to manage, secure, and observe communication between distributed microservices. It functions as a policy-driven network traffic controller, enabling developers to route, balance, and secure service-to-service traffic without requiring modifications to application code. The system enforces zero-trust security by utilizing mutual transport layer authentication to verify cryptographic identities for every network request. The project distinguishes itself through a sidecar-less proxy architecture, which offloads netw
Halo is a modular content management platform built on the Java Virtual Machine, designed to power dynamic websites through a flexible, extensible architecture. It provides a centralized administrative interface for publishing digital content and managing media assets, serving as a foundation for diverse web projects ranging from personal blogs to corporate sites. The platform distinguishes itself through a plugin-based architecture that allows for the dynamic loading of functional components and third-party services without modifying the core source code. This extensibility is complemented b
Agno is an agent operating system designed to manage the lifecycle, tool execution, and persistent state of autonomous agents across distributed infrastructure. It provides a unified runtime environment that wraps diverse agent frameworks into a consistent, interoperable protocol, allowing developers to build and deploy complex multi-agent systems that coordinate tasks and delegate sub-processes. The platform distinguishes itself through a robust governance and orchestration layer that includes human-in-the-loop approval gates, role-based access control, and a centralized API gateway. It feat
Insomnia is a desktop application designed for the design, testing, and debugging of network requests. It serves as a comprehensive environment for managing the API lifecycle, allowing users to draft interface specifications, simulate endpoints, and execute automated testing workflows within continuous integration pipelines. The platform distinguishes itself through a modular, plugin-based architecture that enables the integration of custom scripts and external tools. It supports complex development needs by providing a local-first data persistence model, environment-variable substitution for