38 مستودعات
Structured repositories providing programmatic access to specific data sets.
Distinguishing note: Focuses on the data source aspect of icon collections.
Explore 38 awesome GitHub repositories matching data & databases · Data Sources. Refine with filters or upvote what's useful.
هذا المشروع عبارة عن دليل منسق من قبل المجتمع للبرمجيات مفتوحة المصدر المصممة للنشر في بيئات الخوادم الخاصة والمختبرات المنزلية. يعمل كمورد شامل لاكتشاف بدائل مستقلة ذاتية الاستضافة لخدمات السحابة السائدة، مما يمكن المستخدمين من الحفاظ على ملكية كاملة للبيانات والتحكم في بنيتهم التحتية الرقمية. يتم تنظيم الدليل من خلال تصنيف هرمي ينظم مجموعة واسعة من التطبيقات في فئات منطقية، تتراوح من إدارة الوسائط وتحليل البيانات إلى التواصل الخاص وأدوات إنتاجية الفريق. يتميز بعملية مراجعة أقران تعاونية، حيث يقوم أعضاء المجتمع بالتحقق من جودة وملاءمة كل طلب لضمان بقاء الدليل دقيقاً وموثوقاً. يغطي المشروع نطاقاً واسعاً من القدرات، بما في ذلك أتمتة البنية التحتية، ونشر الخدمات القائمة على الحاويات، وإدارة التكوين التصريحي. تساعد هذه الأدوات المستخدمين في الحفاظ على بيئات خادم قابلة للتكرار وإدارة تبعيات الخدمات المعقدة عبر الأجهزة الخاصة. يتم الحفاظ على الدليل كمستودع خاضع للتحكم في الإصدار، مما يضمن تتبع جميع التحديثات والتغييرات التي يقودها المجتمع وأنها شفافة.
Combines multiple disparate data sources into a unified GraphQL interface to simplify querying across different backend systems.
Gatsby is a React static site generator and hybrid rendering framework used to build websites by pre-rendering components into static HTML files for delivery via content delivery networks. It functions as a hybrid rendering platform that supports a combination of static generation, server-side rendering, and deferred page loading. The framework operates as a GraphQL data aggregator, pulling content from various APIs, headless CMS integrations, and files into a single unified schema for frontend queries. It also serves as a frontend performance optimizer, automating code splitting, resource pr
Aggregates content from disparate APIs and files into a unified GraphQL schema for build-time querying.
graphql-engine is an automated GraphQL API engine that transforms database tables and relationships into a queryable GraphQL schema. It functions as a federation gateway and mapper, instantly generating APIs with built-in filtering, pagination, and mutations from existing databases and remote schemas. The project distinguishes itself through a fine-grained access control layer that enforces row-level and field-level permissions. It further provides a real-time data subscription server that converts standard queries into live streams and a system for triggering event-driven webhooks and notifi
Merges multiple remote GraphQL schemas and data sources into a single unified API endpoint.
Simple Icons is a comprehensive repository of standardized brand logos provided in scalable vector format. It serves as a programmatic data source that offers direct access to official brand vector paths and color codes, enabling developers to integrate consistent visual assets into software projects and user interfaces. The project functions as a web-ready asset provider that supports multiple delivery methods, including direct file imports, remote image embedding, and font-based rendering. By centralizing the storage of icon geometry as raw vector path strings, it ensures consistent renderi
Offers a structured data set for direct access to brand vector paths and color codes.
Crawlee is a web scraping framework designed for building scalable, reliable, and distributed data extraction pipelines. It provides a unified interface for managing headless browser automation and lightweight HTTP requests, allowing developers to handle complex web navigation, dynamic content rendering, and large-scale data collection within a single, modular architecture. The project distinguishes itself through its resource-aware concurrency controller, which dynamically scales task execution based on real-time CPU and memory usage to prevent host machine exhaustion. It also features a rob
Integrates external data sources with internal queues to control how URLs are accessed and processed during a crawl.
This project serves as a comprehensive technical reference for the architecture and design of data-intensive applications. It provides a structured analysis of the fundamental principles required to build reliable, scalable, and maintainable software systems, covering the core trade-offs inherent in modern data infrastructure. The repository explores the mechanics of distributed data management, including strategies for replication, partitioning, and achieving consensus across multiple nodes. It details the design of storage engines, indexing techniques, and transaction management models, whi
Designates primary systems as the authoritative source of truth for all other system components.
Saleor is a headless, API-first commerce platform designed to manage complex retail operations through a decoupled architecture. It provides a centralized backend that uses a GraphQL-based interface to handle product catalogs, order lifecycles, and multi-channel sales across diverse global markets. By separating the commerce engine from the storefront, the platform enables developers to build custom, high-performance shopping experiences while maintaining granular control over data interactions. The platform distinguishes itself through an event-driven architecture that allows for deep extens
Aggregates core entities and custom metadata into a unified GraphQL interface for commerce operations.
Animeko is a cross-platform desktop media client designed to aggregate video streams and peer-to-peer content into a single, unified interface. It functions as a centralized hub for media consumption, allowing users to manage multiple content providers and playback sources within one application. The client distinguishes itself by integrating a specialized engine for real-time peer-to-peer stream buffering, which enables immediate playback of media files directly from decentralized network sources. It further enhances the viewing experience by rendering community-contributed text overlays dir
Combines video streams from various providers to automatically select the best quality source.
Vercel is a cloud platform for building, deploying, and scaling web applications. It provides a unified infrastructure that automates the build process by detecting project frameworks and distributing static and dynamic content through a global content delivery network. The platform executes application logic using serverless functions that scale automatically based on real-time traffic demand. The platform distinguishes itself through a centralized AI gateway that proxies requests to multiple model providers, enabling standardized authentication, observability, and cost tracking. It supports
Restricts information sources to ensure relevance and compliance.
This is a reference implementation of Uncle Bob's clean architecture for Android, structured into distinct domain, data, and presentation layers. The project demonstrates how to organize an Android application around business use cases, keeping domain logic and entities free from framework dependencies. The architecture enforces dependency inversion through layered separation, where inner domain layers define interfaces that outer layers implement. This approach enables repository abstractions for data source switching, presenter-view separation for testable UI logic, and use-case composition
Allows runtime or compile-time swapping of data sources (e.g., remote API vs. local cache) through repository implementations behind a common interface.
ZY-Player is a cross-platform video player and media library manager designed for streaming video and live TV content from custom or imported source lists. It functions as a media metadata browser that fetches movie ratings and information from external databases to assist in content discovery. The application integrates with external player software by handing off media streams to third-party applications. It provides tools for organizing video resources through a media library manager that supports poster views and custom source list definitions. The system includes global media search to
Imports, exports, and defines custom source lists to organize available media content.
dbt-core is a command-line framework for transforming data within a warehouse using modular SQL and version control. It functions as a data transformation engine that enables users to define data structures and business logic through declarative configuration files, which the system then compiles into executable code. By managing complex data dependencies through a directed acyclic graph, it ensures that transformation tasks execute in the correct order while maintaining a manifest-driven state to track lineage and execution history. The project distinguishes itself through an adapter-based d
Accesses information about model lineage, test results, and source freshness to help users understand data structure and quality.
This project is a centralized repository of plugins designed to integrate diverse external manga sources into a single reading interface. It functions as a content aggregation system that allows users to browse and access digital comics from multiple online platforms through a unified application. The system utilizes a framework of web scrapers that normalize data from various websites into a consistent viewing format. To manage these integrations, the project employs a background synchronization service that performs automated version checks, ensuring that installed plugins remain compatible
Manages and selects optimal media sources from multiple international providers to facilitate content discovery.
The AWS Cloud Development Kit is an infrastructure-as-code framework that enables developers to define and provision cloud resources using familiar programming languages. By utilizing construct-based synthesis, it translates high-level, object-oriented code into declarative templates, allowing for the automated management of complex cloud environments through a centralized, code-driven control plane. The framework distinguishes itself through its ability to model infrastructure as a dependency-aware resource graph, ensuring that components are provisioned and updated in the correct order. It
Combines multiple disparate data sources into a unified GraphQL interface.
This project is a command-line synchronization client for OneDrive and SharePoint libraries on Linux. It functions as a synchronization engine that aligns local filesystems with cloud storage through bidirectional, unidirectional, or download-only workflows. The client supports headless authentication for servers without web browsers and can be deployed as a background service or within a containerized environment. It enables the management of multiple distinct cloud accounts on a single system and integrates with shared SharePoint sites and document libraries. The synchronization engine inc
Implements a mirroring mode where the local filesystem is the authoritative source for remote cloud storage.
DataHub is a metadata management platform designed to unify technical, operational, and business context across diverse data ecosystems. By utilizing a graph-based metadata model and an event-driven ingestion architecture, it creates a centralized source of truth that maps complex data relationships, lineage, and ownership. This foundational framework enables organizations to maintain a synchronized view of their data landscape, supporting both human-led discovery and automated data operations. The platform distinguishes itself through its focus on grounding artificial intelligence and autono
Creates native documentation within the catalog to resolve conflicting metrics or processes and provide a single source of truth.
Boto3 is the AWS SDK for Python, providing a programmatic interface for managing and automating AWS cloud infrastructure and services. It serves as a cloud management API client and resource manager for provisioning, configuring, and scaling virtual servers, databases, and storage. The library enables the implementation of infrastructure-as-code through declarative templates and scripts, allowing for the deployment of identical resource stacks across multiple accounts and geographic regions. It also provides a framework for coordinating distributed workflows, serverless functions, and contain
AWS fetches data from multiple sources through a single GraphQL API endpoint to simplify client-side querying.
Gridsome is a Vue.js static site generator designed for building Jamstack websites. It functions as a progressive web app framework that pre-renders components into static HTML files for delivery via content delivery networks. The system includes a GraphQL data orchestrator that unifies content from multiple APIs and local files into a single schema for site queries. It also integrates a frontend asset optimizer to automatically compress images and implement code-splitting. The framework provides support for offline-capable websites through prefetching pages and critical asset loading. Addit
Aggregates disparate content sources into a single GraphQL schema for unified querying during the build process.
Crawlee-python is a web crawling framework for building scalable scrapers using Python. It serves as a comprehensive tool for web scraping automation, providing a system to extract structured data from websites using both lightweight HTTP requests and headless browser automation. The framework is distinguished by its anti-bot evasion capabilities, which include browser fingerprint impersonation and tiered proxy rotation to bypass detection systems and solve challenges such as Cloudflare. It also incorporates artificial intelligence for autonomous website navigation and schema-based data extra
Integrates custom data sources to feed the list of URLs into the crawling queue.
Sui is a blockchain platform featuring an object-centric state model and resource-oriented smart contracts. It utilizes parallel transaction execution to increase network throughput and supports programmable transaction blocks that bundle multiple operations into single atomic units. The platform distinguishes itself with a capability-based access control system and zero-knowledge login mechanisms, enabling users to authenticate via identity providers without seed phrases. It also implements deterministic object addressing to allow predictable state lookups and supports the creation of soulbo
Aggregates data from indexers and archives through a unified GraphQL RPC server.