Why is awesome-selfhosted/awesome-selfhosted a recommended Data Sources GitHub Repositories repository?

Combines multiple disparate data sources into a unified GraphQL interface to simplify querying across different backend systems.

Why is gatsbyjs/gatsby a recommended Data Sources GitHub Repositories repository?

Aggregates content from disparate APIs and files into a unified GraphQL schema for build-time querying.

Why is hasura/graphql-engine a recommended Data Sources GitHub Repositories repository?

Merges multiple remote GraphQL schemas and data sources into a single unified API endpoint.

Why is simple-icons/simple-icons a recommended Data Sources GitHub Repositories repository?

Offers a structured data set for direct access to brand vector paths and color codes.

Why is apify/crawlee a recommended Data Sources GitHub Repositories repository?

Integrates external data sources with internal queues to control how URLs are accessed and processed during a crawl.

Why is vonng/ddia a recommended Data Sources GitHub Repositories repository?

Designates primary systems as the authoritative source of truth for all other system components.

Why is saleor/saleor a recommended Data Sources GitHub Repositories repository?

Aggregates core entities and custom metadata into a unified GraphQL interface for commerce operations.

Why is open-ani/animeko a recommended Data Sources GitHub Repositories repository?

Combines video streams from various providers to automatically select the best quality source.

Why is vercel/vercel a recommended Data Sources GitHub Repositories repository?

Restricts information sources to ensure relevance and compliance.

Why is android10/android-cleanarchitecture a recommended Data Sources GitHub Repositories repository?

Allows runtime or compile-time swapping of data sources (e.g., remote API vs. local cache) through repository implementations behind a common interface.

38 مستودعات

Awesome GitHub RepositoriesData Sources

Structured repositories providing programmatic access to specific data sets.

Distinguishing note: Focuses on the data source aspect of icon collections.

Explore 38 awesome GitHub repositories matching data & databases · Data Sources. Refine with filters or upvote what's useful.

اعثر على أفضل المستودعات باستخدام الذكاء الاصطناعي.سنبحث عن أفضل المستودعات المطابقة باستخدام الذكاء الاصطناعي.

awesome-selfhosted/awesome-selfhosted
awesome-selfhosted/awesome-selfhosted
299,516عرض على GitHub
هذا المشروع عبارة عن دليل منسق من قبل المجتمع للبرمجيات مفتوحة المصدر المصممة للنشر في بيئات الخوادم الخاصة والمختبرات المنزلية. يعمل كمورد شامل لاكتشاف بدائل مستقلة ذاتية الاستضافة لخدمات السحابة السائدة، مما يمكن المستخدمين من الحفاظ على ملكية كاملة للبيانات والتحكم في بنيتهم التحتية الرقمية. يتم تنظيم الدليل من خلال تصنيف هرمي ينظم مجموعة واسعة من التطبيقات في فئات منطقية، تتراوح من إدارة الوسائط وتحليل البيانات إلى التواصل الخاص وأدوات إنتاجية الفريق. يتميز بعملية مراجعة أقران تعاونية، حيث يقوم أعضاء المجتمع بالتحقق من جودة وملاءمة كل طلب لضمان بقاء الدليل دقيقاً وموثوقاً. يغطي المشروع نطاقاً واسعاً من القدرات، بما في ذلك أتمتة البنية التحتية، ونشر الخدمات القائمة على الحاويات، وإدارة التكوين التصريحي. تساعد هذه الأدوات المستخدمين في الحفاظ على بيئات خادم قابلة للتكرار وإدارة تبعيات الخدمات المعقدة عبر الأجهزة الخاصة. يتم الحفاظ على الدليل كمستودع خاضع للتحكم في الإصدار، مما يضمن تتبع جميع التحديثات والتغييرات التي يقودها المجتمع وأنها شفافة.
Combines multiple disparate data sources into a unified GraphQL interface to simplify querying across different backend systems.
awesomeawesome-listcloud
عرض على GitHub299,516
gatsbyjs/gatsby
gatsbyjs/gatsby
55,941عرض على GitHub
Gatsby is a React static site generator and hybrid rendering framework used to build websites by pre-rendering components into static HTML files for delivery via content delivery networks. It functions as a hybrid rendering platform that supports a combination of static generation, server-side rendering, and deferred page loading. The framework operates as a GraphQL data aggregator, pulling content from various APIs, headless CMS integrations, and files into a single unified schema for frontend queries. It also serves as a frontend performance optimizer, automating code splitting, resource pr
Aggregates content from disparate APIs and files into a unified GraphQL schema for build-time querying.
JavaScriptblogcompilergatsby
عرض على GitHub55,941
hasura/graphql-engine
hasura/graphql-engine
32,064عرض على GitHub
graphql-engine is an automated GraphQL API engine that transforms database tables and relationships into a queryable GraphQL schema. It functions as a federation gateway and mapper, instantly generating APIs with built-in filtering, pagination, and mutations from existing databases and remote schemas. The project distinguishes itself through a fine-grained access control layer that enforces row-level and field-level permissions. It further provides a real-time data subscription server that converts standard queries into live streams and a system for triggering event-driven webhooks and notifi
Merges multiple remote GraphQL schemas and data sources into a single unified API endpoint.
TypeScriptaccess-controlapiautomatic-api
عرض على GitHub32,064
simple-icons/simple-icons
simple-icons/simple-icons
24,495عرض على GitHub
Simple Icons is a comprehensive repository of standardized brand logos provided in scalable vector format. It serves as a programmatic data source that offers direct access to official brand vector paths and color codes, enabling developers to integrate consistent visual assets into software projects and user interfaces. The project functions as a web-ready asset provider that supports multiple delivery methods, including direct file imports, remote image embedding, and font-based rendering. By centralizing the storage of icon geometry as raw vector path strings, it ensures consistent renderi
Offers a structured data set for direct access to brand vector paths and color codes.
JavaScriptbrandbrand-assetsbrand-colors
عرض على GitHub24,495
apify/crawlee
apify/crawlee
24,002عرض على GitHub
Crawlee is a web scraping framework designed for building scalable, reliable, and distributed data extraction pipelines. It provides a unified interface for managing headless browser automation and lightweight HTTP requests, allowing developers to handle complex web navigation, dynamic content rendering, and large-scale data collection within a single, modular architecture. The project distinguishes itself through its resource-aware concurrency controller, which dynamically scales task execution based on real-time CPU and memory usage to prevent host machine exhaustion. It also features a rob
Integrates external data sources with internal queues to control how URLs are accessed and processed during a crawl.
TypeScriptapifyautomationcrawler
عرض على GitHub24,002
vonng/ddia
Vonng/ddia
22,648عرض على GitHub
This project serves as a comprehensive technical reference for the architecture and design of data-intensive applications. It provides a structured analysis of the fundamental principles required to build reliable, scalable, and maintainable software systems, covering the core trade-offs inherent in modern data infrastructure. The repository explores the mechanics of distributed data management, including strategies for replication, partitioning, and achieving consensus across multiple nodes. It details the design of storage engines, indexing techniques, and transaction management models, whi
Designates primary systems as the authoritative source of truth for all other system components.
Pythonbookdatabaseddia
عرض على GitHub22,648
saleor/saleor
saleor/saleor
22,610عرض على GitHub
Saleor is a headless, API-first commerce platform designed to manage complex retail operations through a decoupled architecture. It provides a centralized backend that uses a GraphQL-based interface to handle product catalogs, order lifecycles, and multi-channel sales across diverse global markets. By separating the commerce engine from the storefront, the platform enables developers to build custom, high-performance shopping experiences while maintaining granular control over data interactions. The platform distinguishes itself through an event-driven architecture that allows for deep extens
Aggregates core entities and custom metadata into a unified GraphQL interface for commerce operations.
Pythoncartcheckoutcommerce
عرض على GitHub22,610
open-ani/animeko
open-ani/animeko
18,500عرض على GitHub
Animeko is a cross-platform desktop media client designed to aggregate video streams and peer-to-peer content into a single, unified interface. It functions as a centralized hub for media consumption, allowing users to manage multiple content providers and playback sources within one application. The client distinguishes itself by integrating a specialized engine for real-time peer-to-peer stream buffering, which enables immediate playback of media files directly from decentralized network sources. It further enhances the viewing experience by rendering community-contributed text overlays dir
Combines video streams from various providers to automatically select the best quality source.
Kotlinandroidanianime
عرض على GitHub18,500
vercel/vercel
vercel/vercel
15,738عرض على GitHub
Vercel is a cloud platform for building, deploying, and scaling web applications. It provides a unified infrastructure that automates the build process by detecting project frameworks and distributing static and dynamic content through a global content delivery network. The platform executes application logic using serverless functions that scale automatically based on real-time traffic demand. The platform distinguishes itself through a centralized AI gateway that proxies requests to multiple model providers, enabling standardized authentication, observability, and cost tracking. It supports
Restricts information sources to ensure relevance and compliance.
TypeScriptclicloudcommand
عرض على GitHub15,738
android10/android-cleanarchitecture
android10/Android-CleanArchitecture
15,540عرض على GitHub
This is a reference implementation of Uncle Bob's clean architecture for Android, structured into distinct domain, data, and presentation layers. The project demonstrates how to organize an Android application around business use cases, keeping domain logic and entities free from framework dependencies. The architecture enforces dependency inversion through layered separation, where inner domain layers define interfaces that outer layers implement. This approach enables repository abstractions for data source switching, presenter-view separation for testable UI logic, and use-case composition
Allows runtime or compile-time swapping of data sources (e.g., remote API vs. local cache) through repository implementations behind a common interface.
Javaandroidandroid-applicationandroid-architecture
عرض على GitHub15,540
hunlongyu/zy-player
Hunlongyu/ZY-Player
14,516عرض على GitHub
ZY-Player is a cross-platform video player and media library manager designed for streaming video and live TV content from custom or imported source lists. It functions as a media metadata browser that fetches movie ratings and information from external databases to assist in content discovery. The application integrates with external player software by handing off media streams to third-party applications. It provides tools for organizing video resources through a media library manager that supports poster views and custom source list definitions. The system includes global media search to
Imports, exports, and defines custom source lists to organize available media content.
Vueelectronelectron-appelectron-application
عرض على GitHub14,516
dbt-labs/dbt-core
dbt-labs/dbt-core
13,051عرض على GitHub
dbt-core is a command-line framework for transforming data within a warehouse using modular SQL and version control. It functions as a data transformation engine that enables users to define data structures and business logic through declarative configuration files, which the system then compiles into executable code. By managing complex data dependencies through a directed acyclic graph, it ensures that transformation tasks execute in the correct order while maintaining a manifest-driven state to track lineage and execution history. The project distinguishes itself through an adapter-based d
Accesses information about model lineage, test results, and source freshness to help users understand data structure and quality.
Rustanalyticsbusiness-intelligencedata-modeling
عرض على GitHub13,051
keiyoushi/extensions
keiyoushi/extensions
13,045عرض على GitHub
This project is a centralized repository of plugins designed to integrate diverse external manga sources into a single reading interface. It functions as a content aggregation system that allows users to browse and access digital comics from multiple online platforms through a unified application. The system utilizes a framework of web scrapers that normalize data from various websites into a consistent viewing format. To manage these integrations, the project employs a background synchronization service that performs automated version checks, ensuring that installed plugins remain compatible
Manages and selects optimal media sources from multiple international providers to facilitate content discovery.
HTML
عرض على GitHub13,045
aws/aws-cdk
aws/aws-cdk
12,817عرض على GitHub
The AWS Cloud Development Kit is an infrastructure-as-code framework that enables developers to define and provision cloud resources using familiar programming languages. By utilizing construct-based synthesis, it translates high-level, object-oriented code into declarative templates, allowing for the automated management of complex cloud environments through a centralized, code-driven control plane. The framework distinguishes itself through its ability to model infrastructure as a dependency-aware resource graph, ensuring that components are provisioned and updated in the correct order. It
Combines multiple disparate data sources into a unified GraphQL interface.
TypeScriptawscloud-infrastructurehacktoberfest
عرض على GitHub12,817
abraunegg/onedrive
abraunegg/onedrive
12,577عرض على GitHub
This project is a command-line synchronization client for OneDrive and SharePoint libraries on Linux. It functions as a synchronization engine that aligns local filesystems with cloud storage through bidirectional, unidirectional, or download-only workflows. The client supports headless authentication for servers without web browsers and can be deployed as a background service or within a containerized environment. It enables the management of multiple distinct cloud accounts on a single system and integrates with shared SharePoint sites and document libraries. The synchronization engine inc
Implements a mirroring mode where the local filesystem is the authoritative source for remote cloud storage.
D
عرض على GitHub12,577
datahub-project/datahub
datahub-project/datahub
12,141عرض على GitHub
DataHub is a metadata management platform designed to unify technical, operational, and business context across diverse data ecosystems. By utilizing a graph-based metadata model and an event-driven ingestion architecture, it creates a centralized source of truth that maps complex data relationships, lineage, and ownership. This foundational framework enables organizations to maintain a synchronized view of their data landscape, supporting both human-led discovery and automated data operations. The platform distinguishes itself through its focus on grounding artificial intelligence and autono
Creates native documentation within the catalog to resolve conflicting metrics or processes and provide a single source of truth.
Pythondata-catalogdata-discoverydata-governance
عرض على GitHub12,141
boto/boto3
boto/boto3
9,834عرض على GitHub
Boto3 is the AWS SDK for Python, providing a programmatic interface for managing and automating AWS cloud infrastructure and services. It serves as a cloud management API client and resource manager for provisioning, configuring, and scaling virtual servers, databases, and storage. The library enables the implementation of infrastructure-as-code through declarative templates and scripts, allowing for the deployment of identical resource stacks across multiple accounts and geographic regions. It also provides a framework for coordinating distributed workflows, serverless functions, and contain
AWS fetches data from multiple sources through a single GraphQL API endpoint to simplify client-side querying.
Pythonawsaws-sdkcloud
عرض على GitHub9,834
gridsome/gridsome
gridsome/gridsome
8,484عرض على GitHub
Gridsome is a Vue.js static site generator designed for building Jamstack websites. It functions as a progressive web app framework that pre-renders components into static HTML files for delivery via content delivery networks. The system includes a GraphQL data orchestrator that unifies content from multiple APIs and local files into a single schema for site queries. It also integrates a frontend asset optimizer to automatically compress images and implement code-splitting. The framework provides support for offline-capable websites through prefetching pages and critical asset loading. Addit
Aggregates disparate content sources into a single GraphQL schema for unified querying during the build process.
JavaScript
عرض على GitHub8,484
apify/crawlee-python
apify/crawlee-python
8,097عرض على GitHub
Crawlee-python is a web crawling framework for building scalable scrapers using Python. It serves as a comprehensive tool for web scraping automation, providing a system to extract structured data from websites using both lightweight HTTP requests and headless browser automation. The framework is distinguished by its anti-bot evasion capabilities, which include browser fingerprint impersonation and tiered proxy rotation to bypass detection systems and solve challenges such as Cloudflare. It also incorporates artificial intelligence for autonomous website navigation and schema-based data extra
Integrates custom data sources to feed the list of URLs into the crawling queue.
Pythonapifyautomationbeautifulsoup
عرض على GitHub8,097
mystenlabs/sui
MystenLabs/sui
7,612عرض على GitHub
Sui is a blockchain platform featuring an object-centric state model and resource-oriented smart contracts. It utilizes parallel transaction execution to increase network throughput and supports programmable transaction blocks that bundle multiple operations into single atomic units. The platform distinguishes itself with a capability-based access control system and zero-knowledge login mechanisms, enabling users to authenticate via identity providers without seed phrases. It also implements deterministic object addressing to allow predictable state lookups and supports the creation of soulbo
Aggregates data from indexers and archives through a unified GraphQL RPC server.
Rustblockchaindistributed-ledger-technologymove
عرض على GitHub7,612

Awesome Data Sources GitHub Repositories

awesome-selfhosted/awesome-selfhosted

gatsbyjs/gatsby

hasura/graphql-engine

simple-icons/simple-icons

apify/crawlee

Vonng/ddia

saleor/saleor

open-ani/animeko

vercel/vercel

android10/Android-CleanArchitecture

Hunlongyu/ZY-Player

dbt-labs/dbt-core

keiyoushi/extensions

aws/aws-cdk

abraunegg/onedrive

datahub-project/datahub

boto/boto3

gridsome/gridsome

apify/crawlee-python

MystenLabs/sui

استكشف الوسوم الفرعية