Why is alibaba/datax a recommended SQL Data Retrieval GitHub Repositories repository?

Implements techniques for filtering and extracting specific data from relational tables using SQL WHERE clauses.

Why is risingwavelabs/risingwave a recommended SQL Data Retrieval GitHub Repositories repository?

Queries real-time data directly using a built-in serving layer and standard SQL.

Why is mouredev/hello-sql a recommended SQL Data Retrieval GitHub Repositories repository?

Provides guides on writing queries to extract and aggregate specific information from relational tables.

Why is redpanda-data/connect a recommended SQL Data Retrieval GitHub Repositories repository?

Extracts, filters, and aggregates data from relational tables using standard SQL query language.

Why is nlpchina/elasticsearch-sql a recommended SQL Data Retrieval GitHub Repositories repository?

Provides the ability to retrieve, filter, sort, and group data from indices using standard SQL syntax.

Why is apache/pinot a recommended SQL Data Retrieval GitHub Repositories repository?

Exposes a tabular data model for retrieving and analyzing information using standard SQL syntax.

Why is trailbaseio/trailbase a recommended SQL Data Retrieval GitHub Repositories repository?

Allows direct execution of SQL queries for complex data modeling and retrieval.

Why is biopython/biopython a recommended SQL Data Retrieval GitHub Repositories repository?

Extracts biological records from relational databases on demand as sequence record objects.

8 مستودعات

Awesome GitHub RepositoriesSQL Data Retrieval

Techniques for extracting, filtering, and aggregating data from relational tables using SQL.

Distinguishing note: None of the candidates focus on the general educational practice of writing retrieval queries; they focus on loaders or distributed engines.

Explore 8 awesome GitHub repositories matching data & databases · SQL Data Retrieval. Refine with filters or upvote what's useful.

اعثر على أفضل المستودعات باستخدام الذكاء الاصطناعي.سنبحث عن أفضل المستودعات المطابقة باستخدام الذكاء الاصطناعي.

alibaba/datax
alibaba/DataX
17,241عرض على GitHub
DataX is a distributed data integration framework and plugin-based ETL tool designed for synchronizing large datasets between heterogeneous sources and destinations. It functions as a JDBC data migration engine and offline synchronization tool, enabling the movement of data between relational databases, NoSQL stores, and object storage. The system utilizes a plugin-based connector architecture that decouples reader and writer logic, allowing it to map and transform data types across different storage engines using a standardized internal representation. This design supports heterogeneous data
Implements techniques for filtering and extracting specific data from relational tables using SQL WHERE clauses.
Java
عرض على GitHub17,241
risingwavelabs/risingwave
risingwavelabs/risingwave
9,093عرض على GitHub
RisingWave is a cloud-native streaming database and real-time analytics engine that uses standard SQL to process continuous data streams. It functions as a streaming data lakehouse, combining the capabilities of a streaming SQL database with a platform that integrates streaming ingestion with open table formats. The system is distinguished by its use of the PostgreSQL wire protocol, allowing it to integrate with existing SQL tools and drivers. It employs a decoupled compute and storage architecture, persisting streaming state and materialized views in cloud object storage to enable independen
Queries real-time data directly using a built-in serving layer and standard SQL.
Rustapache-icebergdata-engineeringdatabase
عرض على GitHub9,093
mouredev/hello-sql
mouredev/hello-sql
8,826عرض على GitHub
hello-sql is a collection of educational resources and practical guides designed for mastering relational database design, SQL query writing, and schema mapping. It provides a set of lessons and exercises for practicing the creation and manipulation of data within relational databases. The project includes a database schema workbook for designing tables and mapping relationships, alongside a dedicated SQL query guide for writing selection, filtering, and aggregation statements. These resources are delivered through a relational database tutorial and a broader SQL learning resource. The mater
Provides guides on writing queries to extract and aggregate specific information from relational tables.
Pythonbasesdedatoscursodatabase
عرض على GitHub8,826
redpanda-data/connect
redpanda-data/connect
8,681عرض على GitHub
Connect is a Kafka data integration platform and stream processing engine used to build declarative pipelines that move and transform messages between Kafka topics and external sources. It functions as a Kafka Connect framework and a change data capture tool, streaming real-time database modifications to synchronize data across distributed environments. The project differentiates itself through a dedicated mapping language for mutating and reshaping message payloads and the ability to execute custom processing logic within a sandboxed WebAssembly runtime. It also provides an observability pip
Extracts, filters, and aggregates data from relational tables using standard SQL query language.
Goamqpcqrsdata-engineering
عرض على GitHub8,681
nlpchina/elasticsearch-sql
NLPchina/elasticsearch-sql
7,012عرض على GitHub
This project provides a SQL interface for Elasticsearch, serving as a translator and database layer that allows users to retrieve, filter, and manipulate indices using structured query language. It functions by converting standard SQL statements into the native JSON query language used by the search engine. The system includes a geospatial SQL engine for executing location-based searches and distance calculations. It also features a query debugger used to visualize the translation process from SQL to search engine request bodies to verify the logic and accuracy of data retrieval. The capabil
Provides the ability to retrieve, filter, sort, and group data from indices using standard SQL syntax.
Java
عرض على GitHub7,012
apache/pinot
apache/pinot
6,098عرض على GitHub
Pinot is a distributed, columnar analytical database designed for high-concurrency, low-latency query processing. It functions as a real-time OLAP datastore, enabling interactive, user-facing analytics by ingesting and querying massive datasets from both streaming and batch sources. The system architecture relies on a centralized controller for cluster coordination and a distributed segment-based storage model to ensure horizontal scalability. The platform distinguishes itself through a hybrid ingestion pipeline that unifies real-time event streams and historical batch data into a single quer
Exposes a tabular data model for retrieving and analyzing information using standard SQL syntax.
Java
عرض على GitHub6,098
trailbaseio/trailbase
trailbaseio/trailbase
5,324عرض على GitHub
Trailbase هي منصة خلفية كخدمة (BaaS) يتم تسليمها كملف تنفيذي واحد يدمج محرك قاعدة بيانات في الوقت الفعلي، ومدير هوية ووصول، ومولد API آمن للأنواع. توفر بيئة خلفية شاملة بما في ذلك محرك تخزين مدعوم بـ SQLite وخادم وقت تشغيل WebAssembly لتنفيذ المنطق المخصص. تتميز المنصة بتحويل مخططات قاعدة البيانات تلقائياً إلى JSON APIs مع روابط عميل عبر اللغات، ومن خلال السماح بتنفيذ مكونات محمولة للعرض من جانب الخادم ومسارات HTTP مخصصة. كما تدمج قدرات قاعدة البيانات المتجهية لدعم تخزين التضمينات والبحث المتجهي القائم على التشابه. يغطي النظام مجموعة واسعة من القدرات التشغيلية، بما في ذلك مصادقة المستخدم مع دعم تسجيل الدخول الاجتماعي، وقوائم التحكم في الوصول لرؤية البيانات، ومزامنة pub-sub لتحديثات البيانات الحية. كما يوفر أدوات لإدارة مخططات قاعدة البيانات عبر عمليات ترحيل SQL والتعامل مع البيانات الجغرافية المكانية.
Allows direct execution of SQL queries for complex data modeling and retrieval.
Rustauthenticationdatabaserest-api
عرض على GitHub5,324
biopython/biopython
biopython/biopython
5,078عرض على GitHub
Biopython هي مكتبة معلوماتية حيوية لـ Python توفر أدوات لتحليل ومعالجة وتحليل التسلسلات البيولوجية، والهياكل الجزيئية، والأشجار التطورية. تعمل كمحلل للتسلسلات البيولوجية للبيانات الجينومية والبروتينية عبر تنسيقات ملفات متعددة معيارية في الصناعة، وتعمل كواجهة للاستعلام عن البيانات البيولوجية والاقتباسات من مستودعات NCBI Entrez. يتميز المشروع بمجموعات أدوات متخصصة لتحليل بنية البروتين وبناء الأشجار التطورية. يتضمن محلل بنية البروتين لمعالجة ملفات PDB و mmCIF لحساب الهندسة الجزيئية، بالإضافة إلى مجموعة أدوات للأشجار التطورية لتحليل العلاقات التطورية بين الأنواع. تغطي المكتبة مجموعة واسعة من قدرات المعلوماتية الحيوية، بما في ذلك تحليل التسلسل الجينومي للنسخ والترجمة، وإدارة محاذاة التسلسلات، وحسابات الوراثة السكانية. كما توفر أدوات تحليل هيكلية لمعالجة الإحداثيات الذرية ثلاثية الأبعاد، بالإضافة إلى أدوات لتصور الميزات الجينومية ونمذجة البيانات الجغرافية الحيوية. يتكامل النظام مع ملفات المعلوماتية الحيوية الثنائية الخارجية عبر تغليف الأدوات ويدعم تخزين السجلات البيولوجية المستمرة من خلال تخزين التسلسلات المدعوم بـ SQL.
Extracts biological records from relational databases on demand as sequence record objects.
Pythonbioinformaticsbiopythondna
عرض على GitHub5,078

Awesome SQL Data Retrieval GitHub Repositories

alibaba/DataX

risingwavelabs/risingwave

mouredev/hello-sql

redpanda-data/connect

NLPchina/elasticsearch-sql

apache/pinot

trailbaseio/trailbase

biopython/biopython

استكشف الوسوم الفرعية