10 repositories
Mechanisms to fetch database results as arrays or binary buffers instead of mapped objects for performance.
Distinct from Raw Data Accessors: Shortlist focused on AI completions or metadata, not raw value retrieval from database result sets.
Explore 10 awesome GitHub repositories matching data & databases · Raw Data Retrieval. Refine with filters or upvote what's useful.
This project is a PostgreSQL client library and SQL query builder for JavaScript and TypeScript. It provides a low-level database driver and connection manager to handle database sessions, along with a logical replication client for monitoring real-time changes. The library distinguishes itself with a high-performance bulk data streamer that utilizes the database copy command for importing and exporting large datasets. It also implements a logical replication protocol to facilitate real-time database synchronization through change subscriptions and channel-based notifications. The toolset co
Allows retrieving results as arrays of values or binary buffers instead of objects to increase performance.
This project is a large-scale dataset of hand-drawn sketches, providing millions of timestamped vector drawings and bitmaps for training machine learning models. It serves as a computer vision training reference and a neural network dataset, consisting of labeled human drawings used to develop image classification and recognition algorithms. The dataset is available as a vector drawing reference featuring pen stroke sequences and metadata, along with preprocessed numpy arrays. These resources support the development of drawing classifiers and the study of human drawing patterns. The data is provided in multiple formats, including raw vector data in newline-delimited JSON, normalized vector sequences, and grayscale bitmaps. It includes capabilities for category-based splitting and coordinate scaling to ensure consistency across different samples.
Retrieves unsimplified drawings containing pixel coordinates, timing information, and metadata in JSON format.
Webhook.site is a web-based tool that captures, inspects, and debugs incoming HTTP requests and emails sent to a unique URL, without requiring any server setup. It also functions as an API mocking and response modification tool, capable of generating mock APIs from OpenAPI specifications and altering HTTP responses, headers, and status codes for testing purposes. Beyond inspection, it serves as a platform for webhook automation and workflow orchestration, triggering multi-step automations—including database queries, SSH commands, and HTTP calls—when a webhook is received. The service distingu
Webhook.site returns the raw body and content-type of a specific or latest request as a direct response.
Timber is a PHP library that integrates the Twig template engine into WordPress themes, providing an object-oriented framework for theme development. It wraps WordPress data — posts, terms, users, menus, and comments — in structured PHP classes, allowing developers to work with objects instead of raw arrays while keeping HTML markup separate from PHP logic through Twig templates. The library distinguishes itself by offering a complete set of tools for modern WordPress theme building. It includes a file-based template hierarchy with fallback chains, dynamic image manipulation with resizing, cr
Retrieves unformatted theme header values for programmatic use in templates.
Grafana Tempo is a high-scale distributed tracing backend and columnar trace database. It serves as an observability data store that persists and queries spans and traces using OpenTelemetry standards, allowing for the analysis of request flows across microservices. The system distinguishes itself by using an object-store based backend with columnar Parquet storage. This architecture enables efficient attribute searching and large-scale data retrieval through dedicated attribute columnization and block-based data partitioning. It includes a specialized TraceQL query engine for filtering trace
Provides mechanisms to dump raw data or summaries for specific identifiers by iterating over storage blocks.
Connexion is a specification-driven framework for building APIs that automatically maps OpenAPI specifications to application logic. It uses these specifications to automate routing, request validation, and response serialization, linking API operations to backend handler functions via operation IDs. The project differentiates itself by providing a schema-driven mock server that simulates API behavior using example responses from the specification without requiring backend logic. It also includes a dynamic documentation hosting system that translates the API specification into a live interact
Allows retrieval of the raw request body and content-type directly during the request lifecycle.
This project is a vulnerability search engine and security knowledge base designed to collect and index public security disclosures. It works as a vulnerability database crawler that extracts technical reports and security flaws from websites to build a searchable local archive. The system uses a security knowledge indexer and a full-text inverted index to convert unstructured crawled data into a structured format. This enables keyword-based information retrieval, making it possible to locate specific security flaws and technical details through a dedicated search interface. The platform covers a broad range of security research capabilities, including threat intelligence gathering and public exploit search. It automates vulnerability data collection by parsing HTML content and organizing security patterns for faster querying.
Retrieves raw content from security forums and disclosure pages via simulated HTTP requests.
Liveblocks is a realtime collaboration infrastructure platform that synchronizes application state, documents, and user presence across multiple participants using conflict-free replicated data types. It provides a managed backend for collaborative text editors, threaded commenting and annotation systems, in-app notifications, and AI copilot deployment, all built on a WebSocket transport layer with server-side room management APIs. The platform distinguishes itself through a headless component primitive system that exposes unstyled React hooks and composable building blocks, allowing develope
Liveblocks retrieves a specific version of a room's Yjs document as a binary Yjs update.
REFramework is a runtime modding framework for games built on the RE Engine, providing a C# plugin system with typed proxy code generation, a Lua scripting interface, an ImGui-based UI system, a runtime debugging toolkit, and a VR integration layer. At its core, it intercepts and modifies internal game engine functions and state at runtime through hooking and scripting APIs, enabling deep inspection and manipulation of live game objects. The framework distinguishes itself through typed proxy code generation that auto-creates C# interfaces from the game's type database, giving compile-time saf
Provides deserialization of proprietary RSZ binary data into managed objects for game modding.
MNBVC is a dataset pipeline and toolkit designed for the collection, cleaning, and normalization of massive text and code corpora used to train large language models. It provides specialized tools for harvesting source code, commit histories, and repository metadata from version control platforms, alongside a multilingual text corpus collector for gathering parallel text and academic papers. The project distinguishes itself through comprehensive capabilities for processing diverse document types, including a PDF-to-text converter that transforms complex layouts and formulas into structured JS
Supplies vast collections of internet-sourced Chinese text and raw PDF data for training models.