22 مستودعات
Tools and configurations for managing database indexes to optimize query performance and enforce data constraints.
Distinguishing note: Focuses specifically on the management and configuration of database indexes within a data modeling framework, distinct from general database management.
Explore 22 awesome GitHub repositories matching data & databases · Database Indexing. Refine with filters or upvote what's useful.
Twenty is a headless customer relationship management framework that enables developers to build, version, and deploy custom business applications using code. By utilizing a declarative approach to data modeling, the platform allows for the definition of custom objects, fields, and complex relationships directly within the source code. This schema-driven architecture automatically generates corresponding REST and GraphQL APIs, ensuring that data structures and interface components remain synchronized across development and production environments. The platform distinguishes itself through a m
Optimizes database query performance by allowing the definition of single or multi-column indexes and unique constraints on custom data objects.
This project is a comprehensive educational resource focused on the principles, patterns, and trade-offs required to design scalable, reliable, and high-performance distributed systems. It provides a structured curriculum that covers the fundamental architectural strategies necessary for building modern software infrastructure, ranging from high-level system decomposition to low-level networking and data management. The repository distinguishes itself by offering deep dives into complex architectural patterns, such as microservices-based decomposition, event-driven communication, and command-
Explains the performance trade-offs of using indexes for data retrieval.
Payload is a headless content management system and application framework that uses a code-first approach to define data schemas and administrative interfaces. By utilizing a centralized, type-safe configuration object, it automatically generates database schemas, API endpoints, and a fully customizable admin panel. The system is built on a database-agnostic architecture, allowing it to interface with various storage engines while providing a unified, type-safe API for server-side operations, REST, and GraphQL. What distinguishes Payload is its deep extensibility and developer-centric design.
Optimizes query performance by allowing the creation of compound indexes on multiple fields.
TypeORM is an object-relational mapper for TypeScript and JavaScript that bridges the gap between object-oriented application code and relational database tables. It provides a comprehensive data persistence layer that allows developers to define database entities using class decorators or configuration objects, enabling seamless interaction with data through object-oriented patterns. The project distinguishes itself through a flexible architecture that supports both the data mapper and repository patterns, alongside a fluent query builder that translates high-level method calls into platform
Enables programmatic definition of indexes on entity columns to optimize query performance.
Knex is a multi-dialect database client that provides a programmatic SQL query builder, a connection pool manager, and a versioned schema migration tool. It enables programmatic database interaction across multiple SQL engines, including PostgreSQL, MySQL, SQLite3, SQL Server, CockroachDB, and Oracle. The project distinguishes itself through a fluent interface for constructing complex SQL statements and a dedicated framework for database seeding. It utilizes specialized dialects to translate generic query representations into database-specific syntax while maintaining a consistent API across
Provides programmatic tools for building and managing specialized database indexes to optimize retrieval performance.
SQLModel is a type-safe object-relational mapping library for Python that integrates database schema definitions with data validation logic. By combining these two roles into a single class, it allows developers to manage relational data structures and enforce data integrity for web APIs simultaneously. The framework is built to support asynchronous database operations, enabling high-performance applications to execute queries and transactions without blocking the main execution thread. The library distinguishes itself by leveraging Python type hints to provide IDE autocompletion and compile-
Configures model fields to be indexed in the database to optimize query performance.
FoundationDB is an ACID-compliant distributed transactional key-value store. It functions as a scalable database engine that ensures strict serializability and data consistency across a cluster of servers using a shared-nothing architecture. The system is distinguished by its multi-region replication capabilities, allowing data to be synchronized across different datacenters for high availability and disaster recovery. It utilizes optimistic concurrency control to manage distributed transactions and employs a majority-based coordination system to maintain cluster state. The platform provides
Builds simple or spatial indexes to enable efficient data retrieval across multiple search criteria.
NeDB is a JavaScript embedded NoSQL document store designed for Node.js and the browser. It functions as an in-memory data store with the option to persist documents to a local file system, ensuring data survives application restarts. The project utilizes a MongoDB-compatible API to perform data operations, allowing it to serve as a lightweight document indexing system and a persistent file database without requiring a separate database server. Capabilities include querying, inserting, updating, and deleting documents, as well as the ability to create indexes on specific fields to accelerate
Creates indexes on specific fields to accelerate data retrieval and enforce value uniqueness.
Citus is a PostgreSQL extension that transforms a standard database into a distributed system. It functions as a sharding framework and distributed SQL engine, enabling horizontal scaling by partitioning tables across a cluster of nodes. By utilizing a coordinator-worker topology, the system manages metadata and routes queries to the appropriate nodes, allowing for parallel execution of complex operations across distributed data shards. The platform distinguishes itself through its specialized support for multi-tenant architectures and real-time analytical processing. It enables tenant-based
Scans worker nodes to identify unused indexes on distributed tables to reduce storage overhead.
This project is a software engineering style guide and a curated collection of architectural patterns and coding standards. It provides a multi-language coding standard to ensure maintainable software across Ruby, Python, JavaScript, and Swift. The project establishes a development workflow specification for version control, continuous integration, and peer review to maintain a linear project history. It also includes a web accessibility framework based on ARIA and WCAG standards, using design tokens and semantic HTML patterns to build inclusive interfaces. The guides cover a broad range of
Recommends efficient search strategies using partial and compound indexes to improve performance.
Sea-ORM is an asynchronous SQL object-relational mapper and database toolkit for mapping relational tables to strongly typed objects. It provides a relational database mapper for performing CRUD operations across MySQL, PostgreSQL, and SQLite, and includes a programmatic dynamic query builder for constructing complex SQL statements. The project distinguishes itself by providing a GraphQL schema generator that transforms database entities into typed schemas with built-in pagination and filters. It also features a dedicated database migration tool for defining and applying versioned schema chan
Generates and executes SQL commands to create database indices based on entity definitions.
Soar is a suite of specialized tools designed for analyzing MySQL performance, advising on indexing, and optimizing SQL syntax. It functions as a performance analyzer, index advisor, and query optimizer to identify bottlenecks and suggest structural improvements for faster execution. The project distinguishes itself through a system for rewriting SQL statements into optimized equivalent versions using custom heuristic rules and patterns. It also features a dedicated index advisor that evaluates query patterns and database metadata to recommend the creation of new indexes. Its broader capabil
Evaluates query patterns and execution plans to recommend the most efficient indexing strategies.
The mongo-go-driver is a Go library for building applications that integrate with a MongoDB document store. It enables the storage and retrieval of flexible document data by providing a bridge between Go backends and the database. The driver implements specialized capabilities for semantic vector search, allowing the handling and execution of high-dimensional vector data for similarity-based retrieval. It also supports full-text search via linguistic analysis and programmatic search index management. The project covers a broad range of database operations, including document-based CRUD, bulk
Allows programmatic creation and configuration of database indexes to optimize query performance.
This project is a MongoDB database driver and object-relational mapper that brings MongoDB support to the Laravel Eloquent model and query builder. It provides a NoSQL model mapper that allows MongoDB collections to be mapped to object-oriented models using the Active Record pattern. The integration enables the use of a fluent query builder for constructing queries and aggregation pipelines without writing raw database syntax. It supports schema-less model integration, allowing applications to manage unstructured data while maintaining compatibility with standard object-oriented patterns. Th
Provides a schema builder to create and manage unique, compound, sparse, and geospatial indexes.
Dejavu is a Python audio fingerprinting library and recognition engine. It functions as a digital audio signature tool used to analyze sound waves and create unique identifiers for the purposes of audio search and retrieval. The project enables automatic music identification by matching live audio feeds or recorded clips against a database of fingerprints. It covers audio content matching and digital audio archiving to identify original source recordings from a stored collection. The system incorporates capabilities for generating audio fingerprints, identifying audio tracks, and recognizing
Provides a keyed database indexing system to allow rapid lookup of matching audio fingerprints.
Seek-Tune is an audio fingerprinting library that implements a Shazam-like algorithm for identifying songs from audio recordings. It generates acoustic fingerprints from audio signals and matches them against a known database to recognize songs. The library converts audio into a time-frequency spectrogram using FFT-based frequency analysis, then extracts peak points to create compact, unique fingerprints for each song. It uses combinatorial hashing to combine nearby peak pairs into hash values with time offsets, enabling efficient database lookup and matching through a peak-pair matching algo
Saves generated audio fingerprints to a configurable database for later matching and retrieval.
TinyBase هو مخزن بيانات تفاعلي وقاعدة بيانات علائقية في الذاكرة مصممة لاستمرار حالة جهة العميل. يعمل كمحرك مزامنة محلي أولاً يدمج الحالة الموزعة باستخدام أنواع بيانات متماثلة خالية من التعارض (CRDTs) وساعات منطقية لضمان تقارب البيانات الحتمي. يتميز المشروع بمكتبة للتحقق من المخطط تحول التعريفات الخارجية من أدوات مثل Zod وYup وTypeBox إلى تعريفات مخزن آمنة للنوع. ويوفر بنية تحتية للتحرير التعاوني في الوقت الفعلي، باستخدام المزامنة مع Automerge وYjs وPartyKit للحفاظ على حالة متسقة عبر عملاء وخوادم متعددة. تشمل مساحة القدرات نمذجة البيانات العلائقية مع الجداول والمفاتيح الخارجية، والاستعلام والفهرسة الشبيهة بـ SQL، والمعاملات الذرية للطفرات المجمعة. يدعم مجموعة واسعة من محولات الاستمرار، بما في ذلك تخزين المتصفح، وSQLite، وCloudflare Durable Objects. يوفر النظام أيضاً ربط حالة ثنائي الاتجاه ومكونات تعريفية للتكامل مع React وSolidJS وSvelte.
Creates indexes that allow rapid retrieval of matching row objects within large data tables.
This project is an anime scene reverse image search engine that matches a screenshot to the exact anime episode and timestamp. It is designed as a self-hosted search service that can be deployed using Docker containers and pre-indexed databases, enabling private operation on local or custom infrastructure. At its core, the system extracts visual features from frames using a convolutional neural network trained on anime imagery. Query images provided via URL are processed through the same feature extraction pipeline, and an approximate nearest neighbor search matches the query against millions
Loads a snapshot of frame features, metadata, and timestamps into the search service for instant readiness.
Sonyflake هي مكتبة معرفات فريدة موزعة تنشئ معرفات مرتبة زمنياً وخالية من التصادم عبر أجهزة متعددة. تعمل كمولد يعتمد على الوقت يقوم بتشفير الطوابع الزمنية في معرفات، مما يضمن ترتيب المعرفات بشكل رتيب. يعمل النظام بدون سلطة تنسيق مركزية، وينتج معرفات فريدة عبر العقد الموزعة دون الحاجة إلى اتصال حالة مشترك أو أقفال مركزية. يمنع التصادم بين العقد المختلفة من خلال دمج معرفات الأجهزة المدركة للمضيف، والتي يتم حلها باستخدام عناوين الشبكة الخاصة أو بيانات تعريف مثيلات السحابة. تدعم المكتبة تخصيص تخصيص البتات، مما يسمح بموازنة عدد الأجهزة ومعدلات التوليد. تمكن هذه القدرات من توليد مفاتيح فريدة قابلة للفرز لفهرسة قواعد البيانات الموزعة وإدارة الهوية عبر الخدمات المصغرة.
Generates sortable unique keys that optimize write performance and data organization in distributed database indexes.
MongoEngine هو معين كائن-مستند (ODM) بلغة Python يترجم سجلات قاعدة البيانات إلى كائنات لتوفير واجهة موجهة للكائنات لاستمرارية البيانات. يعمل كمدير مستندات ومتحقق من المخطط لـ MongoDB، حيث يقوم بتعيين الفئات إلى المستندات لفرض أنواع البيانات وقواعد التحقق. يوفر المشروع نظام استعلام (queryset) محمل بكسل (lazy-loaded) لتصفية وتجميع المجموعات باستخدام بناء جملة Pythonic. يدير هياكل البيانات المعقدة من خلال ميزات مثل وراثة المستندات، ومعالجة المستندات المضمنة العودية، وربط الكائنات القائم على المرجع. تغطي المكتبة قدرات واسعة بما في ذلك ترحيل المخطط، والبحث النصي الكامل، وإدارة الملفات الثنائية الكبيرة عبر نظام ملفات GridFS. كما تتضمن أدوات لتحسين فهرس قاعدة البيانات، وتوصيف أداء الاستعلام، وخطافات دورة الحياة القائمة على الإشارة لأتمتة المنطق أثناء أحداث المستند.
Supports the creation of collection indexes with multi-field keys and background processing.