# Machine Learning Feature Stores

> Search results for `feature store for serving features to models` on awesome-repositories.com. 120 total matches; showing the first 50.

Explore on the web: https://awesome-repositories.com/q/feature-store-for-serving-features-to-models

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [this search on awesome-repositories.com](https://awesome-repositories.com/q/feature-store-for-serving-features-to-models).**

## Results

- [feature-engine/feature_engine](https://awesome-repositories.com/repository/feature-engine-feature-engine.md) (2,247 ⭐) — Feature engineering and selection open-source Python library compatible with sklearn.
- [open-compass/vlmevalkit](https://awesome-repositories.com/repository/open-compass-vlmevalkit.md) (3,824 ⭐) — VLMEvalKit is a vision-language model evaluation framework and inference engine designed to run standardized benchmarks and measure model accuracy across diverse visual datasets. It serves as a multimodal model benchmark and performance toolkit for calculating metrics and comparing model responses.

The toolkit includes a specialized visual reasoning evaluator that uses adversarial samples to distinguish actual image understanding from reliance on language patterns. It also provides capabilities for image generation evaluation, testing a model's ability to create or modify visuals based on text descriptions.

The framework covers multimodal inference execution and image-to-text generation, supported by batch inference execution to increase throughput. It provides utilities for benchmark score calculation, a model response browser for reviewing raw outputs, and attention mechanism optimization to reduce memory usage during inference.
- [dragonflydb/dragonfly](https://awesome-repositories.com/repository/dragonflydb-dragonfly.md) (30,688 ⭐) — Dragonfly is a high-performance, multi-model in-memory data store designed to serve as a drop-in replacement for existing database infrastructures. By utilizing a multi-threaded, shared-nothing architecture and a fiber-based concurrency model, it maximizes CPU utilization and minimizes latency for read and write operations. The system supports a wide range of data structures, including strings, hashes, lists, sets, sorted sets, and JSON documents, while maintaining full compatibility with standard industry wire protocols and client libraries.

What distinguishes Dragonfly is its focus on efficiency and scalability through advanced memory management and request processing. It employs a lock-free, cache-friendly hash table structure and zero-copy serialization to reduce overhead during high-throughput operations. For durability, the system utilizes asynchronous, snapshot-based persistence that captures the state of the dataset without blocking active requests. Furthermore, it provides built-in support for horizontal scaling and cluster management, allowing for the distribution of large datasets across multiple nodes to ensure high availability.

Beyond core storage, the platform includes a comprehensive suite of operational and analytical capabilities. It features integrated support for geospatial data management, real-time message brokering via publish-subscribe patterns, and full-text search. To handle massive datasets efficiently, the engine incorporates probabilistic data structures for cardinality estimation, frequency tracking, and membership testing. These features are complemented by robust administrative tools, including access control, request rate limiting, and detailed server monitoring.
- [devcontainers/cli](https://awesome-repositories.com/repository/devcontainers-cli.md) (2,482 ⭐) — This is a command line tool for building and managing isolated development environments based on the Development Container Specification. It functions as an OCI container image builder and a provisioner for instantiating standardized containers within automated continuous integration workflows.

The tool includes a system for injecting pre-configured software and toolsets into containers using a registry of reusable installation modules. This allows for the creation of shareable features and the installation of specific languages, CLI tools, and software dependencies.

It covers the automation of environment bootstrapping, the generation of prebuilt images to reduce startup time, and the execution of commands and lifecycle scripts within running containers. It also handles the mapping of workspace folders to ensure consistent setups across different machines.
- [mindsdb/mindsdb](https://awesome-repositories.com/repository/mindsdb-mindsdb.md) (39,313 ⭐) — MindsDB is an AI-native database engine that treats machine learning models and autonomous agents as virtual tables. By mapping external data sources, predictive models, and third-party services directly into the database schema, it enables users to perform inference, data retrieval, and complex orchestration using standard SQL syntax.

The platform distinguishes itself through an autonomous agent orchestrator that executes iterative reasoning loops, allowing agents to plan data access and synthesize natural language responses from connected knowledge bases. It functions as a federated data gateway, orchestrating queries across disparate external systems without requiring data movement or local storage. This architecture is supported by a modular connector framework that facilitates bidirectional communication with a wide range of cloud services, databases, and model registries.

Beyond its core orchestration capabilities, the system provides comprehensive tools for managing the lifecycle of agents and models, including custom model uploads and isolated execution environments. It includes administrative features for organizing schema objects into project namespaces, configuring persistent storage, and managing API connectivity. The platform is an open-source server that can be deployed across local or cloud environments, with Docker recommended for initial setup.
- [google/cpu_features](https://awesome-repositories.com/repository/google-cpu-features.md) (2,607 ⭐) — A cross platform C99 library to get cpu features at runtime.
- [viljamis/feature.js](https://awesome-repositories.com/repository/viljamis-feature-js.md) (3,148 ⭐) — Feature.js is a fast, simple and lightweight browser feature detection library in 1kb.
- [formbricks/formbricks](https://awesome-repositories.com/repository/formbricks-formbricks.md) (12,391 ⭐) — Formbricks is an open-source survey and feedback platform designed to help teams capture and analyze user insights through targeted, in-app, and website-based interactions. It functions as a comprehensive customer experience analytics system that allows organizations to maintain full control over their data, user attributes, and survey workflows.

The platform distinguishes itself through its event-driven architecture, which enables precise behavioral targeting by triggering surveys based on specific user actions or application events. It supports deep integration with external ecosystems by automatically synchronizing response data to CRMs, databases, and communication tools, while providing programmatic interfaces for managing resources and automating feedback loops.

Beyond core collection, the system includes advanced logic for conditional branching, scoring, and personalized routing to create adaptive survey experiences. It offers extensive customization options, including white-labeling, CSS overrides, and multi-channel distribution across web, mobile, and email environments.

The platform is built for self-hosting, supporting containerized deployments with built-in multi-tenant data isolation and enterprise-grade security features like single sign-on and role-based access control.
- [localforage/localforage](https://awesome-repositories.com/repository/localforage-localforage.md) (25,755 ⭐) — This project is an asynchronous key-value store designed for client-side data persistence. It provides a unified interface that allows applications to save and retrieve complex data types, including binary objects, while maintaining responsiveness through non-blocking operations. By enabling offline-first functionality, it ensures that data remains accessible even when a network connection is unavailable.

The library distinguishes itself through a driver-based abstraction layer that automatically detects the most efficient storage mechanism available in the current browser or mobile environment. It includes a pluggable architecture that supports custom storage backends and provides a fallback mechanism to ensure consistent performance across different platforms. To manage data organization, it features a containerized architecture that allows developers to isolate data sets into independent namespaces, preventing key collisions between different application modules.

Beyond its core storage capabilities, the system handles the serialization of complex objects and binary data automatically. It supports both promise-based and callback-oriented syntax for asynchronous tasks and includes type definitions to facilitate integration into development workflows. Configuration options allow for the customization of storage parameters, such as driver selection and database sizing, to meet specific application requirements.
- [ghostty-org/ghostty](https://awesome-repositories.com/repository/ghostty-org-ghostty.md) (56,570 ⭐) — Ghostty is a cross-platform terminal emulator that utilizes GPU-accelerated rendering to provide high-performance text output and low-latency input. It functions as a unified terminal environment, maintaining consistent feature parity and configuration across different operating systems through a specialized windowing abstraction layer.

The application is built on a declarative configuration engine that allows users to manage settings, keybindings, and visual themes using modular, plain-text files. It supports dynamic hot-reloading, enabling users to apply configuration changes in real-time without interrupting active sessions. The emulator also features system-aware theme synchronization, which automatically transitions between light and dark color palettes based on operating system preferences.

Beyond its core rendering capabilities, the project provides a foundation for interactive command-line tools by implementing standard terminal control sequences and parsing ANSI escape codes. This ensures compatibility with existing terminal-aware software and allows for consistent visual output across diverse development environments. Users can access comprehensive configuration documentation directly from the command line to manage their setup without requiring an external connection.
- [opensource-opentype/features](https://awesome-repositories.com/repository/opensource-opentype-features.md) (0 ⭐) — OpenType feature reference material
- [marcuswestin/store.js](https://awesome-repositories.com/repository/marcuswestin-store-js.md) (13,983 ⭐) — Store.js is a client-side state persistence library and web storage abstraction layer. It provides a consistent browser key-value storage wrapper that manages the persistence of application state across sessions using the most capable available local storage mechanism.

The engine is extensible, allowing for the integration of custom storage backends and plugins to modify how data is read and written. It functions as a reactive web storage manager, monitoring specific keys to trigger notifications when stored values are updated.

The library handles automatic data serialization for complex types and implements time-based data expiration. It includes mechanisms for cross-browser compatibility through automatic storage fallbacks and provides utilities for assigning default values and detecting storage availability in the browser environment.
- [chainlit/chainlit](https://awesome-repositories.com/repository/chainlit-chainlit.md) (12,213 ⭐) — Chainlit is a Python framework designed for building and deploying interactive, stateful conversational AI interfaces. It provides a backend-driven platform that connects language models and agent frameworks to a web-based chat frontend, managing the complexities of session state, message history, and real-time communication.

The framework distinguishes itself by offering a component-based UI builder that allows developers to inject interactive widgets, rich media, and data visualizations directly into the chat stream. It supports the visualization of complex agent workflows, enabling users to inspect intermediate reasoning steps and tool usage in real-time. Additionally, the platform includes built-in support for secure user authentication, persistent conversation history, and the ability to embed chat widgets into existing web applications with bidirectional communication.

The system covers a broad range of capabilities, including document processing, vector database integration for context-aware retrieval, and comprehensive observability tools for debugging and monitoring model interactions. It also provides extensive configuration options for interface customization, localization, and access control, ensuring that applications can be tailored to specific organizational requirements.

The project is distributed as a Python library and includes a command-line interface to facilitate project setup, configuration, and deployment.
- [willkoehrsen/feature-selector](https://awesome-repositories.com/repository/willkoehrsen-feature-selector.md) (0 ⭐) — Feature selector is a tool for dimensionality reduction of machine learning datasets.
- [mozilla/localforage](https://awesome-repositories.com/repository/mozilla-localforage.md) (25,780 ⭐) — localForage is a browser storage wrapper and key-value store that provides a unified JavaScript API for persisting data. It acts as an abstraction layer over IndexedDB, WebSQL, and localStorage, allowing web applications to save data offline.

The library features a priority-based driver fallback system that automatically detects and selects the most capable storage engine available in the browser. It simplifies data management by providing an asynchronous interface for storing complex JavaScript objects and binary buffers without requiring manual serialization.

The project covers broad capability areas including browser client-side storage, IndexedDB management, and the creation of isolated storage instances to keep data sets separate. It also supports the implementation of custom storage drivers to control how data is physically written and read.
- [fastapi/fastapi](https://awesome-repositories.com/repository/fastapi-fastapi.md) (99,260 ⭐) — FastAPI is a web framework for building APIs with Python. It leverages standard language type hints to provide automatic data validation, request parsing, and interactive API documentation generation. The framework supports asynchronous request handling and manages execution contexts to prevent blocking the main event loop.

The project includes a dependency injection system that allows for the resolution and injection of reusable components into request handlers. This system supports request-scoped caching, lifecycle management, and integration with security mechanisms like OAuth2 and JSON Web Tokens. Developers can organize applications into modular routers and mount sub-applications to manage complex routing logic.

Infrastructure features include middleware support for cross-origin resource sharing, background task management, and static file serving. The framework automatically generates OpenAPI specifications for defined endpoints, which can be customized through metadata and schema extensions. Testing utilities are provided to simulate HTTP and WebSocket connections, allowing for isolated verification of application behavior.
- [ericlbuehler/mistral.rs](https://awesome-repositories.com/repository/ericlbuehler-mistral-rs.md) (6,597 ⭐) — mistral.rs is an inference engine for large language models that runs locally and exposes models behind OpenAI and Anthropic-compatible APIs. It serves as a multi-model serving platform, capable of loading several models in a single server process with per-request routing and on-demand loading and unloading. The engine supports multimodal inference, processing text alongside images, video, audio, and speech inputs, and includes a quantized model deployment runtime that reduces memory use and speeds up inference on consumer hardware.

The project distinguishes itself through an agentic tool execution framework that runs server-side tools like code execution, shell commands, and web search in an automated loop during model generation, with session state persistence. It provides an in-process inference engine that can be embedded directly into Rust or Python applications without a separate server process, and includes an in-situ quantization engine that converts model weights to lower precision at load time with per-layer tuning. The system supports structured output constraints, forcing model output to conform to JSON Schema or grammar specifications during decoding, and offers automatic architecture detection that identifies model type, quantization format, and chat template from a Hugging Face model ID.

The platform includes capabilities for managing LoRA adapters, composing models as mixture-of-experts configurations, and running distributed inference across multiple GPUs or nodes using tensor parallelism and ring transport. It provides a built-in web chat interface, supports speculative decoding with a smaller assistant model, and offers benchmarking, logging, and Prometheus metrics for monitoring. The project can be run from a configuration file, with options for customizing build processes, tuning hardware settings automatically, and managing model caches.
- [jundongl/scikit-feature](https://awesome-repositories.com/repository/jundongl-scikit-feature.md) (1,571 ⭐) — open-source feature selection repository in python
- [readme/featured](https://awesome-repositories.com/repository/readme-featured.md) (0 ⭐)
- [deeppavlov/deeppavlov](https://awesome-repositories.com/repository/deeppavlov-deeppavlov.md) (6,985 ⭐) — DeepPavlov is a conversational AI framework and deep learning NLP library designed for building end-to-end dialogue systems and chatbots. It functions as an NLP pipeline orchestrator that allows users to compose pre-trained models and text processing components into sequential data flows for complex linguistic tasks.

The system is distinguished by its ability to act as a chatbot deployment server, exposing trained conversational models as web services via REST and Socket APIs. It utilizes JSON-based pipeline configurations and dynamic variable interpolation to decouple model logic from infrastructure, while automating the management of model dependencies and pre-trained weight injection.

The toolkit covers a wide range of information extraction and model development capabilities. This includes named entity recognition, entity linking, and various question answering systems spanning open-domain to knowledge base retrieval. It also provides tools for text classification, linguistic analysis, supervised model training, and hyperparameter optimization.

Additional operational features include text preprocessing and vectorization utilities, document ranking for information retrieval, and a dedicated metrics endpoint for monitoring service performance, latency, and throughput.
- [allinurl/goaccess](https://awesome-repositories.com/repository/allinurl-goaccess.md) (20,242 ⭐) — This project is a terminal-based log analyzer and real-time traffic monitoring system designed for web server environments. It processes raw server access logs to generate structured statistical insights, providing visibility into visitor behavior, bandwidth usage, and request latency. The tool functions as a high-performance utility capable of handling large-scale datasets through multi-threaded parsing and memory-efficient data structures.

The software distinguishes itself by offering both an interactive terminal interface and a web-based dashboard that updates in real-time via WebSocket streaming. It supports persistent data storage on disk, allowing for incremental updates and the retention of analysis state across sessions. Users can define custom log formats to ingest data from diverse web and cloud providers, while built-in security features enable the identification of malicious traffic patterns, bot activity, and geographic origins of requests.

Beyond core monitoring, the tool provides comprehensive reporting capabilities, including virtual host tracking and the ability to export processed statistics into machine-readable formats for external data pipelines. It is designed for flexible deployment, supporting execution within isolated container environments to facilitate portable log ingestion and analysis.
- [fastapi/sqlmodel](https://awesome-repositories.com/repository/fastapi-sqlmodel.md) (18,137 ⭐) — SQLModel is a type-safe object-relational mapping library for Python that integrates database schema definitions with data validation logic. By combining these two roles into a single class, it allows developers to manage relational data structures and enforce data integrity for web APIs simultaneously. The framework is built to support asynchronous database operations, enabling high-performance applications to execute queries and transactions without blocking the main execution thread.

The library distinguishes itself by leveraging Python type hints to provide IDE autocompletion and compile-time safety for database operations, effectively eliminating the need for raw SQL. It simplifies complex relational tasks by allowing developers to navigate and manage related records through object attributes, while automatically handling session lifecycles and transaction commits. Furthermore, it includes built-in support for circular dependency resolution and forward-reference type definitions, which helps maintain clean code organization in large-scale projects.

Beyond its core mapping capabilities, the project provides a comprehensive suite of tools for data lifecycle management, including automated schema initialization, migration tracking, and granular control over cascade operations. It also features robust testing utilities, such as dependency overrides and support for in-memory database execution, to facilitate isolated and efficient test environments. Security is addressed through automatic query sanitization, which protects database interactions from malicious input.
- [mlflow/mlflow](https://awesome-repositories.com/repository/mlflow-mlflow.md) (26,554 ⭐)
- [rse/es6-features](https://awesome-repositories.com/repository/rse-es6-features.md) (6,163 ⭐) — This project is a reference catalogue of the new syntax and behavioral changes introduced in the ECMAScript 6 (ES6) specification for JavaScript. It serves as a curated overview and comparison table of the language features defined by the standard, documenting additions such as block scoping with `let` and `const`, arrow function syntax, default parameter handling, and rest parameter collection.

The reference is structured as a side-by-side comparison chart that maps old and new JavaScript syntax patterns, providing a focused guide for each capability. It covers the core language fundamentals introduced in ES6, including lexical `this` binding in callbacks, immutable variable declarations, block-scoped functions and variables, and concise anonymous function syntax.

The documentation is organized around a specification-driven feature mapping, breaking down each ES6 capability into its smallest testable unit. It includes a cross-engine compatibility matrix that documents which JavaScript engines support each feature, serving as a canonical source for tooling, education, and static code analysis baselines.
- [jonathan-bird/feature-switch](https://awesome-repositories.com/repository/jonathan-bird-feature-switch.md) (0 ⭐) — Feature Switching (made easy) for PHP
- [paddlepaddle/paddlenlp](https://awesome-repositories.com/repository/paddlepaddle-paddlenlp.md) (12,953 ⭐) — PaddleNLP is a development library and toolkit for training, fine-tuning, and deploying large and small language models using the PaddlePaddle framework. It provides a comprehensive suite for the entire natural language processing lifecycle, from model development to high-performance inference.

The project features a standardized model zoo for loading and managing pre-trained models and tokenizers through a unified interface. It distinguishes itself with a specialized model compression framework that reduces memory footprints via weight precision conversion and lossless size optimization, alongside an inference engine that utilizes operator fusion and backend-agnostic execution to increase token generation speed.

The library covers a broad range of capabilities including distributed parallel training, parameter-efficient fine-tuning, and model weight merging. It also supports a full natural language processing pipeline for tasks such as text generation and zero-shot structured information extraction.
- [embarcadero/dev-cpp](https://awesome-repositories.com/repository/embarcadero-dev-cpp.md) (2,882 ⭐) — Dev-Cpp is a comprehensive development suite that serves as a C++ integrated development environment, a cross-platform application builder, and a visual UI designer. It provides a toolchain for writing, compiling, and debugging native C++ applications on Windows, while offering a framework to create native binaries for desktop, mobile, and IoT devices from a single codebase.

The project distinguishes itself by integrating an embedded SQL database engine and a REST API development platform directly into the workflow. It includes an AI-assisted coding tool that leverages large language models to automate code generation and orchestrate development workflows.

The platform covers a broad range of capabilities, including relational data management with SQL execution, native code profiling, and the construction of responsive user interfaces using visual components. It also supports the development of server-side web interfaces, hardware protocol connectivity for IoT devices, and identity management through directory services.
- [frewsxcv/cargo-all-features](https://awesome-repositories.com/repository/frewsxcv-cargo-all-features.md) (186 ⭐) — A Cargo subcommand to build and test all feature flag combinations.
- [cifertech/esp32-div](https://awesome-repositories.com/repository/cifertech-esp32-div.md) (2,552 ⭐) — ESP32-DIV is a handheld wireless pentesting platform designed for analyzing and disrupting a wide range of wireless protocols. It functions as a multi-band radio analyzer, RFID and NFC tag manipulator, and GPS wardriving logger, providing a unified interface for security auditing and signal research.

The project distinguishes itself through a modular radio abstraction that allows switching between Wi-Fi, BLE, Sub-GHz, RFID/NFC, and infrared hardware modules. It features a touch-driven TFT interface for navigating toolsets and managing signal profiles, as well as the ability to emulate Bluetooth HID devices for executing predefined scripts on target hosts.

The platform covers comprehensive wireless analysis capabilities, including packet injection and monitoring for Wi-Fi, traffic sniffing for BLE, and signal capture and replay for Sub-GHz and infrared spectra. It also provides specialized tools for reading, cloning, and erasing RFID tags, alongside geospatial logging for mapping wireless access points via GNSS coordinates.

System management is handled through a combination of over-the-air updates, SD card firmware installation, and USB flashing. Local storage on a FAT-formatted SD card is used for persisting device configurations, captured radio waveforms, and PCAP files.
- [modular/modular](https://awesome-repositories.com/repository/modular-modular.md) (26,357 ⭐) — Modular is a unified machine learning development platform designed for building, compiling, and deploying high-performance neural network models. It provides a comprehensive execution engine that supports both local and production-grade inference, enabling developers to manage the entire model lifecycle from initial architecture definition to scalable, containerized service deployment.

The platform distinguishes itself through a hardware-agnostic runtime that abstracts diverse silicon architectures, allowing models to execute efficiently across varied compute environments. It includes a specialized stack for systems-level kernel programming, which provides direct memory control and low-level access to hardware primitives. This allows for the development of custom neural network operators and high-performance compute kernels, which are then integrated into optimized execution graphs through automated compilation and operator fusion.

Beyond core execution, the platform offers extensive tooling for performance engineering, including granular profiling instrumentation, hardware-specific bottleneck analysis, and automated benchmarking against defined datasets. It supports a wide range of generative AI tasks through a standardized, multi-modal interface that handles text, image, and video generation. The system also manages infrastructure requirements, including environment orchestration, dependency synchronization, and automated workload routing for high-throughput production clusters.
- [kvcache-ai/ktransformers](https://awesome-repositories.com/repository/kvcache-ai-ktransformers.md) (17,288 ⭐) — Ktransformers is a comprehensive framework designed for the operation, fine-tuning, and serving of large language models. It functions as a heterogeneous inference engine and quantized execution runtime, enabling the deployment of massive models by distributing computational workloads across both CPU and GPU resources. This architecture allows users to bypass local memory constraints, making it possible to run and train models that exceed the capacity of a single device.

The project distinguishes itself through specialized support for sparse architectures, particularly mixture-of-experts models. It employs pipelined expert offloading and layer-wise sharding to balance memory usage and processing speed across heterogeneous hardware. By utilizing hardware-specific kernel optimizations, such as specialized instruction sets for server processors, the framework maximizes throughput for both inference and fine-tuning tasks.

Beyond its core execution capabilities, the project provides a production-ready serving environment that exposes models via an OpenAI-compatible HTTP interface. It includes a suite of command-line tools for managing model deployments, configuring system environments, and performing performance benchmarking. The framework also supports the integration of custom inference kernels and operator injection, allowing for architectural modifications and fine-tuned control over model placement strategies.
- [capsoftware/cap](https://awesome-repositories.com/repository/capsoftware-cap.md) (17,026 ⭐) — Cap is a self-hosted screen recording and video collaboration platform designed for teams to replace synchronous meetings with asynchronous video updates. It provides a comprehensive suite for capturing high-resolution desktop activity, including system audio, microphone input, and camera overlays, which are then processed through an integrated post-production workflow.

The platform distinguishes itself by offering full data sovereignty through containerized deployment and object storage abstractions, allowing users to host their media assets on private infrastructure or S3-compatible buckets. Beyond simple recording, it features keyframe-based video compositing, automated AI-powered transcription, and visual branding tools that enable creators to polish and annotate their content before sharing.

The system facilitates team engagement through a centralized workspace where viewers can provide feedback via timestamped comments, reactions, and playback analytics. It also includes programmatic interfaces for embedding videos into external applications, managing media assets, and automating distribution workflows.

The project is distributed as a containerized application, enabling deployment on private servers to maintain complete control over data storage and access permissions.
- [willmendesneto/ngx-feature-toggle](https://awesome-repositories.com/repository/willmendesneto-ngx-feature-toggle.md) (99 ⭐) — Your module to handle with feature toggles in Angular applications easier.
- [rednote-hilab/dots.ocr](https://awesome-repositories.com/repository/rednote-hilab-dots-ocr.md) (7,695 ⭐) — dots.ocr is a suite of software utilities for document layout analysis, multilingual optical character recognition, and scene text digitization. It functions as an engine for extracting digital text and structured layout data from images and PDFs across various human scripts.

The project includes a specialized transformer for converting charts, diagrams, and chemical formulas from raster images into scalable vector graphics. It also provides a pipeline to transform extracted text and structural layout from documents and web screenshots into formatted Markdown files.

The system covers capabilities for identifying bounding boxes and categories of layout elements to produce structured JSON representations. It further includes tools for scene text detection within natural images and an evaluation framework for measuring text and table extraction accuracy against ground truth data.
- [etcd-io/etcd](https://awesome-repositories.com/repository/etcd-io-etcd.md) (51,838 ⭐) — etcd is a distributed, strongly consistent key-value store designed to provide reliable storage for critical system metadata and coordination primitives. It functions as a distributed consensus engine, utilizing a replicated log and leader-based state machine to ensure that all nodes in a cluster maintain a synchronized view of data. By providing atomic operations and linearizable reads and writes, it serves as a foundational component for distributed systems requiring high availability and fault tolerance.

The system distinguishes itself through its multi-version concurrency control, which enables non-blocking read operations while maintaining strict consistency for concurrent writes. It supports complex distributed coordination through features like lease-based expiration, which allows for the automatic removal of data based on client activity, and asynchronous key change monitoring, which provides real-time event notifications for data modifications. These capabilities are supported by a persistent B-tree-based storage engine and write-ahead logging to ensure durability across system crashes.

Beyond its core storage functions, the project provides a comprehensive suite of tools for cluster management, including automated peer discovery via DNS or service registries and robust security enforcement. It includes built-in mechanisms for transport layer security, role-based access control, and certificate management to protect data in transit and at rest. Operational reliability is further maintained through snapshot-based disaster recovery, cluster health monitoring, and granular performance tuning for disk and network resources.

The system is configured through structured files or command-line flags, allowing for flexible deployment across diverse infrastructure environments.
- [suffix-maybe-feature/adver-suffix-maybe-features](https://awesome-repositories.com/repository/suffix-maybe-feature-adver-suffix-maybe-features.md) (0 ⭐) — This repository provides tools and datasets for investigating adversarial suffix attacks using embedding techniques. Our project focuses on generating adversarial embeddings and creating harmful datasets for Llama2 and Llama3 models, demonstrating potential vulnerabilities in these language models.
- [ricvolpi/adversarial-feature-augmentation](https://awesome-repositories.com/repository/ricvolpi-adversarial-feature-augmentation.md) (131 ⭐) — Code for the paper "Adversarial Feature Augmentation for Unsupervised Domain Adaptation", CVPR 2018
- [nvidia/tensorrt-llm](https://awesome-repositories.com/repository/nvidia-tensorrt-llm.md) (12,913 ⭐) — TensorRT-LLM is a platform and toolkit designed for compiling, optimizing, and serving transformer-based models on accelerated hardware. It functions as a framework that transforms machine learning models into efficient execution graphs, providing an engine to refine these models for specific hardware to maximize throughput and minimize latency during text generation.

The project distinguishes itself through advanced execution strategies that manage the entire inference pipeline. It utilizes kernel-level fusion and static graph execution to optimize mathematical operations and computational flow, while implementing paged attention memory management to handle long sequence lengths without memory fragmentation. These capabilities are integrated with in-flight request batching and custom decoding logic, which allow for the direct implementation of sampling strategies within the execution pipeline to reduce data transfer overhead.

The toolkit supports both online model serving for scalable, concurrent request handling and offline batch inference for high-volume, non-interactive processing. It provides comprehensive controls for managing attention memory and configuring decoding parameters, ensuring that hardware utilization remains efficient across diverse deployment environments.
- [c0re100/qbittorrent-enhanced-edition](https://awesome-repositories.com/repository/c0re100-qbittorrent-enhanced-edition.md) (25,128 ⭐) — qBittorrent-Enhanced-Edition is a cross-platform desktop application designed to manage the downloading and uploading of files across peer-to-peer networks. It functions as an open-source file sharer, facilitating the decentralized distribution of digital content by breaking files into smaller pieces for efficient transfer.

The application utilizes a high-performance library to handle complex protocol specifications and employs a mature widget toolkit to provide a consistent native user interface across Windows, macOS, and Linux. It operates as a network traffic manager, incorporating asynchronous event-driven networking and multi-threaded task scheduling to maintain high throughput and system responsiveness during large-scale data transfers.

Beyond core file sharing, the software includes capabilities for automated content acquisition, remote management via web browsers, and granular bandwidth control. It supports extensible search functionality through external scripts and maintains state integrity using a local relational database for metadata storage.
- [vllm-project/vllm](https://awesome-repositories.com/repository/vllm-project-vllm.md) (83,048 ⭐) — vLLM is a high-throughput inference engine designed for the efficient serving and execution of large language models. It functions as a production-ready distributed model server, providing standard API protocols for online serving while also supporting offline batch processing. The system is built to maximize token generation speed and memory efficiency, enabling both large-scale cloud deployments and local execution on personal hardware.

The project distinguishes itself through advanced memory management and request scheduling techniques, most notably its use of non-contiguous key-value cache blocks to eliminate fragmentation and its ability to dynamically insert new sequences into batches as they arrive. It provides a hardware-agnostic abstraction layer that maps complex mathematical operations to diverse accelerators, including specialized GPUs and consumer-grade silicon like Apple hardware. This is further supported by custom kernel fusion and a flexible quantization framework that allows for the compression of neural networks to fit resource-constrained environments.

Beyond its core runtime, the framework offers extensive support for custom
- [zenkiet/ngx-feature-proxy](https://awesome-repositories.com/repository/zenkiet-ngx-feature-proxy.md) (33 ⭐) — Angular feature flag library with Unleash integration, provides reactive programing, type-safe feature flag management with zero-configuration setup.
- [fatih/vim-go](https://awesome-repositories.com/repository/fatih-vim-go.md) (16,242 ⭐) — This project is a comprehensive development plugin for the Vim text editor, designed to transform the editor into a specialized environment for Go software development. It provides a suite of integrated tools that facilitate the entire development lifecycle, including writing, maintaining, and navigating Go source code.

The plugin distinguishes itself by orchestrating external command-line binaries and language servers to perform heavy analysis, compilation, and formatting tasks as background processes. By mapping editor events to these external tools and utilizing virtual file system synchronization, it maintains editor responsiveness while providing real-time code intelligence, diagnostics, and automated structural changes.

The feature set covers a broad range of development requirements, including automated import management, static code analysis, and integrated build and test execution. It also supports advanced workflows such as program debugging, test coverage visualization, and documentation lookup, all accessible directly within the editor workspace.
- [chatwoot/chatwoot](https://awesome-repositories.com/repository/chatwoot-chatwoot.md) (31,959 ⭐) — Chatwoot is a self-hosted, omnichannel customer support platform designed to aggregate messages from diverse social and digital channels into a single, collaborative team inbox. It provides organizations with full data ownership and control over their support infrastructure, ensuring strict logical separation of customer data through multi-tenant architecture. By centralizing communication, the platform enables teams to manage, route, and resolve inquiries within a unified workspace that maintains complete interaction history for every contact.

The platform distinguishes itself through an event-driven automation engine and a visual rule builder that allow teams to manage conversations and workflows without writing custom code. It incorporates intelligent features such as automated response drafting, conversation context recall, and a self-service knowledge base to improve agent efficiency. These capabilities are supported by granular role-based access controls and comprehensive performance analytics, which provide insights into agent productivity, inbox activity, and customer satisfaction trends.

Beyond its core messaging and routing functions, the system offers a broad suite of operational tools including proactive engagement triggers, team workload balancing, and multilingual support. It supports flexible deployment strategies, including containerized and cloud-native orchestration, to accommodate various production environments. The platform is designed for extensibility, allowing for custom attribute management and integration with external systems via webhooks and API-based channels.
- [thomaspoignant/go-feature-flag](https://awesome-repositories.com/repository/thomaspoignant-go-feature-flag.md) (2,036 ⭐) — GO Feature Flag is a simple, complete and lightweight self-hosted cloud native feature flag solution 100% Open Source. 🎛️
- [qwenlm/qwen3](https://awesome-repositories.com/repository/qwenlm-qwen3.md) (27,324 ⭐) — Qwen3 is a transformer-based large language model designed as a generative AI foundation for understanding, reasoning, and generating human language. It functions as a comprehensive ecosystem for model training, fine-tuning, and production-ready inference, providing the underlying architecture and weights necessary to build diverse artificial intelligence applications.

The project distinguishes itself through extensive support for model quantization and distributed inference, enabling efficient execution across a wide range of hardware from consumer-grade devices to scalable cloud infrastructure. It includes a specialized toolkit for weight compression and memory optimization, such as key-value cache management, which reduces computational requirements while maintaining performance. Furthermore, the model integrates with agentic frameworks, allowing for the development of autonomous systems capable of executing complex workflows and interacting with external tools.

The ecosystem covers a broad surface of deployment and training methodologies, including standardized interfaces for modular plugin integration and function calling. It provides extensive documentation for various training, fine-tuning, and serving environments to facilitate integration into existing software stacks.
- [brew/opentype-feature-bundle](https://awesome-repositories.com/repository/brew-opentype-feature-bundle.md) (0 ⭐) — This covers Adobe’s OpenType Feature File Specification used by the AFDKO. As well as syntax highlighting, there are also a couple of snippets for creating features, tables etc.
- [jantic/deoldify](https://awesome-repositories.com/repository/jantic-deoldify.md) (18,487 ⭐) — DeOldify is a deep learning system and a set of pre-trained computer vision models designed to apply realistic colors to grayscale photographs and video footage. It functions as a neural media restoration tool that uses trained networks to estimate original hues for black-and-white media and remove glitches and artifacts from aged images and film.

The project employs a NoGAN colorization technique that removes the GAN discriminator during training to prevent artifacts and avoid over-saturation of pixels. For cinematic sequences, it applies temporal frame consistency to maintain color stability and prevent flickering between consecutive frames.

The framework covers broad capability areas including deep learning image processing and media restoration. It utilizes generative adversarial networks, convolutional neural architectures, and automatic color labeling to synthesize realistic color values based on learned patterns from large datasets.
- [fastapi/typer](https://awesome-repositories.com/repository/fastapi-typer.md) (19,632 ⭐) — This project is a Python framework for building command-line interfaces by converting standard functions into executable programs. It uses type hints to automatically infer and generate argument parsers, validation logic, and help documentation, allowing developers to define complex terminal applications through simple function signatures.

The framework distinguishes itself through a decorator-driven registration system that enables the construction of hierarchical command trees. It supports dependency injection to manage shared state and runtime configuration across subcommands, and it utilizes reflective metadata inspection to dynamically build help screens and parameter configurations.

Beyond core parsing, the library provides a comprehensive suite of tools for terminal interaction, including support for interactive prompts, secure input collection, and visual feedback like progress indicators. It also handles advanced system integration tasks such as generating shell completion scripts, reading configuration from environment variables, and formatting terminal output with custom styling.

The project is designed to be installed as a standard Python package, enabling developers to expose command-line entry points directly from their modules.
- [facebook/react](https://awesome-repositories.com/repository/facebook-react.md) (245,669 ⭐) — React is a JavaScript library for building user interfaces based on a component-driven architecture and unidirectional data flow.
- [deepseek-ai/deepseek-v3](https://awesome-repositories.com/repository/deepseek-ai-deepseek-v3.md) (103,753 ⭐) — DeepSeek-V3 is a large language model that provides comprehensive resources for model utilization, including technical specifications, pre-trained weights, and evaluation benchmarks. The project details the core transformer architecture, including parameter counts and multi-token prediction modules, while supporting native 8-bit floating-point quantization.

The repository offers extensive support for local and distributed inference through integration with multiple frameworks and engines. It includes documentation for deploying the model across various hardware configurations, such as GPUs and specialized accelerators, alongside guidance for environment setup and weight conversion. Users can access the model through a web-based chat interface, a programmatic API, or by downloading weights for local execution.
