awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms

686 repos

Awesome GitHub Repositories

The best open-source projects, ranked by AI. Just describe what you need — we curate thousands of GitHub repositories so you can find the right tool, library, or framework in seconds.

Browse repositories

Describe the repository you're looking for…
Find the best repos with AI.We'll search the best matching repositories with AI.
  • google-research/tuning_playbook

    google-research/tuning_playbook

    29,826View on GitHub↗

    This project is a comprehensive guide and reference manual for deep learning hyperparameter optimization and large-scale model training. It provides a structured, scientific framework for managing the complex trade-offs between model performance, computational resource consumption, and training throughput. By establishing a rigorous experimentation workflow, the resource enables practitioners to move beyond trial-and-error toward a systematic, data-driven approach to model development. The playbook distinguishes itself by emphasizing incremental tuning strategies and checkpoint-based evaluati

    Hyperparameter OptimizationLarge Scale TrainingExperimentation Workflows
    29,826View on GitHub↗
  • vllm-project/vllm

    vllm-project/vllm

    70,745View on GitHub↗

    vLLM is a high-throughput inference engine designed for the efficient serving and execution of large language models. It functions as a production-ready distributed model server, providing standard API protocols for online serving while also supporting offline batch processing. The system is built to maximize token generation speed and memory efficiency, enabling both large-scale cloud deployments and local execution on personal hardware. The project distinguishes itself through advanced memory management and request scheduling techniques, most notably its use of non-contiguous key-value cach

    Continuous Batching StrategiesCustom Model Execution EnginesDistributed Model Servers
    70,745View on GitHub↗
  • modelcontextprotocol/servers

    modelcontextprotocol/servers

    79,000View on GitHub↗

    The Model Context Protocol is a standardized communication framework designed to connect language models to external data sources, functional tools, and interactive user interfaces. It provides a vendor-neutral interface layer that enables AI hosts to discover and execute capabilities across heterogeneous service environments, using a JSON-RPC based messaging standard to facilitate bidirectional communication between clients and servers. The protocol distinguishes itself through a robust capability-based handshake that negotiates feature sets during session initialization, ensuring compatibil

    AI Context Integration ProtocolsAI Agent Tool IntegrationsAI Interoperability Layers
    79,000View on GitHub↗
  • anthropics/skills

    anthropics/skills

    145,500View on GitHub↗

    This project provides a standardized framework for extending the functional range of artificial intelligence agents through a registry of modular, declarative instructions. It enables agentic workflow automation by allowing developers to define task-specific behaviors and operational constraints that guide how agents interact with external tools and execute multi-step processes. The system distinguishes itself through a directory-based discovery model and a plugin-registry architecture that facilitates the distribution of specialized workflows. By utilizing a schema-driven specification that

    Agent Configuration SpecificationsDeclarative Skill OrchestrationsAgent Capability Extensions
    145,500View on GitHub↗
  • affaan-m/everything-claude-code

    affaan-m/everything-claude-code

    48,404View on GitHub↗

    Everything Claude Code is an agentic framework designed to orchestrate complex software development workflows through specialized subagent delegation. It functions as a control plane that manages agent behavior, tool access, and context window efficiency, allowing developers to break down large tasks into focused, scoped sub-processes that prevent system overload. The framework distinguishes itself through a robust security and automation layer that includes automated static analysis and adversarial red-teaming to audit agent configurations. It enables the creation of reusable behavioral patt

    Agent OrchestrationAgent Orchestration LayersAgent Security Auditing
    48,404View on GitHub↗
  • oobabooga/text-generation-webui

    oobabooga/text-generation-webui

    46,070View on GitHub↗

    This project is a comprehensive platform for hosting and interacting with large language models directly on local hardware. It provides a web-based graphical interface that allows users to manage model loading, configure generation parameters, and execute text or chat interactions entirely offline. By running models locally, the software ensures complete data privacy and eliminates reliance on external cloud services for generative tasks. Beyond basic inference, the platform functions as a versatile workbench for generative AI development. It includes an integrated pipeline for fine-tuning mo

    Local Inference EnginesLocal Model RuntimesModel Fine-Tuning Tools
    46,070View on GitHub↗
  • PaddlePaddle/PaddleOCR

    PaddlePaddle/PaddleOCR

    70,931View on GitHub↗

    PaddleOCR is a comprehensive optical character recognition framework designed for detecting and transcribing text from images and documents into structured, machine-readable formats. It provides a modular computer vision pipeline that decouples image preprocessing, text detection, and character recognition into independent, configurable stages. This architecture supports automated document digitization and multilingual text recognition, capable of identifying text in over one hundred languages across diverse environments ranging from scanned documents to industrial scenes. The framework disti

    Modular Vision PipelinesMultilingual Text RecognitionDeep Learning
    70,931View on GitHub↗
  • suno-ai/bark

    suno-ai/bark

    38,980View on GitHub↗

    Bark is a generative audio engine and machine learning inference library designed to convert written text into high-fidelity speech and sound effects. It functions as a text-to-audio transformer, utilizing multi-stage neural network architectures to map semantic input tokens into detailed audio codebooks for synthesis. The system distinguishes itself through a hierarchical transformer stacking approach that separates semantic understanding from acoustic realization. By employing autoregressive token prediction and vector quantized codebook mapping, the engine bridges linguistic and sonic doma

    Generative Audio EnginesSpeech Synthesis ModelsText-to-Audio Synthesis
    38,980View on GitHub↗
  • 1Panel-dev/1Panel

    1Panel-dev/1Panel

    35,728View on GitHub↗

    1Panel is a centralized server management and container orchestration platform designed to simplify the administration of Linux-based infrastructure. It provides a unified web interface for managing containerized workloads, automating system maintenance, and configuring server resources. By acting as a comprehensive control plane, the platform streamlines the deployment of applications, databases, and web services while offering granular control over host system internals and security settings. What distinguishes this platform is its integrated support for private artificial intelligence infr

    AI Infrastructure ManagersContainer Orchestration PlatformsInfrastructure Automation Tools
    35,728View on GitHub↗
  • scikit-learn/scikit-learn

    scikit-learn/scikit-learn

    65,178View on GitHub↗

    Scikit-learn is a machine learning library for predictive data analysis that provides a collection of algorithms for supervised and unsupervised learning. It functions as a comprehensive toolkit for data preprocessing, dimensionality reduction, and model selection, allowing users to classify data objects, predict continuous values, and cluster similar items based on historical patterns. The project is defined by a unified interface design where objects either learn from data, transform data, or chain these operations into sequential workflows. To ensure performance on large or high-dimensiona

    Dimensionality Reduction EnginesFrameworksPipeline Patterns
    65,178View on GitHub↗
  • dair-ai/Prompt-Engineering-Guide

    dair-ai/Prompt-Engineering-Guide

    70,526View on GitHub↗

    This project is a comprehensive educational resource and technical guide focused on the development, optimization, and application of large language models. It provides a structured curriculum for mastering prompt engineering, ranging from foundational principles of instruction design to advanced techniques for improving model reasoning, accuracy, and reliability. The guide distinguishes itself by offering deep technical insights into agentic workflows and autonomous system design. It covers the implementation of multi-step reasoning chains, tool integration through function calling, and stat

    Agentic OrchestrationAgentic Orchestration FrameworksPrompt Engineering
    70,526View on GitHub↗
  • meta-llama/llama

    meta-llama/llama

    59,157View on GitHub↗

    Llama is a computational framework and runtime environment designed for executing transformer-based neural networks locally. It functions as a generative AI inference engine, enabling the processing of input sequences through pre-trained model weights to produce text completions and structured data outputs directly on your own hardware. The system distinguishes itself through specialized memory and computation management techniques, including memory-mapped weight loading and quantization-aware inference, which allow for efficient execution on standard consumer hardware. It utilizes a stateles

    Inference EnginesLarge Language Model RuntimesLocal Inference Engines
    59,157View on GitHub↗
  • janhq/jan

    janhq/jan

    40,489View on GitHub↗

    Jan is a desktop application that functions as a local artificial intelligence model runtime and an open-standard API server. It enables the execution of large language models directly on local hardware, ensuring that data remains private and accessible offline while providing a unified interface for managing model weights and inference runtimes. The platform distinguishes itself by offering a modular inference backend that allows users to swap execution engines based on hardware compatibility and performance needs. It acts as a cross-platform orchestrator, providing the ability to switch bet

    Local Model RuntimesDesktop AI RuntimesOpenAI-Compatible Servers
    40,489View on GitHub↗
  • PatrickJS/awesome-cursorrules

    PatrickJS/awesome-cursorrules

    37,927View on GitHub↗

    This project is a curated library of configuration files designed to optimize the behavior of AI-assisted code editing environments. By providing structured instructions that define project constraints, coding standards, and technical preferences, it enables developers to standardize how artificial intelligence models interact with their codebases. These configuration files are integrated into the editor to ensure consistent output and improved accuracy during code generation. The repository distinguishes itself through a community-driven approach to curation, aggregating user-submitted rules

    AI Coding Assistant RulesAI Coding AssistantsAI Editor Configuration Libraries
    37,927View on GitHub↗
  • hiroi-sora/Umi-OCR

    hiroi-sora/Umi-OCR

    42,159View on GitHub↗

    Umi-OCR is an optical character recognition engine designed to convert visual text from images and documents into machine-readable character data. It functions as a local-first toolkit, processing all visual data directly on the host machine using embedded neural network models to maintain privacy and offline availability. The project distinguishes itself through its focus on automated document digitization and integrated barcode and QR code decoding. By utilizing a modular, Python-based orchestration layer, it enables users to transform static image files and multi-page documents into search

    Optical Character RecognitionLocal Inference EnginesDocument Analysis Tools
    42,159View on GitHub↗
  • openai/whisper

    openai/whisper

    94,839View on GitHub↗

    This project is a speech recognition and translation engine that utilizes a sequence-to-sequence transformer architecture to convert audio into text. It is built upon a weakly supervised learning framework, which leverages large-scale, unlabelled audio-transcript data to create generalized speech representations capable of performing simultaneous transcription, language identification, and translation. The system distinguishes itself through a unified multi-task modeling approach that shares token sequences across different objectives, allowing it to handle diverse languages and vocabularies

    Speech Recognition SystemsAutomatic Speech RecognitionAutomatic Speech Recognition Toolkits
    94,839View on GitHub↗
  • tesseract-ocr/tesseract

    tesseract-ocr/tesseract

    72,460View on GitHub↗

    Tesseract is a neural network-based optical character recognition engine designed to convert scanned images and digital documents into machine-readable, searchable text. It functions as both a command-line utility for automating large-scale digitization workflows and a cross-platform library that can be embedded into desktop, mobile, or server-side applications. By utilizing long short-term memory networks, the engine provides robust text extraction across more than one hundred languages and dozens of scripts. The project distinguishes itself through a sophisticated document layout analysis f

    OCR EnginesAutomated Digitization EnginesCommand-Line Document Processors
    72,460View on GitHub↗
  • ggml-org/llama.cpp

    ggml-org/llama.cpp

    95,400View on GitHub↗

    Llama.cpp is an inference engine designed for the local execution of text-based and multimodal language models on consumer hardware. It provides a core environment for running models that process both text and image inputs, utilizing hardware-accelerated backends to optimize performance across diverse CPU and GPU architectures. The project distinguishes itself by offering a lightweight HTTP server that adheres to standard API specifications, enabling chat completion, embeddings, and reranking services. It includes a suite of tools for model quantization and conversion, which reduces memory us

    Hardware Abstraction LayersText-Only Inference EnginesMultimodal Inference Engines
    95,400View on GitHub↗
  • khoj-ai/khoj

    khoj-ai/khoj

    32,535View on GitHub↗

    Khoj is a self-hosted artificial intelligence platform designed for personal knowledge management and semantic information retrieval. It functions as a private assistant that indexes your local documents, notes, and external workspaces, allowing you to interact with your data through natural language queries and conversational chat. By maintaining a local-first architecture, the system ensures that your information remains under your control while providing context-aware responses grounded in your personal knowledge base. The platform distinguishes itself through a modular, cross-platform int

    Personal AI AssistantsPersonal Knowledge Management SystemsSemantic Search Engines
    32,535View on GitHub↗
  • open-mmlab/mmdetection

    open-mmlab/mmdetection

    32,409View on GitHub↗

    This project is a modular research toolkit designed for developing, training, and evaluating deep learning models for object detection, segmentation, and video instance tracking. It provides a flexible training engine that manages complex neural network execution, including distributed training, custom lifecycle hooks, and weight optimization. The framework is built around a hierarchical configuration system that allows users to define architectures, data pipelines, and training hyperparameters through composable, inheritable files. The project distinguishes itself through its highly modular

    Computer Vision ToolkitsObject DetectionTraining Pipelines
    32,409View on GitHub↗
Prev1…32333435Next

Explore more curated searches

Hand-picked prompts, ranked by AI.

  • Supercharge your terminal and dev setup
  • Free resources to learn to code
  • Modern frontend frameworks and UI kits
  • Learn by building real-world projects
  • Open-source AI agents and local LLMs
  • Self-host apps to replace paid SaaS