321 repos

Awesome GitHub repositories, curated.

A community-curated directory of interesting public GitHub repositories. Ask in plain English — AI ranks by relevance. Save what you find.

We'll search the best matching repositories with AI.

laurent22/joplin
53,497
Joplin is an open-source, cross-platform note-taking application designed for secure, private knowledge management. It functions as a local-first productivity platform, maintaining a complete relational database on the user's device to ensure offline availability and high-performance data retrieval. The application prioritizes data sovereignty by implementing an end-to-end encryption layer, which secures all information locally with a master key before any synchronization occurs. The platform distinguishes itself through a delta-based synchronization engine that transmits only specific file changes, optimizing performance across multiple devices and operating systems. Users can extend the core environment through a plugin-based architecture that supports custom themes, scripts, and UI components. For professional or collaborative environments, the software offers self-hosted synchronization options and team management capabilities, allowing organizations to maintain full control over their data infrastructure and security policies. Beyond core note-taking, the application supports rich multimedia content, including embedded files, diagrams, and mathematical expressions. It provides a comprehensive web-clipping tool for archiving online research and a RESTful API that enables programmatic access to notes and metadata for external integrations. The system is built on a cross-platform abstraction layer to ensure consistent behavior across desktop and mobile environments.
androiddropboxelectron
laurent22/joplin
pi-hole/pi-hole
55,771
Pi-hole is a self-hosted network utility that functions as a DNS sinkhole server to provide network-wide ad blocking. By acting as a dedicated network gateway, it intercepts and discards requests for known advertising, tracking, and malicious domains across an entire local network, preventing unwanted content from loading on any connected device. The software operates through a lightweight background daemon that handles high volumes of concurrent DNS queries with minimal resource overhead. It utilizes a host-file injection mechanism to redirect traffic toward its local filtering engine and applies regex-based pattern matching to identify and block specific domain requests. Users manage these operations and monitor network traffic statistics through a centralized, web-based configuration interface. Beyond blocking, the project provides tools for comprehensive DNS traffic management and home network security. By resolving domain names locally, it offers increased visibility into outgoing internet traffic and helps optimize network performance by preventing the download of resource-heavy tracking scripts and advertisements.
ad-blockerblockercloud
pi-hole/pi-hole
clash-verge-rev/clash-verge-rev
97,701
This application provides a comprehensive interface for managing network traffic through a core proxy engine. It supports multiple traffic interception methods, including system-wide proxy settings and virtual network interfaces, allowing users to route TCP and UDP traffic based on specific domain, IP, port, or process criteria. The system facilitates complex network configurations through proxy chaining, rule-based routing, and the aggregation of multiple remote subscription sources. Beyond core networking, the tool includes developer-focused utilities for configuration management and system diagnostics. Users can modify configuration objects using a sandboxed scripting engine or automate imports via URL-based protocols and custom response headers. The application also offers administrative service modes for elevated privilege management and provides tools for visual interface customization, including support for custom style sheets and icon management.
clashclash-metaclash-verge
clash-verge-rev/clash-verge-rev
tensorflow/models
77,684
This repository serves as a centralized collection of state-of-the-art deep learning architectures and reference implementations designed for research and application development. It provides a comprehensive toolkit for computer vision and natural language processing, offering pre-built models and training pipelines for tasks ranging from image classification and object detection to complex sequence modeling. The project distinguishes itself by providing a flexible execution harness that manages the entire training lifecycle, including data ingestion and backpropagation. It supports scalable training across distributed hardware environments through collective communication primitives and utilizes configuration-driven experimentation to decouple hyperparameters from source code. By structuring neural architectures through hierarchical class compositions and employing checkpoint-based state persistence, the repository ensures that research workflows remain modular, reproducible, and fault-tolerant. These implementations demonstrate industry-standard patterns for constructing and deploying neural networks, including optimized graph-based execution for hardware acceleration. The repository functions as a reference for best practices in deep learning, providing documented examples for vision, language, and training loop management.
tensorflow/models
usememos/memos
57,067
Memos is a self-hosted, container-native knowledge management platform designed for capturing and organizing personal notes. It functions as a private workspace where users can create content using markdown, tags, and media embeds to streamline daily productivity. The system is built to be deployed as a portable service, allowing individuals to maintain full control over their data and hosting environment. Beyond its core note-taking capabilities, the platform operates as a headless content service that exposes a structured RESTful API. This interface allows for programmatic interaction, enabling users to automate tasks, synchronize information with external tools, and build custom clients. The system supports secure authentication through personal access tokens and provides event-driven webhook integration to trigger external workflows whenever content is created or modified. The application is designed for flexible production environments, supporting multiple relational database backends and configuration via environment variables. It includes administrative tools for managing user roles, instance settings, and data backups, ensuring that the platform can be tailored to specific organizational or personal requirements. The service is optimized for deployment through container runtimes, with built-in support for reverse proxy configurations to handle secure traffic and public link generation.
dockerfossgo
usememos/memos
junegunn/fzf
77,987
This project is a general-purpose command-line filter that provides an interactive interface for processing standard input streams. It enables real-time fuzzy searching, data selection, and transformation, allowing users to navigate complex information or file systems directly within their terminal. By utilizing a pipe-oriented architecture, it integrates into existing shell pipelines and workflows to facilitate efficient data exploration. What distinguishes this tool is its highly extensible, event-driven design that allows for deep integration with external processes. It supports asynchronous data transformation and dynamic list reloading, enabling users to trigger shell commands or update content based on user interactions without blocking the interface. The system maintains selection identity across these updates, providing a consistent experience when managing large or streaming datasets. The project offers a comprehensive suite of features for terminal user interface development, including multi-threaded search performance, configurable preview windows, and support for various terminal multiplexers. It provides extensive customization options for visual layout, key bindings, and search logic, allowing developers to build custom selection interfaces or automate complex shell tasks. The tool is configured through environment variables and configuration files, supporting inline comments for maintainability. It is designed to be installed as a standalone command-line utility, with library integration options available for embedding its filtering capabilities into other applications.
bashclifish
junegunn/fzf
zed-industries/zed
75,634
Zed is an AI-native, high-performance code editor designed for extreme responsiveness and keyboard-centric workflows. It functions as an extensible text processing workspace that integrates autonomous agents and predictive models directly into the development environment to automate complex engineering tasks, refactoring, and code generation. The editor distinguishes itself through a GPU-accelerated rendering pipeline and an asynchronous multi-threaded architecture that ensures low-latency interaction even with large-scale projects. It features built-in support for real-time, multi-user collaboration using conflict-free replicated data types, allowing for synchronized editing sessions. Users can leverage both local machine learning model execution for data privacy and external AI service integrations to power inline assistance and agentic workflows. The platform provides comprehensive language-aware analysis by acting as a standards-compliant client for external language servers, enabling intelligent diagnostics, completions, and structural navigation. Its modular design supports a customizable environment where developers can manage language extensions, define keybindings, and utilize command-driven navigation to streamline their specific coding requirements.
gpuirust-langtext-editor
zed-industries/zed
firstcontributions/first-contributions
52,672
This project is an educational resource designed to lower the barrier to entry for new developers learning how to participate in open-source software development. It provides a safe, guided practice environment where beginners can master the fundamental workflows required to contribute to public repositories. The project distinguishes itself by offering a hands-on, interactive tutorial that walks users through the complete lifecycle of a contribution. By following structured steps—including forking, branching, committing, and submitting a pull request—participants gain practical experience with distributed version control systems. This process is specifically curated to build confidence in novice developers as they navigate the standard procedures of technical communities. Beyond the core tutorial, the repository covers essential best practices for collaborative development, such as identifying suitable projects, reading documentation, and adhering to community guidelines. The entire experience is documented through plain text files, ensuring that the learning materials remain accessible and easy to follow for anyone starting their journey in open-source collaboration.
beginnerbeginner-friendlycontribute
firstcontributions/first-contributions
PKUFlyingPig/cs-self-learning
71,351
This project is a centralized repository and academic resource aggregator designed to guide students through a structured computer science curriculum. It provides a comprehensive roadmap of foundational courses and technical materials, helping learners navigate the transition from introductory programming to advanced software engineering proficiency. The repository distinguishes itself through a community-driven approach, where study paths and resource collections are refined and expanded via peer feedback and collaborative contributions. By organizing high-quality lecture notes, assignments, and reading lists from top-tier university programs into a logical progression, it enables self-directed learners to bridge technical skill gaps and optimize their academic performance. The content is maintained as a version-controlled collection of markdown files, ensuring that the learning path remains transparent and accessible. This documentation is compiled into a static format, allowing users to navigate complex academic sequences and track their progress across platforms without the need for dynamic backends.
PKUFlyingPig/cs-self-learning
infiniflow/ragflow
73,425
This project is a comprehensive retrieval-augmented generation platform designed for building, managing, and deploying knowledge-based AI applications. It provides a unified environment for organizing datasets, configuring conversational chat assistants, and developing autonomous agents that execute multi-step reasoning workflows. By integrating document intelligence with advanced retrieval pipelines, the platform enables the creation of grounded, verifiable responses supported by traceable citations. The platform distinguishes itself through deep document understanding and sophisticated knowledge orchestration. It supports complex document parsing, including the extraction of tables and images, and utilizes graph-based indexing to enhance reasoning over large document collections. Users can configure multiple recall strategies and fused re-ranking to optimize retrieval accuracy, while the system maintains context through multi-turn dialogue management and flexible tool-use frameworks. The architecture is built on a modular, containerized microservice foundation that supports both local inference engines and external language model APIs. It includes asynchronous task processing for document ingestion and indexing, ensuring system responsiveness during heavy workloads. The platform also provides a standardized interface for model abstraction, allowing for seamless integration with existing language model ecosystems. Developers can interact with the platform through a comprehensive suite of RESTful endpoints and Python client libraries, which cover the full lifecycle of agents, datasets, and knowledge graphs. The system is designed for flexible deployment, offering configurable environment settings and support for custom containerized environments to facilitate local development and infrastructure portability.
agentagenticagentic-ai
infiniflow/ragflow
puppeteer/puppeteer
93,606
Puppeteer is a browser automation library that provides a programmatic interface for controlling web browsers to execute tasks, simulate user interactions, and perform end-to-end testing. It functions as a headless browser controller, managing browser lifecycles, isolated session contexts, and remote connections to facilitate stable, automated web-based workflows. The library distinguishes itself through its deep integration with the Chrome DevTools Protocol, utilizing a bidirectional message bus to execute commands and receive real-time event notifications. It supports advanced automation patterns, including the registration and execution of custom tools within the browser environment and the ability to simulate diverse device characteristics and network conditions. By maintaining isolated browser contexts, it prevents data leakage between concurrent tasks, ensuring predictable environments for complex testing scenarios. Beyond core automation, the project serves as a comprehensive instrumentation and diagnostic suite. It enables developers to capture performance traces, inspect accessibility trees for compliance auditing, and generate high-fidelity visual artifacts such as screenshots and PDFs. Additionally, it functions as a server-side rendering engine, capable of crawling dynamic single-page applications to produce pre-rendered static content for improved search engine indexing.
automationchromechromium
puppeteer/puppeteer
mlabonne/llm-course
75,340
This project is a comprehensive educational curriculum and engineering handbook focused on the lifecycle of large language models. It serves as a structured knowledge base for machine learning practitioners, covering the fundamental mathematical and architectural principles of transformer-based sequence modeling, as well as the practical implementation of supervised instruction fine-tuning and preference-based model alignment. The repository distinguishes itself by providing a deep dive into advanced model composition and optimization techniques. It details methodologies for weight-space model merging and mixture-of-experts strategies, alongside practical guidance on low-precision parameter quantization and inference optimization to manage hardware requirements. Furthermore, it explores the development of autonomous agentic systems capable of tool-use orchestration and the construction of retrieval-augmented generation pipelines to ground model outputs in external data. The content spans the entire technical stack, from foundational deep learning concepts and neural network design to the complexities of deploying, evaluating, and securing models in production environments. It includes a curated collection of technical articles, blog posts, and interactive notebooks that track state-of-the-art research trends and experimental methodologies in generative artificial intelligence.
courselarge-language-modelsllm
mlabonne/llm-course
Shubhamsaboo/awesome-llm-apps
96,116
This repository serves as a comprehensive collection of resources, templates, and starter code for building artificial intelligence applications. It provides a centralized hub for developers to access practical implementations of common workflows, including retrieval-augmented generation pipelines and autonomous agent loops, alongside educational materials designed to support rapid prototyping and experimentation. The project distinguishes itself by offering a dual focus on technical implementation and critical analysis. It provides a library of lightweight, single-file agents and tutorials for complex tasks like multi-source retrieval, memory management, and tool integration via standardized protocols. Simultaneously, it includes an analytical framework for identifying and evaluating the linguistic patterns, structural templates, and stylistic markers characteristic of machine-generated text. Beyond these core offerings, the repository covers a broad capability surface that includes guidance on model fine-tuning, voice-processing integration, and strategies for optimizing agent reasoning and token consumption. It also features conceptual resources regarding the evolving role of product management in agent-driven environments and best practices for mitigating performance issues in autonomous systems. The repository is structured as a curated list with a navigation index, providing quick-start instructions for initializing and running template agents within a local development environment.
agentsllmspython
Shubhamsaboo/awesome-llm-apps
rails/rails
58,297
This project is a full-stack web framework designed for building database-backed applications through a standardized architectural pattern. It provides a comprehensive suite of integrated libraries that manage the entire request-response lifecycle, from routing incoming web traffic to rendering dynamic server-side templates. By utilizing an object-relational mapping layer, the framework allows developers to define domain models that map database tables directly to application objects, simplifying data persistence, schema migrations, and complex relationship management. The framework is distinguished by its commitment to convention over configuration, which reduces manual setup by using predefined naming patterns and directory structures to wire components together. It employs a model-view-controller architecture to separate application logic into distinct layers, supported by a modular middleware pipeline that handles cross-cutting concerns like authentication and session management. These features are complemented by built-in utilities for background job processing, real-time communication, and file storage, enabling the creation of complex, scalable services within a single cohesive environment. Beyond core development, the framework includes an extensive suite of infrastructure tools to support the entire software lifecycle. This includes automated testing and quality inspection capabilities, security vulnerability scanning, and specialized helpers for production deployment and performance optimization. Developers can further extend the framework by building custom plugins, engines, and middleware to meet specific project requirements.
activejobactiverecordframework
rails/rails
vuejs/core
53,019
Vue is a progressive JavaScript framework designed for building modular, reactive user interfaces. It utilizes a component-based architecture that allows developers to encapsulate logic, templates, and styles into reusable units. At its core, the framework employs a virtual DOM renderer and a proxy-based reactivity system to synchronize application state with the document object model efficiently. What distinguishes this framework is its focus on developer experience and flexibility. It supports a single-file component format that colocalizes related concerns, alongside a powerful composition API for organizing complex logic into reusable functions. The framework also provides advanced build-time optimizations, such as template compilation hints that minimize unnecessary tree traversals, and robust support for TypeScript to ensure type safety across component props, events, and reactive state. The framework covers a broad capability surface, including built-in tools for managing asynchronous component loading, content teleportation, and sophisticated animation sequences. It offers comprehensive support for server-side rendering to improve search engine optimization and initial load performance, as well as interoperability features for integrating with standard web components. Additionally, it includes utilities for dependency injection, global state management, and client-side routing to support the development of scalable, stateful applications. The project provides extensive documentation and tooling, including command-line scaffolding and IDE support, to assist with project configuration, testing, and quality assurance.
vuejs/core
karpathy/nanoGPT
53,461
nanoGPT is a lightweight engine for training and fine-tuning transformer-based language models from scratch. It provides a minimalist codebase designed for educational exploration and rapid experimentation with neural network architectures, utilizing self-attention and feed-forward layers to process sequences and predict subsequent elements. The project distinguishes itself through a focus on high-speed data ingestion and hardware-accelerated performance. It includes a dedicated pipeline for transforming raw text into memory-mapped binary files, which enables efficient streaming during training. To maximize throughput, the system supports distributed data parallelism across multiple graphics processing units and employs just-in-time kernel compilation to optimize mathematical operations for specific hardware. Beyond core training capabilities, the repository provides a command-line interface for generative text inference, allowing users to sample sequences from trained models using configurable parameters. It also includes integrated benchmarking tools to measure iteration speeds and identify hardware bottlenecks, ensuring efficient model development across various configurations.
karpathy/nanoGPT
microsoft/generative-ai-for-beginners
106,618
This project is a comprehensive, open-source educational curriculum designed to guide developers through the mastery of generative artificial intelligence. It provides a structured learning path that covers foundational concepts, prompt engineering, and the practical application of large language models. The repository serves as a central hub for skill acquisition, offering sequential modules that progress from basic model mechanics to advanced architectural patterns. The curriculum distinguishes itself by focusing on the end-to-end lifecycle of intelligent software, including the implementation of retrieval-augmented generation and agentic workflow orchestration. It provides technical guidance on integrating diverse models—ranging from open-source options to cloud-based services—while emphasizing responsible development through systematic safety guardrails and ethical design practices. Learners are equipped to build functional applications, such as conversational interfaces, semantic search tools, and automated content generators, using standardized interfaces and modern development techniques. Beyond core model implementation, the resource covers operational practices for monitoring and maintaining AI systems in production. It includes practical modules on fine-tuning, vector-based indexing, and designing intuitive user experiences for intelligent systems. The repository is structured to support developers through every stage of the process, from initial environment configuration and dependency management to deployment readiness and troubleshooting.
aiazurechatgpt
microsoft/generative-ai-for-beginners
angular/angular.js
58,970
AngularJS is a structural framework for building dynamic web applications by extending standard HTML with custom tags and attributes. It operates as a client-side template engine that transforms declarative markup into interactive components, organizing application logic through a model-view-controller pattern. By utilizing a centralized dependency injection container, the framework manages the lifecycle of services and components to ensure modularity and maintainable architecture. The framework is defined by its two-way data binding mechanism, which automatically synchronizes data models with the user interface. It achieves this through dirty-checking, where the system periodically compares model snapshots to propagate changes between the view and the underlying data. This process is supported by hierarchical scope inheritance, allowing nested components to access and modify parent data models, and expression-based evaluation that enables dynamic logic directly within the document markup. Beyond its core rendering and binding capabilities, the project provides a comprehensive suite of tools for application development. This includes a service-oriented architecture for encapsulating business logic, built-in data transformation filters, and extensive support for automated testing, covering both isolated unit tests and end-to-end browser workflows. The framework also offers granular control over document elements, including conditional rendering, event handling, and input validation.
angular/angular.js
tesseract-ocr/tesseract
72,460
Tesseract is a neural network-based optical character recognition engine designed to convert scanned images and digital documents into machine-readable, searchable text. It functions as both a command-line utility for automating large-scale digitization workflows and a cross-platform library that can be embedded into desktop, mobile, or server-side applications. By utilizing long short-term memory networks, the engine provides robust text extraction across more than one hundred languages and dozens of scripts. The project distinguishes itself through a sophisticated document layout analysis framework that employs a hybrid approach to resolve complex structures like multi-column text and tables. It offers extensive configurability, allowing users to refine recognition accuracy through custom linguistic models, user-defined dictionaries, and specialized training pipelines. The engine supports the generation of various structured outputs, including searchable PDFs with hidden text layers, and provides hardware-accelerated math kernels to optimize inference performance. Beyond core recognition, the system includes comprehensive tooling for image pre-processing, page segmentation, and the management of modular language data. It provides C and C++ APIs alongside various language-specific wrappers, enabling integration into diverse software environments. The engine is available as pre-built binary packages or can be compiled from source using standard system compilers.
hacktoberfestlstmmachine-learning
tesseract-ocr/tesseract
denoland/deno
106,258
Deno is a high-performance runtime for JavaScript and TypeScript that prioritizes security and developer productivity. Built on the V8 engine, it provides a secure execution environment that enforces a default-deny security model, requiring explicit user authorization for access to system resources like the file system, network, and environment variables. The runtime natively supports modern web-standard APIs, ensuring consistent behavior and portability across different environments. What distinguishes Deno is its integrated approach to the software development lifecycle. It bundles essential utilities—including a formatter, linter, test runner, and dependency manager—directly into the runtime, eliminating the need for external build tools or complex transpilation steps. The platform features a universal module resolution system that supports remote HTTPS URLs, local paths, and standard package registries, all backed by lockfiles to ensure build determinism and supply chain security. Beyond its core runtime capabilities, Deno includes a built-in, persistent key-value database engine that supports atomic transactions and reactive data monitoring. It also provides a robust compatibility layer for the Node.js ecosystem, allowing for the seamless execution of legacy modules and native binary addons. For multi-tenant or distributed applications, the runtime offers isolated sandbox environments that manage resource constraints and security boundaries, facilitating secure code execution in shared infrastructure. The project is distributed as a single binary, providing a unified toolchain for managing dependencies, executing tasks, and configuring runtime security policies.
denojavascriptrust
denoland/deno

Browse repositories

laurent22/joplin

pi-hole/pi-hole

clash-verge-rev/clash-verge-rev

tensorflow/models

usememos/memos

junegunn/fzf

zed-industries/zed

firstcontributions/first-contributions

PKUFlyingPig/cs-self-learning

infiniflow/ragflow

puppeteer/puppeteer

mlabonne/llm-course

Shubhamsaboo/awesome-llm-apps

rails/rails

vuejs/core

karpathy/nanoGPT

microsoft/generative-ai-for-beginners

angular/angular.js

tesseract-ocr/tesseract

denoland/deno