321 repos

Awesome GitHub repositories, curated.

A community-curated directory of interesting public GitHub repositories. Ask in plain English — AI ranks by relevance. Save what you find.

We'll search the best matching repositories with AI.

jackfrued/Python-100-Days
178,734
This project is a comprehensive, day-by-day curriculum designed to guide learners through the Python programming language and its professional applications. The content spans from fundamental syntax and object-oriented design to advanced topics including database management, web development, data analysis, and machine learning. The curriculum is structured into distinct modules that cover practical software engineering practices, such as version control, containerization, and system architecture. It also provides resources for technical interview preparation and an analysis of career paths within the software development and data science ecosystems. The material is delivered through a series of structured lessons and practical exercises.
jackfrued/Python-100-Days
rclone/rclone
55,637
This project is a command-line storage manager that provides a unified interface for performing file operations across local filesystems and diverse cloud storage providers. It functions as a cross-platform storage abstraction, utilizing a modular backend architecture to map heterogeneous cloud storage APIs into a standard set of file system operations. This allows for consistent data management and movement regardless of the underlying storage service. The tool serves as a network data transfer engine designed for automated data migration and cloud storage synchronization. It distinguishes itself by offering granular control over transfer behavior, allowing users to manage bandwidth, logging, and file handling rules through global command-line flags. Furthermore, it includes a metadata transformation pipeline that intercepts and modifies file attributes during transit to ensure compatibility and consistency between disparate storage environments. Beyond core synchronization, the software provides secure remote file management by enforcing strict authentication and encrypted network communication protocols. It includes diagnostic instrumentation to monitor system performance, enabling users to analyze resource usage and identify bottlenecks during large-scale data operations. Users can configure and persist storage backend credentials through an interactive command-driven utility.
azure-blobazure-blob-storageazure-files
rclone/rclone
ngosang/trackerslist
52,988
This project is a curated, community-driven registry of public BitTorrent trackers designed to facilitate peer-to-peer file sharing. It serves as a centralized resource for network endpoints that coordinate connections between distributed clients, helping users discover and maintain reliable infrastructure for decentralized communication protocols. The repository distinguishes itself through a fully automated orchestration pipeline that ensures the lists remain current and accurate. Every day, background tasks perform distributed health monitoring to verify connectivity and filter out unresponsive service providers. The system further refines these lists by applying deduplication logic to remove redundant entries and utilizing a blacklist to exclude malicious or non-compliant endpoints. To optimize performance for end-users, the collected trackers are ranked based on popularity and measured latency. The project provides these verified resources as static assets, offering high-availability access to tracker lists in multiple formats without requiring a live database backend. These lists are intended to support BitTorrent client optimization, allowing users to inject reliable trackers into their torrent files and magnet links to improve swarm connectivity and download speeds.
bittorrentbittorrent-trackerbittorrent-trackers
ngosang/trackerslist
mermaid-js/mermaid
86,200
This project is a client-side rendering engine that transforms declarative, text-based syntax into visual diagrams directly within the browser. By utilizing a domain-specific language, it allows users to define complex structures—such as software architectures, process flows, and system behaviors—without the need for manual layout configuration. The library functions as a browser-based runtime that parses these definitions into intermediate abstract syntax trees, which are then processed by specialized engines to generate high-fidelity, resolution-independent graphics. The system distinguishes itself through a modular architecture that decouples diagram types into independent plugins, allowing for a wide range of visualizations including sequence diagrams, entity relationship models, and project timelines. To ensure security when processing untrusted input, the library supports sandboxed rendering within isolated frames. It also features automatic rendering capabilities, which monitor the document object model to detect and visualize diagram definitions embedded within standard web content. Beyond its core rendering engine, the project supports a documentation-as-code workflow by integrating with various development environments, productivity platforms, and content frameworks. This enables the inclusion of version-controlled, dynamic visuals in technical guides and wikis. The library is designed for flexible deployment, offering support for content delivery network integration to facilitate implementation without requiring local build processes.
diagramsdiagrams-as-codedocumentation
mermaid-js/mermaid
yt-dlp/yt-dlp
147,702
This project is a command-line media downloader designed for the systematic retrieval and organization of digital content from diverse online platforms. It functions as an extensible extraction engine that utilizes a declarative format-selection pipeline to automate the identification, merging, and downloading of specific audio and video streams based on user-defined criteria. The system distinguishes itself through a modular architecture that supports custom plugins and site-specific scripts, allowing for the bypass of platform restrictions and the handling of complex authentication challenges. It features a robust media processing orchestrator that manages external dependencies to perform automated transcoding, remuxing, and metadata manipulation. By simulating legitimate client behavior through request impersonation and multi-threaded fragment transfers, the tool ensures reliable data retrieval even in restrictive network environments. Beyond its core downloading capabilities, the project provides a comprehensive suite of tools for media archiving and programmatic integration. It includes support for advanced metadata extraction, template-based filesystem organization, and post-processing hooks that allow for custom workflows such as segment filtering or automated file tagging. The software can be embedded directly into other applications via a native programming interface, offering a flexible foundation for custom media management pipelines. The tool is available as a standalone binary or via standard package managers, with support for configuration through environment variables, external scripts, and secure credential management.
clidownloaderpython
yt-dlp/yt-dlp
papers-we-love/papers-we-love
103,417
Papers We Love is a community-driven repository and learning network dedicated to the study and discussion of foundational computer science literature. It functions as a centralized educational archive, providing a structured environment where software professionals can engage with academic research to bridge the gap between theoretical concepts and practical application. The project distinguishes itself through a decentralized model of crowdsourced curation, where community members collectively maintain and categorize a vast index of technical resources. Beyond the repository itself, the initiative supports a global network of autonomous regional chapters that operate under shared governance standards to facilitate in-person knowledge sharing. This ecosystem is further supported by an extensive library of archived expert presentations and curated reading methodologies designed to improve technical literature literacy. The platform organizes its scholarly resources through a hierarchical directory structure, enabling efficient navigation and version-controlled tracking of academic content. It provides tools for discovering external research repositories, establishing contribution standards for collaborative growth, and developing community-focused applications that extend the utility of the shared knowledge base.
awesomecomputer-sciencemeetup
papers-we-love/papers-we-love
electron/electron
120,164
This framework provides a multi-process architecture for building desktop applications using web technologies. It manages the application lifecycle, window states, and system-level integrations through a primary entry point, while isolating web content in separate rendering processes to maintain stability and security. A secure bridge mechanism facilitates communication between these isolated contexts and the main process, ensuring that privileged system APIs remain protected. The framework distinguishes itself through a comprehensive security model that includes process sandboxing, content policy enforcement, and strict validation of inter-process communication. It offers specialized tooling for native module management, allowing developers to integrate binary dependencies across different architectures. Furthermore, the system includes built-in support for accessibility management and automated testing via standard browser-automation protocols. Developers have access to a suite of utilities for performance optimization, including code bundling, background task offloading, and resource profiling. The framework also provides a complete toolset for packaging applications and generating platform-specific installers for distribution.
c-plus-pluschromecss
electron/electron
WerWolv/ImHex
52,656
ImHex is a professional-grade hex editor and binary data analysis platform designed for inspecting, modifying, and reverse engineering raw file contents. It functions as a schema-driven engine that interprets complex binary structures by applying custom definitions to map and visualize byte-level data. The platform distinguishes itself through a dedicated domain-specific language that allows users to define structural schemas for automated file parsing. This capability is supported by a dynamic plugin architecture and an event-driven registry, which enable the integration of external modules to extend core functionality and support specialized file formats. The system utilizes memory-mapped file access to handle large datasets and provides an immediate mode graphical interface for responsive data visualization. Users can maintain and share collections of format definitions to standardize the interpretation of various binary types, while a development kit facilitates the creation of custom extensions for specific analysis requirements.
analyzerbinary-analysisc-plus-plus
WerWolv/ImHex
TheAlgorithms/Python
217,914
This project is a comprehensive repository of verified computational implementations designed to serve as an educational resource for computer science and algorithmic problem solving. It provides a structured collection of code examples that cover fundamental data structures, mathematical operations, and core programming concepts, allowing users to study the logic and complexity behind various computational methods. The repository distinguishes itself through a modular, reference-based implementation pattern that organizes code into logical namespaces. This approach facilitates independent execution and educational clarity, enabling users to explore the evolution of computational strategies from naive brute-force approaches to optimized, high-performance solutions. By decoupling data structure abstractions from algorithmic operations, the project ensures that implementations remain interchangeable and easy to analyze. The capability surface spans a wide range of technical domains, including machine learning, cryptography, scientific computing, and computer vision. It includes implementations for predictive modeling, neural networks, and statistical analysis, alongside tools for digital signal processing, network flow management, and financial modeling. The collection also addresses specialized mathematical needs, such as linear algebra, geometric calculations, and bit manipulation, providing a broad foundation for research and engineering applications.
algorithmalgorithm-competitionsalgorithms-implemented
TheAlgorithms/Python
marktext/marktext
53,968
Marktext is a cross-platform desktop application designed for markdown document authoring and structured note-taking. It functions as a WYSIWYG text processor, providing a distraction-free interface that renders formatted content in real-time while hiding the underlying markup syntax. The application utilizes a multi-process architecture that separates system integration from the user interface, ensuring consistent performance across Windows, macOS, and Linux. By employing a custom editor core built on native browser capabilities and a structured syntax tree, it manages complex document elements such as mathematical expressions, diagrams, and code blocks. The software includes a plugin-based extension system that allows for the injection of custom functionality and interface components. It is distributed as an open-source project, maintaining a consistent environment for technical documentation and personal knowledge management.
dark-modeeditorelectron
marktext/marktext
rust-lang/rustlings
61,890
Rustlings is a command-line learning tool designed to build language proficiency through a structured, interactive curriculum. It functions as a practice-oriented platform where users master syntax and core concepts by resolving compilation errors within a sequence of small, incremental code exercises. The environment distinguishes itself by utilizing a compiler-driven feedback loop that parses error messages to provide targeted hints for fixing logic and syntax issues. Progress is managed through a file-based system where users modify incomplete source templates, which are then verified against the official language toolchain to ensure the exercises reflect real-world development workflows. The platform supports self-paced skill acquisition by monitoring source file changes in real-time, allowing for immediate re-compilation and rapid feedback. This approach reinforces programming fundamentals by requiring users to successfully compile each challenge before advancing to more complex topics.
beginner-friendlyexercisesrust
rust-lang/rustlings
Stirling-Tools/Stirling-PDF
74,357
Stirling-PDF is a self-hosted document processing suite designed for secure, private file management. It functions as a comprehensive transformation engine that executes complex operations—such as merging, splitting, converting, and redacting documents—directly on the host machine. The platform provides both a browser-based interface for interactive editing and a programmatic, API-first architecture that allows for the automation of document workflows through standard HTTP requests. The project distinguishes itself through its focus on private, infrastructure-agnostic deployment and granular security. It supports role-based access control and stateless session authentication, ensuring that sensitive operations remain protected within a user-controlled environment. By offering a unified interface for sequential file transformations, it enables users to chain multiple processing tasks into single, automated pipelines while maintaining full control over document integrity and security. The system covers a broad range of document manipulation capabilities, including optical character recognition, digital signature validation, and advanced layout operations like booklet imposition and page reorganization. It is built for flexible integration, supporting deployment across containerized environments, bare metal, or native desktop installations. Configuration is managed through environment variables, YAML files, or the web interface, allowing for consistent behavior across diverse infrastructure setups.
dockerhacktoberfestjava
Stirling-Tools/Stirling-PDF
meta-llama/llama
59,157
Llama is a computational framework and runtime environment designed for executing transformer-based neural networks locally. It functions as a generative AI inference engine, enabling the processing of input sequences through pre-trained model weights to produce text completions and structured data outputs directly on your own hardware. The system distinguishes itself through specialized memory and computation management techniques, including memory-mapped weight loading and quantization-aware inference, which allow for efficient execution on standard consumer hardware. It utilizes a stateless request execution model and a tensor-based computation graph to handle token-based sequence processing, ensuring that each inference task operates independently without reliance on persistent server state. This project provides the necessary tools for local large language model deployment, including a command-line interface for retrieving authorized model checkpoints and configuration files. It supports offline research and the integration of text generation capabilities into custom software applications, allowing users to manage model parameters such as sequence length and batch size to meet specific performance requirements.
meta-llama/llama
deepseek-ai/DeepSeek-V3
101,631
DeepSeek-V3 is a large language model that provides comprehensive resources for model utilization, including technical specifications, pre-trained weights, and evaluation benchmarks. The project details the core transformer architecture, including parameter counts and multi-token prediction modules, while supporting native 8-bit floating-point quantization. The repository offers extensive support for local and distributed inference through integration with multiple frameworks and engines. It includes documentation for deploying the model across various hardware configurations, such as GPUs and specialized accelerators, alongside guidance for environment setup and weight conversion. Users can access the model through a web-based chat interface, a programmatic API, or by downloading weights for local execution.
deepseek-ai/DeepSeek-V3
microsoft/markitdown
87,305
This project is an AI-powered document processing engine designed to transform diverse file formats into structured Markdown. By leveraging multimodal language models, it performs complex layout analysis and semantic text extraction, allowing for the conversion of both unstructured files and scanned images into machine-readable content. The toolkit distinguishes itself through a modular, plugin-based architecture that orchestrates multi-stage extraction pipelines. Users can steer the parsing behavior by injecting custom instructions, enabling the system to adapt to domain-specific document structures and formatting requirements. This flexibility is supported by an integrated optical character recognition capability that ensures text recovery from embedded images during the conversion process. The system provides both a command-line interface and a programmatic library, facilitating automated batch processing and custom integration into data pipelines. To ensure consistent performance across different environments, the project supports deployment within containerized architectures that encapsulate all necessary system-level dependencies and binaries.
autogenautogen-extensionlangchain
microsoft/markitdown
langchain-ai/langchain
127,015
LangChain is an orchestration framework designed for building, managing, and deploying applications powered by large language models. It provides a unified integration layer that normalizes disparate model provider APIs into a consistent set of primitives, enabling developers to build complex, multi-step AI workflows that manage state, memory, and tool execution. The project distinguishes itself through a durable execution runtime that maintains persistent state across long-running processes by checkpointing progress to external storage. It models agent workflows as directed graphs, allowing for explicit node-to-node routing and state management. Furthermore, it includes a human-in-the-loop control layer that enables developers to pause execution at defined breakpoints, allowing for manual inspection, modification, and approval of agent actions during runtime. Beyond its core orchestration capabilities, the framework supports a tiered memory architecture that separates short-term conversation context from long-term persistent data. It also provides comprehensive observability tools for tracing and monitoring execution flows, alongside security features for managing authentication and fine-grained access control. The platform is supported by extensive documentation and standardized interfaces for models, embeddings, and data sources to facilitate the development of production-grade agentic systems.
agentsaiai-agents
langchain-ai/langchain
tailwindlabs/tailwindcss
93,668
Utility-first CSS framework for fast, design-system-friendly styling.
cssframeworkutility
tailwindlabs/tailwindcss
MisterBooo/LeetCodeAnimation
76,717
LeetCodeAnimation is an educational code archive and technical interview resource designed to help developers master complex programming concepts. It functions as a centralized repository of source code and instructional materials, providing a structured environment for self-paced learning of fundamental computer science algorithms and data structures. The project distinguishes itself by integrating visual algorithm simulations directly into its learning path. By mapping static educational content to animated media files, it demonstrates the step-by-step execution flow and internal state changes of sorting logic and data structures. This approach bridges the gap between abstract theoretical concepts and practical, executable code implementations. The repository utilizes cross-referenced indexing and markdown-based documentation to organize its knowledge base. It aggregates technical explanations and code samples into a unified structure, allowing users to navigate between problem identifiers, descriptive articles, and visual assets to support their preparation for technical assessments.
animationleetcodeleetcode-c
MisterBooo/LeetCodeAnimation
swiftlang/swift
69,781
Swift is a high-performance, general-purpose programming language designed for safety and speed. It features a modular compiler front-end that transforms source code into optimized machine binaries, utilizing a value-oriented type system that prioritizes predictable state management through value and reference types. The language is built on a task-based concurrency model that schedules asynchronous operations across multicore hardware to ensure data race safety. The project distinguishes itself through a native, bi-directional interoperability mechanism that allows for direct integration with existing codebases and external APIs without requiring complex foreign function interfaces. This capability is supported by a declarative, manifest-based build system that manages dependencies and cross-platform toolchain orchestration. Furthermore, the language provides a standardized language server protocol implementation, enabling real-time diagnostics, code completion, and refactoring across a wide range of development environments. The ecosystem covers a broad capability surface, including support for static binary compilation to ensure portability across diverse system environments and specialized tooling for cloud-native backend development. It provides comprehensive infrastructure for multi-platform application development, including cross-compilation support for Android, Linux, and WebAssembly targets. Developers can also leverage integrated debugging, testing, and interactive playground environments to streamline the software validation process. The project maintains its compiler, standard library, and evolution proposals through a primary source code repository, which includes extensive documentation and guided references for developers.
swiftlang/swift
firecrawl/firecrawl
84,034
Firecrawl is a web data extraction platform designed to convert unstructured web content into clean, LLM-ready formats like markdown or JSON. It functions as an autonomous web crawler and scraper, capable of mapping entire domains, performing recursive navigation, and executing complex data gathering tasks. By leveraging headless browser orchestration, the system handles dynamic, JavaScript-heavy pages to ensure comprehensive data capture. The platform distinguishes itself through its focus on agentic workflows, providing a programmatic interface that allows autonomous agents to perform live web research, interact with pages, and execute multi-step navigation tasks. It supports distributed crawling infrastructure, enabling users to scale data collection across multiple nodes while managing concurrency and long-running jobs through asynchronous queueing. The system also integrates with agentic frameworks via standardized protocols, allowing for seamless connection to AI-powered clients and automated pipelines. Beyond its core extraction capabilities, the project provides a suite of developer tools for site mapping, batch scraping, and web searching. It includes features for stateful session persistence, webhook-based notifications, and configurable crawl depth, allowing for granular control over how information is retrieved and processed. The project offers comprehensive API documentation and SDKs to facilitate integration into backend services and local development environments. Users can deploy the crawling infrastructure within their own private networks or utilize managed cloud services.
aiai-agentsai-crawler
firecrawl/firecrawl

Browse repositories

jackfrued/Python-100-Days

rclone/rclone

ngosang/trackerslist

mermaid-js/mermaid

yt-dlp/yt-dlp

papers-we-love/papers-we-love

electron/electron

WerWolv/ImHex

TheAlgorithms/Python

marktext/marktext

rust-lang/rustlings

Stirling-Tools/Stirling-PDF

meta-llama/llama

deepseek-ai/DeepSeek-V3

microsoft/markitdown

langchain-ai/langchain

tailwindlabs/tailwindcss

MisterBooo/LeetCodeAnimation

swiftlang/swift

firecrawl/firecrawl