488 repos
Explore 488 awesome GitHub repositories matching category · DevOps & Infrastructure. Refine with filters or upvote what's useful.
This project is a collaborative knowledge base and technical learning resource that provides a detailed breakdown of the internal processes occurring within modern computing environments. It serves as a comprehensive educational reference, tracing the step-by-step operations triggered by common user interactions and network requests to explain how hardware and software components interact across the entire stack. The guide distinguishes itself by offering deep technical insights into the journey from physical input to visual output. It covers the low-level mechanics of hardware interrupt hand
Copyparty is a self-hosted file server that provides a browser-based interface for managing, browsing, uploading, and downloading files. It utilizes a virtual file system abstraction to map diverse storage backends and network-attached devices into a unified directory structure, allowing for consistent file access across various storage environments. The platform functions as a cloud synchronization gateway, enabling automated data backups and transfers between local storage and remote cloud providers through integration with standard command-line tools. It also serves as a distributed storag
Alamofire is an HTTP networking library that provides a foundation for managing network requests and responses through a chainable, type-safe interface. It serves as an asynchronous request manager, coordinating concurrent network operations and data streams while maintaining application responsiveness. The library distinguishes itself through a protocol-oriented request adaptation system, which utilizes interceptors to modify or authenticate requests before dispatch. It employs a middleware-driven pipeline to process traffic, handling encoding, authentication, and error recovery in a modular
Mitmproxy is an interactive, programmable network proxy engine designed for traffic analysis and protocol manipulation. It functions as a gateway that intercepts, inspects, and modifies network traffic in real-time, supporting HTTP, HTTPS, WebSocket, DNS, and generic TCP or UDP streams. By acting as a trusted certificate authority, the proxy can dynamically generate and sign certificates to decrypt and analyze secure TLS-encrypted connections. The project distinguishes itself through a highly extensible, event-driven architecture that allows users to automate traffic transformation using cust
Pandoc is a universal document converter that translates content between a wide range of markup and binary formats. It functions by parsing input documents into a unified intermediate abstract syntax tree, which serves as the foundation for consistent manipulation and transformation across diverse output types. The system is distinguished by its modular reader-writer pipeline, which decouples input parsing from output generation to allow for granular control over document structure. Users can programmatically manipulate this intermediate tree through a robust filter system, supporting both ex
Umi-OCR is an optical character recognition engine designed to convert visual text from images and documents into machine-readable character data. It functions as a local-first toolkit, processing all visual data directly on the host machine using embedded neural network models to maintain privacy and offline availability. The project distinguishes itself through its focus on automated document digitization and integrated barcode and QR code decoding. By utilizing a modular, Python-based orchestration layer, it enables users to transform static image files and multi-page documents into search
This project is a terminal multiplexer that enables multiple terminal sessions to run simultaneously within a single window or a detached background process. By decoupling the client interface from a persistent server process, it allows users to maintain long-running command-line tasks that continue to execute even after disconnecting from a remote host. The system functions as a terminal window manager and process controller, providing a text-based interface to organize multiple shell processes into custom tiled layouts. It distinguishes itself through a programmable command-line interface t
RSSHub is a headless, server-side engine designed to generate standardized RSS and Atom feeds from websites that do not natively provide them. By acting as an extensible data aggregator, it enables the automated collection of web content, allowing users to monitor updates from disparate sources through centralized feed readers or workflow automation tools. The platform distinguishes itself through a route-based data extraction framework that maps specific URL patterns to custom scraping logic. This modular architecture is supported by a middleware-driven request pipeline and declarative route
Zod is a TypeScript-first schema declaration and validation library designed to ensure end-to-end data integrity. It functions as a runtime type guard, allowing developers to define complex data structures through a declarative, chainable syntax. By using these schema definitions, the library automatically derives static TypeScript types, eliminating the need for manual type duplication and ensuring that runtime data matches expected application contracts. The library distinguishes itself through functional schema composition, which enables the creation of hierarchical structures by nesting a
fd is a command-line program designed to locate files and directories within a filesystem. It serves as an alternative to traditional search utilities by providing a simplified interface for finding entries based on patterns and metadata. The tool emphasizes performance and ease of use by incorporating colorized output and smart case sensitivity by default. It automatically ignores hidden files and patterns specified in version control ignore files, streamlining the search process by focusing on relevant project content. The utility supports regular expression syntax for complex matching and
DeepSpeed is a high-performance library designed to scale deep learning model training and inference across massive clusters of GPUs and compute nodes. It provides a comprehensive suite of tools for distributed training, enabling the execution of models that exceed the memory capacity of single devices through advanced parameter partitioning, pipeline-based model parallelism, and memory-efficient state offloading. The framework distinguishes itself through specialized communication-efficient optimizers and hardware-aware acceleration techniques. By utilizing gradient compression, quantization
This project is a full-stack web application scaffolder designed to initialize production-ready projects with pre-configured database, authentication, and deployment settings. It provides a standardized starting point for development by generating a complete application structure that includes integrated backend, frontend, and database components. The template distinguishes itself through a type-safe integration layer that automatically synchronizes backend API definitions with frontend client code, ensuring consistent data exchange. It also features a containerized development environment th
Exo is a distributed inference engine designed to run machine learning models across local hardware. It functions as a network orchestration layer that automatically discovers available devices to form a unified computing cluster, allowing users to scale artificial intelligence workloads by distributing computational tasks across multiple machines. The platform distinguishes itself through its ability to manage the entire lifecycle of local models while providing a standardized gateway for external applications. By translating local model outputs into industry-standard formats, it enables exi
Yarn is a command-line package manager for JavaScript projects that automates the installation, versioning, and configuration of external code dependencies. It functions as a deterministic build tool, utilizing a lockfile to calculate a fixed dependency graph that ensures identical package versions across development, testing, and production environments. The project distinguishes itself through a content-addressable storage system that indexes packages by hash to eliminate redundant downloads and enable instant linking. It incorporates a virtual file system mapping that presents a unified vi
Ray is a distributed computing framework designed to scale Python and Java applications across clusters by abstracting task scheduling and resource management. It functions as a resource-aware execution engine that manages task dependencies, placement, and fault tolerance across networked compute nodes. At its core, the system provides a stateful actor model, allowing developers to define classes that run in dedicated processes to maintain and mutate internal state across remote method calls. The framework distinguishes itself through a robust cross-language interoperability layer, enabling f
This project is an uncompromising, deterministic code formatter for Python. It functions by parsing source code into an abstract syntax tree and regenerating it according to a rigid, opinionated set of style rules. By automating the formatting process, it eliminates manual style debates and configuration overhead, ensuring that code remains consistent across entire projects regardless of the original input. The tool distinguishes itself through its focus on speed and seamless integration into development workflows. It utilizes content-based file caching and parallel processing to maintain hig
This project is a curated knowledge base and technical reference for shell scripting, focused on performing common system tasks using only built-in shell features. It serves as a guide for implementing logic and automation without relying on external binaries or dependencies, ensuring scripts remain portable across standard Unix-like environments. The repository distinguishes itself by emphasizing native shell functions and syntax to minimize process forking and improve execution performance. It provides idiomatic patterns for complex string transformations, pattern matching, and data flow ma
Siyuan is a self-hosted knowledge management platform designed for private note-taking and information organization. It functions as a local-first application that stores all user content as plain text files on the local file system, ensuring data ownership and offline availability. The platform utilizes a block-based document model, which structures information as a tree of independent content blocks to facilitate granular manipulation and bidirectional linking. Users can extend the core functionality through a sandboxed plugin architecture, allowing for the development of custom themes and
ColossalAI is a deep learning system designed to facilitate the training and inference of large-scale artificial intelligence models. It provides a unified framework for distributed computing, enabling the scaling of model parameters and data across multiple hardware accelerators. The project focuses on optimizing memory usage and computational efficiency through advanced parallelization strategies. By integrating techniques for data, pipeline, and tensor parallelism, it allows for the management of models that exceed the capacity of individual devices. The system includes a comprehensive su
Hexo is a command-line static site generator designed for content-driven blogging and website creation. It functions as a structured framework that transforms plain text files and markdown into production-ready static websites, utilizing a template-based rendering engine to separate site content from visual presentation. The project is distinguished by its event-driven build pipeline, which manages the entire site lifecycle through a series of hooks for file processing, asset generation, and deployment. Developers can extend the system’s core capabilities through a modular plugin architecture