27 repos
Explore 27 awesome GitHub repositories matching artificial intelligence & ml · Optimization & Inference. Refine with filters or upvote what's useful.
This project is a comprehensive educational roadmap designed to guide software engineers through the mastery of computer science fundamentals and technical interview preparation. It provides a structured, dependency-aware learning path that organizes complex computing concepts into a hierarchical curriculum, enabling u
Master the mathematical foundations of objective function optimization and constraint satisfaction essential for algorithmic problem solving.
The Linux kernel is a monolithic operating system kernel that serves as the primary interface between computer hardware and software applications. It provides the foundational infrastructure for managing system resources, including memory allocation, process scheduling, and synchronization primitives. The project inclu
Simplifies the selection and configuration of specific drivers and features during the software build process.
This project is a comprehensive repository of verified computational implementations designed to serve as an educational resource for computer science and algorithmic problem solving. It provides a structured collection of code examples that cover fundamental data structures, mathematical operations, and core programmi
Resolve objective functions under linear constraints to determine the most efficient resource distribution.
TensorFlow is a comprehensive machine learning framework designed for the construction, training, and deployment of complex mathematical models. It utilizes a graph-based execution model that represents operations as directed acyclic graphs, enabling automatic differentiation and efficient parallel processing. The syst
Executes high-performance element-wise functions, trigonometric operations, and logical reductions across multi-dimensional arrays.
Stable Diffusion Web UI is a browser-based interface designed for managing text-to-image generation tasks. It provides a centralized dashboard for controlling generative processes, including native support for multi-stage model architectures to facilitate high-quality image refinement. The platform distinguishes itsel
Configures hardware-specific settings to leverage NVIDIA graphics processing units for accelerated computation.
Transformers is a comprehensive library for machine learning that provides a unified interface for training, fine-tuning, and deploying transformer-based models. It supports a wide range of tasks, including text classification, language modeling, question answering, and sequence-to-sequence translation, while offering
Coordinates data, pipeline, and tensor parallelism to scale large-scale model training across multi-node clusters.
This project is a high-level 3D graphics engine designed to render complex, hardware-accelerated environments within web browsers. It provides a comprehensive abstraction layer that manages scene graphs, cameras, and lighting, mapping high-level scene definitions onto low-level graphics APIs. By decoupling these defini
Improves rendering efficiency for large object counts through techniques like instancing and batching.
Godot is a comprehensive, node-based game engine designed for building interactive 2D and 3D applications. It provides an integrated development environment that utilizes a hierarchical scene system to organize objects, propagate spatial transformations, and manage lifecycle events. The engine functions as a cross-plat
Normalizes hardware-specific tasks like input, audio, and file I/O across heterogeneous deployment targets.
Ant Design is an enterprise-grade component library and design system framework built for developing complex, data-heavy web applications. It provides a comprehensive collection of pre-built, state-driven interface elements that map data properties to rendered components, ensuring consistent interaction patterns and vi
Minimizes unnecessary re-renders by memoizing context values and splitting large data objects within the provider hierarchy.
PyTorch is a machine learning framework centered on a GPU-ready tensor library that supports multi-dimensional array operations across both CPU and accelerator hardware. It provides a foundational infrastructure for mathematical computation and dynamic neural network construction, utilizing a tape-based automatic diffe
Accelerates multi-dimensional array operations by leveraging native GPU and specialized hardware support.
This repository serves as a comprehensive collection of resources, templates, and starter code for building artificial intelligence applications. It provides a centralized hub for developers to access practical implementations of common workflows, including retrieval-augmented generation pipelines and autonomous agent
Utilities and techniques help reduce token consumption and operational costs while preserving output quality.
Llama.cpp is an inference engine designed for the local execution of text-based and multimodal language models on consumer hardware. It provides a core environment for running models that process both text and image inputs, utilizing hardware-accelerated backends to optimize performance across diverse CPU and GPU archi
Unifies diverse CPU and GPU architectures through a common interface to normalize model execution across heterogeneous hardware.
This project provides a command-line interface for managing autonomous agent workflows, task orchestration, and system-level automation. It includes a comprehensive framework for defining agent skills, managing persistent memory, and delegating tasks to specialized subagents. Users can configure complex planning modes,
Latency and API costs are minimized by storing previously processed prompt tokens for reuse in subsequent requests.
Utility-first CSS framework for fast, design-system-friendly styling.
Boosts rendering performance by leveraging a high-speed engine that optimizes 3D transforms and real-time visual updates.
Gin is a web framework designed for building high-performance web services and APIs. It functions as a middleware-oriented engine that processes incoming HTTP requests through a sequential chain of handlers, allowing for the modular management of cross-cutting concerns such as authentication and logging. The framework
Bypasses runtime reflection overhead by invoking handler functions through direct static type assertions.
Home Assistant is a centralized home automation platform designed to orchestrate diverse internet-connected devices and services. It functions as a local-first control system that normalizes heterogeneous hardware protocols into a unified set of entities, attributes, and services. The core architecture relies on an eve
Normalizes heterogeneous hardware protocols into a consistent set of entities, attributes, and services.
Deep-Live-Cam is a generative video transformation tool designed for real-time facial manipulation and cinematic enhancement. It functions as a local-first AI runtime, performing all media processing directly on the user's hardware to ensure complete data privacy without external network dependencies. By utilizing a hi
Routes model inference tasks to hardware-specific acceleration APIs like CUDA or CoreML.
This project is a community-driven knowledge base and curated repository focused on natural language processing and large language model development. It serves as a centralized index for high-quality tools, libraries, and research materials, organizing technical resources into structured, version-controlled documentati
Indexes optimization techniques, training methodologies, and low-resource deployment strategies for large-scale language models.
This project is an open-source, interactive educational platform designed to teach deep learning through a comprehensive, code-first curriculum. It provides a structured learning path that covers foundational mathematics, modern neural network architectures, and practical optimization techniques, enabling practitioners
Features implementations of adaptive moment estimation to optimize stochastic objective functions.
This project is a community-driven educational repository that serves as a comprehensive directory of university-level computer science video lectures. It provides a structured learning path for students and professionals, aggregating high-quality academic resources to facilitate self-paced study across a wide range of
Bundles academic resources that explain the mathematical methods used to optimize machine learning models.