11 repos
Explore 11 awesome GitHub repositories matching artificial intelligence & ml · Hardware & Acceleration. Refine with filters or upvote what's useful.
The Linux kernel is a monolithic operating system kernel that serves as the primary interface between computer hardware and software applications. It provides the foundational infrastructure for managing system resources, including memory allocation, process scheduling, and synchronization primitives. The project inclu
Standardizes the management of sound cards and low-level audio processing drivers through a robust hardware framework.
TensorFlow is a comprehensive machine learning framework designed for the construction, training, and deployment of complex mathematical models. It utilizes a graph-based execution model that represents operations as directed acyclic graphs, enabling automatic differentiation and efficient parallel processing. The syst
Executes high-performance element-wise functions, trigonometric operations, and logical reductions across multi-dimensional arrays.
Stable Diffusion Web UI is a browser-based interface designed for managing text-to-image generation tasks. It provides a centralized dashboard for controlling generative processes, including native support for multi-stage model architectures to facilitate high-quality image refinement. The platform distinguishes itsel
Configures hardware-specific settings to leverage NVIDIA graphics processing units for accelerated computation.
Godot is a comprehensive, node-based game engine designed for building interactive 2D and 3D applications. It provides an integrated development environment that utilizes a hierarchical scene system to organize objects, propagate spatial transformations, and manage lifecycle events. The engine functions as a cross-plat
Normalizes hardware-specific tasks like input, audio, and file I/O across heterogeneous deployment targets.
PyTorch is a machine learning framework centered on a GPU-ready tensor library that supports multi-dimensional array operations across both CPU and accelerator hardware. It provides a foundational infrastructure for mathematical computation and dynamic neural network construction, utilizing a tape-based automatic diffe
Accelerates multi-dimensional array operations by leveraging native GPU and specialized hardware support.
Llama.cpp is an inference engine designed for the local execution of text-based and multimodal language models on consumer hardware. It provides a core environment for running models that process both text and image inputs, utilizing hardware-accelerated backends to optimize performance across diverse CPU and GPU archi
Unifies diverse CPU and GPU architectures through a common interface to normalize model execution across heterogeneous hardware.
Home Assistant is a centralized home automation platform designed to orchestrate diverse internet-connected devices and services. It functions as a local-first control system that normalizes heterogeneous hardware protocols into a unified set of entities, attributes, and services. The core architecture relies on an eve
Normalizes heterogeneous hardware protocols into a consistent set of entities, attributes, and services.
Deep-Live-Cam is a generative video transformation tool designed for real-time facial manipulation and cinematic enhancement. It functions as a local-first AI runtime, performing all media processing directly on the user's hardware to ensure complete data privacy without external network dependencies. By utilizing a hi
Routes model inference tasks to hardware-specific acceleration APIs like CUDA or CoreML.
This project is an open-source, interactive educational platform designed to teach deep learning through a comprehensive, code-first curriculum. It provides a structured learning path that covers foundational mathematics, modern neural network architectures, and practical optimization techniques, enabling practitioners
Discusses optimization techniques for leveraging hardware acceleration to improve throughput in large-scale model training.
nanoGPT is a lightweight engine for training and fine-tuning transformer-based language models from scratch. It provides a minimalist codebase designed for educational exploration and rapid experimentation with neural network architectures, utilizing self-attention and feed-forward layers to process sequences and predi
Executes high-dimensional array operations and mathematical functions essential for training deep neural networks.
This repository provides a collection of practical demonstrations and implementation guides for machine learning tasks using TensorFlow.js. It serves as a resource for developers to explore model architectures, training workflows, and data manipulation techniques across domains such as computer vision, natural language
Explicit disposal methods for layer and model objects enable developers to reclaim GPU-resident memory in environments lacking automatic garbage collection.