# pytorch/vision

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/pytorch-vision).**

17,522 stars · 7,208 forks · Python · bsd-3-clause

## Links

- GitHub: https://github.com/pytorch/vision
- Homepage: https://pytorch.org/vision
- awesome-repositories: https://awesome-repositories.com/repository/pytorch-vision.md

## Topics

`computer-vision` `machine-learning`

## Description

Vision is a comprehensive library for computer vision built for the PyTorch ecosystem. It serves as a central repository for deep learning research and production tasks, providing a collection of standardized datasets, modular model architectures, and high-performance image transformation utilities.

The project distinguishes itself by offering a deep learning model zoo that includes pre-trained architectures for image classification, object detection, and segmentation. It supports the entire lifecycle of computer vision development, from preprocessing and augmenting raw visual data to deploying optimized models on edge devices or scaling training across distributed computing clusters.

Beyond its core vision capabilities, the library facilitates generative image synthesis and multimodal data processing. It provides tools for configuring model parameters and implementing lightweight architectures, ensuring that developers can tailor neural networks to specific hardware requirements while maintaining performance.

The library is designed to integrate directly into existing PyTorch workflows, allowing users to instantiate standard architectures and pre-computed weights for immediate use in research or application development.

## Tags

### Artificial Intelligence & ML

- [Computer Vision Libraries](https://awesome-repositories.com/f/artificial-intelligence-ml/computer-vision-libraries.md) — Provides a comprehensive library of datasets, model architectures, and transformations for computer vision.
- [Computer Vision Models](https://awesome-repositories.com/f/artificial-intelligence-ml/computer-vision-models.md) — Provides foundational neural network architectures for image classification, object detection, and segmentation.
- [Computer Vision Preprocessing](https://awesome-repositories.com/f/artificial-intelligence-ml/computer-vision-preprocessing.md) — Offers a high-performance toolkit for transforming, normalizing, and augmenting image data.
- [Object Detection](https://awesome-repositories.com/f/artificial-intelligence-ml/computer-vision-systems/computer-vision/object-detection-tracking/object-detection.md) — Identifies and localizes multiple objects within images using single-pass neural networks. ([source](https://pytorch.org/hub/nvidia_deeplearningexamples_ssd/))
- [Pretrained Model Integrations](https://awesome-repositories.com/f/artificial-intelligence-ml/custom-model-training/pretrained-model-integrations.md) — Initializes neural network architectures with pre-trained weights to enable immediate inference or fine-tuning. ([source](https://pytorch.org/hub/nvidia_deeplearningexamples_ssd/))
- [Deep Learning Architectures](https://awesome-repositories.com/f/artificial-intelligence-ml/deep-learning-architectures.md) — Maintains a repository of pre-trained neural network architectures and weights for vision tasks.
- [Vision Model Loaders](https://awesome-repositories.com/f/artificial-intelligence-ml/vision-model-loaders.md) — Instantiates standard deep learning architectures with pre-computed weights for immediate use. ([source](https://pytorch.org/hub/pytorch_vision_alexnet/))
- [Image Classification Models](https://awesome-repositories.com/f/artificial-intelligence-ml/computer-vision-systems/computer-vision/image-classification-models.md) — Provides pre-trained densely connected convolutional architectures for image classification tasks. ([source](https://pytorch.org/hub/pytorch_vision_densenet/))
- [Distributed Training](https://awesome-repositories.com/f/artificial-intelligence-ml/distributed-training-frameworks/distributed-training.md) — Provides frameworks for scaling neural network training across multiple devices and computing clusters.
- [Large-Scale Training Frameworks](https://awesome-repositories.com/f/artificial-intelligence-ml/large-scale-training-frameworks.md) — Scales neural network training across distributed computing clusters using differentiable collectives for gradient synchronization. ([source](https://pytorch.org/newsletter/march-2026/))
- [Edge AI Model Deployment](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-deployment-and-serving/local-and-on-device-inference/edge-ai-model-deployment.md) — Optimizes and exports neural network models for efficient execution on edge devices and cloud infrastructure.
- [Automatic Differentiation Frameworks](https://awesome-repositories.com/f/artificial-intelligence-ml/automatic-differentiation-frameworks.md) — Provides a comprehensive engine for automatic differentiation to enable gradient-based optimization of neural networks.
- [Hardware Acceleration Kernels](https://awesome-repositories.com/f/artificial-intelligence-ml/hardware-acceleration-kernels.md) — Executes specialized mathematical operations and kernel fusion to improve inference speed on hardware. ([source](https://pytorch.org/blog/portable-vllm-model-inference-kernels-in-helion/))
- [Generative Adversarial Architectures](https://awesome-repositories.com/f/artificial-intelligence-ml/image-generation/generative-adversarial-architectures.md) — Supports generative image synthesis through pre-trained generative adversarial network architectures.
- [Inference Acceleration](https://awesome-repositories.com/f/artificial-intelligence-ml/inference-acceleration.md) — Compiles computational graphs to accelerate inference and reduce processing time for complex architectures. ([source](https://pytorch.org/blog/pytorch-meetup-singapore-a-milestone-in-apac/))
- [Distributed Gradient Synchronization](https://awesome-repositories.com/f/artificial-intelligence-ml/gradient-computation/distributed-gradient-synchronization.md) — Synchronizes model parameters and gradients across multiple compute nodes to scale training workloads efficiently.
- [Hardware Dispatchers](https://awesome-repositories.com/f/artificial-intelligence-ml/hardware-acceleration-kernels/hardware-dispatchers.md) — Automatically selects and registers efficient compute kernels for rapid model execution. ([source](https://pytorch.org/blog/portable-vllm-model-inference-kernels-in-helion/))
- [Large Language Model Optimization](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-optimization-and-inference/serving-and-runtime/large-language-model-optimization.md) — Provides specialized utilities to improve the speed and efficiency of large language model operations. ([source](https://pytorch.org/blog/pytorch-meetup-singapore-a-milestone-in-apac/))
- [Multimodal Data Preprocessing Utilities](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-training-and-tuning/data-and-checkpointing/data-preprocessing/multimodal-data-preprocessing-utilities.md) — Provides utilities for transforming raw visual and sensory data into high-performance tensor formats.
- [Tokenizers](https://awesome-repositories.com/f/artificial-intelligence-ml/natural-language-processing/tokenizers.md) — Converts raw text into numerical tokens compatible with transformer architectures for machine learning analysis. ([source](https://pytorch.org/hub/huggingface_pytorch-transformers/))
- [Lightweight Model Implementations](https://awesome-repositories.com/f/artificial-intelligence-ml/neural-network-implementations/lightweight-model-implementations.md) — Provides streamlined neural network designs to achieve high performance with lower computational complexity. ([source](https://pytorch.org/blog/lf-model-type/vision/))
- [Dynamic Graph Builders](https://awesome-repositories.com/f/artificial-intelligence-ml/gradient-computation/dynamic-graph-builders.md) — Supports dynamic computational graph construction to enable flexible neural network operations and gradient-based optimization.
- [Model Inference and Serving](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-inference-serving.md) — Offers a unified platform for serving text, image, video, and speech models with modality-aware scheduling. ([source](https://pytorch.org/blog/pytorch-meetup-singapore-a-milestone-in-apac/))
- [Model Performance Optimization](https://awesome-repositories.com/f/artificial-intelligence-ml/model-optimization/profiling-and-benchmarking/model-performance-optimization.md) — Offers hardware-specific kernel optimizations and quantization techniques to enhance model speed and efficiency. ([source](https://pytorch.org/newsletter/march-2026/))
- [Language Model Pretraining](https://awesome-repositories.com/f/artificial-intelligence-ml/natural-language-processing/language-model-pretraining.md) — Implements optimized pretraining approaches for transformer-based language models. ([source](https://pytorch.org/blog/lf-model-type/nlp/))
- [Resolution Enhancers](https://awesome-repositories.com/f/artificial-intelligence-ml/image-super-resolution-models/resolution-enhancers.md) — Synthesizes high-resolution visual content by incrementally increasing output resolution during training. ([source](https://pytorch.org/blog/lf-model-type/generative/))
- [Model Compilation](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-optimization-and-inference/serving-and-runtime/inference-optimization-utilities/model-compilation.md) — Transforms trained models into optimized versions specifically prepared for efficient inference execution on target hardware. ([source](https://pytorch.org/blog/tag/media-entertainment/))
- [Inference Optimizations](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-optimization-and-inference/serving-and-runtime/inference-optimizations.md) — Provides techniques and mechanisms to reduce latency and increase throughput during model inference. ([source](https://pytorch.org/blog/pytorch-meetup-singapore-a-milestone-in-apac/))
- [Model Configuration](https://awesome-repositories.com/f/artificial-intelligence-ml/model-configuration.md) — Provides interfaces for configuring model parameters and architectural settings during initialization. ([source](https://pytorch.org/hub/huggingface_pytorch-transformers/))
- [Model Routing](https://awesome-repositories.com/f/artificial-intelligence-ml/model-routing.md) — Aggregates multiple inference engines and providers into a single gateway to unify request routing. ([source](https://pytorch.org/blog/lightseek-smg/))
- [Model Training Optimizers](https://awesome-repositories.com/f/artificial-intelligence-ml/model-training-optimizers.md) — Provides utilities and configurations for accelerating training convergence and performance. ([source](https://pytorch.org/blog/project/deepspeed/))
- [Multimodal Processing](https://awesome-repositories.com/f/artificial-intelligence-ml/multimodal-processing.md) — Executes high-performance vision preprocessing and tensor transformations for multimodal data streams. ([source](https://pytorch.org/blog/lightseek-smg/))

### Repository Format

- [Awesome List](https://awesome-repositories.com/f/repository-format/awesome-list.md) — A community-curated directory that catalogs and links out to other open-source projects, rather than a standalone tool you run yourself.

### DevOps & Infrastructure

- [Model Deployment Platforms](https://awesome-repositories.com/f/devops-infrastructure/model-deployment-platforms.md) — Orchestrates the execution of deep learning models across distributed cloud infrastructure for high-volume inference. ([source](https://pytorch.org/blog/tag/medical/))

### Software Engineering & Architecture

- [Modular Architectures](https://awesome-repositories.com/f/software-engineering-architecture/modular-architectures.md) — Organizes deep learning components into reusable layers and blocks to facilitate the construction of complex model topologies.

### Programming Languages & Runtimes

- [Kernel Fusion Operations](https://awesome-repositories.com/f/programming-languages-runtimes/runtime-execution-environments/runtime-environments/runtimes/graph-symbolic-execution-engines/operation-kernels/kernel-fusion-operations.md) — Combines multiple sequential mathematical operations into single compute kernels to minimize memory overhead and accelerate inference.

### Data & Databases

- [Tensor Execution](https://awesome-repositories.com/f/data-databases/lazy-evaluation-frameworks/tensor-execution.md) — Defers the evaluation of mathematical operations until necessary to allow for graph-level optimizations and efficient memory management.
