# yisol/idm-vton

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/yisol-idm-vton).**

4,881 stars · 799 forks · Python · other

## Links

- GitHub: https://github.com/yisol/IDM-VTON
- Homepage: https://idm-vton.github.io/
- awesome-repositories: https://awesome-repositories.com/repository/yisol-idm-vton.md

## Description

IDM-VTON is an AI virtual try-on framework and fashion synthesis tool designed to generate realistic images of people wearing specific garments. It operates as a diffusion-based image generator that blends garment textures with human poses to create synthetic fashion imagery.

The system implements virtual fitting room capabilities through a generative model that combines person and clothing inputs. It includes a web-based interface to run interactive visual demonstrations and synthesize try-on images in real-time.

The framework covers the broader domain of AI fashion visualization, enabling the creation of digital previews of clothing items on human bodies.

## Tags

### Part of an Awesome List

- [Virtual Try-On Systems](https://awesome-repositories.com/f/awesome-lists/ai/virtual-try-on-systems.md) — Implements a deep learning framework for synthesizing realistic images of people wearing specific garments. ([source](https://cdn.jsdelivr.net/gh/yisol/idm-vton@main/README.md))
- [Fashion Image Synthesis](https://awesome-repositories.com/f/awesome-lists/ai/fashion-image-synthesis.md) — Acts as a computational tool for generating synthetic fashion imagery through texture and pose blending.

### Artificial Intelligence & ML

- [Fashion Visualization](https://awesome-repositories.com/f/artificial-intelligence-ml/fashion-visualization.md) — Enables the creation of digital visual previews of clothing items on human bodies.
- [Image-to-Image Diffusion Toolkits](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/diffusion-visual-models/generative-ai-pipelines/text-to-image-generators/image-inpainting/image-to-image-diffusion-toolkits.md) — Provides a diffusion-based toolkit for synthesizing high-fidelity try-on images from text and image prompts.
- [Latent Diffusion Models](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/diffusion-visual-models/generative-models/latent-diffusion-models.md) — Utilizes latent diffusion models to create high-fidelity clothing overlays on human figures.
- [Spatial Correlation Alignments](https://awesome-repositories.com/f/artificial-intelligence-ml/feature-alignment/spatial-correlation-alignments.md) — Implements attention-based feature alignment to correlate garment patches with human body geometry.
- [Latent Space Encoders](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/diffusion-visual-models/generative-ai-models/latent-space-generative-models/latent-space-projections/latent-space-encoders.md) — Utilizes latent space encoders to process compressed image representations for reduced computational cost.
- [Cross-Attention Conditioning](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/diffusion-visual-models/generative-ai-pipelines/text-to-video-generators/cross-attention-conditioning.md) — Steers the image generation network by injecting garment-specific embeddings via cross-attention conditioning.
- [U-Net Architectures](https://awesome-repositories.com/f/artificial-intelligence-ml/neural-network-architectures/u-net-architectures.md) — Employs a U-Net architecture with skip connections to recover spatial details during the denoising process.

### Graphics & Multimedia

- [Virtual Fitting Rooms](https://awesome-repositories.com/f/graphics-multimedia/room-visualizers/room-cleaning-queues/virtual-fitting-rooms.md) — Provides the underlying generative model to implement a digital virtual fitting room experience.
- [Mask-Based Area Replacement](https://awesome-repositories.com/f/graphics-multimedia/media-processing-analysis/face-portrait-manipulation/image-masking/mask-based-area-replacement.md) — Uses spatial masks to isolate garment and person areas for precise generative replacement.

### User Interface & Experience

- [Virtual Try-On Interfaces](https://awesome-repositories.com/f/user-interface-experience/virtual-try-on-interfaces.md) — Provides a web-based application interface for real-time visual demonstrations of garment try-ons.
