30 open-source projects similar to hustvl/lightningdit, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best LightningDiT alternative.
Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing
CVPR 2025 Highlight X-Dyna: Expressive Dynamic Human Image Animation
CVPR 2025 Consistent and Controllable Image Animation with Motion Diffusion Models
CVPR 2025 Highlight🔥 Identity-Preserving Text-to-Video Generation by Frequency Decomposition
CVPR2025 We present SleeperMark, a novel framework designed to embed resilient watermarks into T2I diffusion models
CVPR 2025 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".
This project is a computer vision benchmark and image classification dataset used to measure and compare the accuracy of machine learning models. It provides a standardized collection of labeled fashion product images and training data formatted to be compatible with the MNIST dataset structure. The dataset consists of fixed-dimension grayscale images and label-based category mappings, stored in a binary format. It includes pre-split training and testing sets and a static distribution to ensure consistent cross-model benchmarking. The repository supports image classification benchmarking and
Project page for End-to-end Recovery of Human Shape and Pose
Angjoo Kanazawa \ , Shubham Tulsiani \ , Alexei A. Efros, Jitendra Malik
Code for "Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks", Gupta et al, CVPR 2018
The project is an official implement of our CVPR2018 paper "Deep Back-Projection Networks for Super-Resolution" (Winner of NTIRE2018 and PIRM2018)
Attention-based Deep Multiple Instance Learning
记录每天整理的计算机视觉/深度学习/机器学习相关方向的论文
3DMV jointly combines RGB color and geometric information to perform 3D semantic segmentation of RGB-D scans. This work is based on our ECCV'18 paper, 3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation.
Contains implementation of Guided Attention Inference Network (GAIN) presented in Tell Me Where to Look(CVPR 2018). This repository aims to apply GAIN on fcn8 architecture used for segmentation.
CVPR2018 - pixel embedding & grouping for structured prediction, e.g., instance segmentation
Code for PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning
Code for Piggyback: Adapting a Single Network to Multiple Tasks by Learning to Mask Weights
Code for our CVPR 2018 paper: "Synthesizing Images of Humans in Unseen Poses"
Maxim Berman, Amal Rannen Triki, Matthew B. Blaschko
This is the implementation of Hierarchical Long-term Video Prediction without Supervision, to be published in ICML 2018.
Open source release of the evaluation benchmark suite described in "Realistic Evaluation of Deep Semi-Supervised Learning Algorithms"
PlaneNet: Piece-wise Planar Reconstruction from a Single RGB Image
Official implementation of GANimation. In this work we introduce a novel GAN conditioning scheme based on Action Units (AU) annotations, which describe in a continuous manifold the anatomical facial movements defining a human expression. Our approach permits controlling the magnitude of…
This is re-implementation of "ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices"
PyTorch implementation of "Efficient Neural Architecture Search via Parameters Sharing"
Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples