←BackZzhegan27/VILLA0Copy as MarkdownView on GitHub↗0 stars·0 forks·0 viewsVILLAFeaturesMultimodal Pretraining - Adversarial training for vision-and-language representations.Star history