Code for the Grounded Visual Question Answering (GVQA) model from the paper below:
This is the code we wrote to train the state-of-the-art VQA models described in our paper. Our ensemble of 7 models obtained 66.67% accuracy on the real open-ended test-dev split and 70.24% on the real multiple-choice test-dev split.
This is the code for the NeurIPS 2019 article available here: https://arxiv.org/abs/1906.10169.