Pretrained Models

November 13, 2020 ยท View on GitHub

This document contains the models to reproduce of results by using the are searched archtectures from MMnas. The prototxt files for mmnas architecture for different tasks can be found in the arch folder. We provide pretrained models on different tasks to reproduce the results reported in our paper. You can download these ckpt files, place them at logs/ckpts/, and run the train_[vqa/vgd/itm].py to evaluate the performance.

VQA

We train the mmnas_vqa on the train+val+vg split and evaluated the model on the test-dev split. The pretrained model can be downloaded here. For comparsion, we also report the results from previous state-of-the-art mcan model.

ModelBase lrOverall (%)Yes/No (%)Number (%)Other (%)
mcan1e-470.6987.0853.1660.66
mmnas1e-471.2587.2055.6361.15

VGD

We use the same mmnas_vgd archtecture for all the three datasets and then train the model for each dataset independently. The pretrained models on three datasets can be downloaded as follows.

RefCOCO RefCOCO+ RefCOCORg
ValTestATestB
83.66%87.25%78.78
ValTestATestB
74.48%81.00%65.15%
ValTest
74.59%75.42%
model model model

ITM

Using the mmnas_itm archtecture, we obtain the model to report the following results. The pretrained model can be downloaded here.

Text Retrival Image Retrival
R@1R@5R@10
77.3093.5097.10
R@1R@5R@10
60.8884.8690.40