Data Setup

March 1, 2022 ยท View on GitHub

We provide access to our preprocessed data (including extracted features) and preprocessing scripts to replicate our setup.

Preprocessed Data

More recent (from IGLUE) and with more backbones:

NB: I have noticed that uploading LMDB files made their size grow to the order of TBs. So, instead, I recently uploaded the H5 versions that can quickly be converted to LMDB locally using this script.

Preprocessing Steps

I originally relied on Hao Tan's airsplay/bottom-up-attention Docker image to extract image features from Faster R-CNN. For more details about the Docker image, see the LXMERT repository.

Recently, I have switched to Hao Tan's Detectron2 implementation of 'Bottom-up feature extractor', which is compatible with the original Caffe implementation. See here for step-by-step instructions.

Moreover, it is possible to extract Faster R-CNN features with a ResNeXt-101 backbone from the mmf repository following these instructions.


For detailed preprocessing procedures, check out the README files for each data set in this folder or under feature_extraction/.