Prepare Datasets
May 6, 2026
We provide instructions for preparing the datasets used by LiteLVLM.
Download Datasets
For Referring Expression Segmentation, we adopt the COCO Referring Expression Comprehension datasets: RefCOCO, RefCOCO+, and RefCOCOg.
Download links and directory structure:
Download the data and organize it as follows:
data
├── Refer_Segm
│ ├── refcoco
│ │ ├── instances.json
│ │ ├── refs(google).p
│ │ ├── refs(unc).p
│ ├── refcoco+
│ ├── refcocog
│ ├── coco_2014
│ │ ├── train2014
│ │ │ ├── COCO_train2014_000000000009.jpg
│ │ │ ├── COCO_train2014_000000000025.jpg