Dataset Preparation
August 19, 2025
- Osprey-724K 🤗 download
| Data | Size |
|---|---|
| osprey_short_form.json | 57 MB |
| osprey_conversation.json | 106 MB |
| osprey_detail_description.json | 63.4 MB |
| osprey_part_level.json | 153 MB |
| osprey_lvis_positive_negative.json | 140 MB |
- COCO: train2017, `imgs` should contain all the images, including the training set and the validation set.
- pascal_part: train.json, VOCdevkit.
- partImagenet: train_format.json, PartImageNet_OOD.
- refcocos: refcoco, refcoco+.
- vg: vg_train_with_mask.json (masks are generated with HQ-SAM); images can be downloaded from OpendataLab, and `image` should contain all the vg images (VG_100K and VG_100K_2).
- vcr: vcr.
After downloading all of them, organize the data as follows in ./data:
├── coco
│   ├── annotations
│   │   └── instances_train2017.json
│   └── imgs
├── part data
│   ├── pascal_part
│   │   ├── train.json
│   │   └── VOCdevkit
│   └── partImagenet
│       ├── train_format.json
│       └── train
├── refcocos
│   ├── finetune_refcoco_train_with_mask.json
│   └── finetune_refcoco+_train_with_mask.json
├── Osprey-724K
│   ├── osprey_short_form.json
│   ├── osprey_conversation.json
│   ├── osprey_detail_description.json
│   ├── osprey_part_level.json
│   └── osprey_lvis_positive_negative.json
├── vg
│   ├── vg_train_with_mask.json
│   └── image
└── vcr
    ├── train.jsonl
    └── vcr1images
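
Before starting training, it can help to confirm that the layout under ./data matches the tree above. The following is a minimal sketch, not part of the repository: the `EXPECTED_PATHS` list and the `find_missing` helper are hypothetical names introduced here, and the paths are copied from the tree as written. It only checks that each entry exists; it does not validate file contents or image counts.

```python
from pathlib import Path

# Relative paths copied from the directory tree above (assumed to be complete).
EXPECTED_PATHS = [
    "coco/annotations/instances_train2017.json",
    "coco/imgs",
    "part data/pascal_part/train.json",
    "part data/pascal_part/VOCdevkit",
    "part data/partImagenet/train_format.json",
    "part data/partImagenet/train",
    "refcocos/finetune_refcoco_train_with_mask.json",
    "refcocos/finetune_refcoco+_train_with_mask.json",
    "Osprey-724K/osprey_short_form.json",
    "Osprey-724K/osprey_conversation.json",
    "Osprey-724K/osprey_detail_description.json",
    "Osprey-724K/osprey_part_level.json",
    "Osprey-724K/osprey_lvis_positive_negative.json",
    "vg/vg_train_with_mask.json",
    "vg/image",
    "vcr/train.jsonl",
    "vcr/vcr1images",
]


def find_missing(data_root="./data"):
    """Return the expected relative paths that do not exist under data_root."""
    root = Path(data_root)
    return [rel for rel in EXPECTED_PATHS if not (root / rel).exists()]


if __name__ == "__main__":
    missing = find_missing()
    if missing:
        print("Missing entries under ./data:")
        for rel in missing:
            print("  " + rel)
    else:
        print("All expected files and directories are present.")
```

Running the script from the repository root lists any entries that still need to be downloaded or moved into place.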