dataset.md
March 29, 2024
Data
Stage 1: Pre-training Dataset
- Please download the transformed annotations of each dataset from Stage-1 Training Annotations.
- Please download the images from the official sources.
| Data | Source | Data | Source |
|---|---|---|---|
| COCO 2014 | Download | VOCdevkit | Download |
| COCO 2017 | Download | DocBank | Download |
| Visual Genome | Download | DocLayNet | Download |
| Objects365 | Download | PubLayNet | Download |
| Open Images | Download | CurvedSynText150k | Download |
| V3Det | Download | ICDAR2013 | Download |
| ADE20k | Download | MLT2017 | Download |
| Cityscapes | Download | MLT2019 | Download |
| cocostuff 10k | Download | TotalText | Download |
| cocostuff 164k | Download | AITW | Download |
Important notice: The Visual Genome image directory must contain all VG images, from both VG_100K and VG_100K_2. Merge the images from the VG_100K and VG_100K_2 folders into a single folder.
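The merge described in the notice above can be scripted. The following is a minimal sketch, not part of the official tooling: the function name `merge_image_folders` and any folder paths you pass to it are hypothetical, and it assumes the two Visual Genome folders contain only flat image files with no name collisions between them.

```python
import shutil
from pathlib import Path


def merge_image_folders(sources, target):
    """Move every file from each source folder into one target folder.

    Returns the number of files moved. A file whose name already exists
    in the target is skipped rather than overwritten.
    """
    target = Path(target)
    target.mkdir(parents=True, exist_ok=True)
    moved = 0
    for source in sources:
        # sorted() gives a deterministic order and snapshots the listing
        # before any file is moved.
        for path in sorted(Path(source).iterdir()):
            destination = target / path.name
            if path.is_file() and not destination.exists():
                shutil.move(str(path), str(destination))
                moved += 1
    return moved
```

For example, `merge_image_folders(["VG_100K", "VG_100K_2"], "VG_all")` would gather both folders into a `VG_all` folder (a placeholder name). Files are moved rather than copied to avoid doubling disk usage; swap in `shutil.copy2` if you prefer to keep the original folders intact.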
- In each annotation JSON file, update the image path to reflect the location of the downloaded image data.
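Updating the image paths by hand is tedious for large annotation files. The sketch below shows one possible approach under stated assumptions: the layout of the annotation JSON is not documented here, so instead of relying on a particular key it replaces a path prefix in every string value that starts with it. The function names and both prefixes are hypothetical.

```python
import json
from pathlib import Path


def replace_prefix(value, old_prefix, new_prefix):
    """Recursively replace old_prefix at the start of every string in a JSON value."""
    if isinstance(value, str):
        if value.startswith(old_prefix):
            return new_prefix + value[len(old_prefix):]
        return value
    if isinstance(value, list):
        return [replace_prefix(item, old_prefix, new_prefix) for item in value]
    if isinstance(value, dict):
        return {key: replace_prefix(item, old_prefix, new_prefix)
                for key, item in value.items()}
    # Numbers, booleans, and null are returned unchanged.
    return value


def update_annotation_file(json_path, old_prefix, new_prefix):
    """Rewrite one annotation file in place so it points at the new image root."""
    json_path = Path(json_path)
    data = json.loads(json_path.read_text(encoding="utf-8"))
    updated = replace_prefix(data, old_prefix, new_prefix)
    json_path.write_text(json.dumps(updated, ensure_ascii=False), encoding="utf-8")
```

Because the replacement touches any string that starts with `old_prefix`, choose a prefix specific enough (for example, the full original image root including the trailing slash) that it cannot match captions or other text fields. Back up the annotation files before rewriting them in place.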
Stage 2: Fine-tuning Dataset
- Please download the transformed annotations of each dataset from Stage-2 Training Annotations.
- Please download the images from the official sources. The datasets used in stages beyond Stage 1 are listed below:
| Data | Source | Data | Source |
|---|---|---|---|
| OpenPsgGCG | Download | SeeClick | Download |
| GRIT | Download | Multi-Panel | Download |
| Flickr30K | Download | Osprey-724K | Download |
| M6Doc | Download | LaionGPT4v | Download |
| VCR | Download | ShareGPT4v | Download |
- In each annotation JSON file, update the image path to reflect the location of the downloaded image data.
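After the image paths have been updated, it can help to confirm that every referenced image actually exists before starting training. The following is a rough sketch, assuming image paths appear somewhere in the annotation JSON as strings ending in a common image extension; the function names and the extension list are hypothetical choices, not part of the released annotations.

```python
import json
from pathlib import Path

# Assumed set of extensions; extend it if your datasets use other formats.
IMAGE_SUFFIXES = (".jpg", ".jpeg", ".png")


def iter_image_paths(value):
    """Yield every string in a JSON value that looks like an image path."""
    if isinstance(value, str):
        if value.lower().endswith(IMAGE_SUFFIXES):
            yield value
    elif isinstance(value, list):
        for item in value:
            yield from iter_image_paths(item)
    elif isinstance(value, dict):
        for item in value.values():
            yield from iter_image_paths(item)


def find_missing_images(json_path):
    """Return the sorted, de-duplicated image paths in a file that do not exist on disk."""
    data = json.loads(Path(json_path).read_text(encoding="utf-8"))
    return sorted({p for p in iter_image_paths(data) if not Path(p).is_file()})
```

An empty list from `find_missing_images` suggests the paths in that annotation file resolve correctly. Relative paths are checked against the current working directory, so run the check from the directory the training code will use.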