Setup
March 10, 2022 · View on GitHub
Create two folders imagenet_info and text_info in the current project directory
/path/to/DeCLIP/
├── docs/
├── experiments/
├── linklink/
├── prototype/
├── text_info/
├── imagenet_info/
...
Pretrain Dataset
YFCC15M Setup
-
First Download our YFCC15M label file - Google Driver and put it into
imagenet_infodir -
Download Image data, You have two ways to download Image data:
- DownLoad by labels: Crawl the image by the url in label dirctely.
- Filter by label: Download offical YFCC100M data, and Prepare the YFCC15M subset metadata pickle by the label.
Text
- Download our vocab file for Text encoder Google Driver
- put it into
text_infodir
Downstream Dataset
Imagenet Setup
- DownLoad offical ImageNet Dataset
- DownLoad our ImageNet validation label file - Google Driver
- put it into
imagenet_infodir