README.md

June 6, 2025 ยท View on GitHub

Data

The table below shows the training annotations and their corresponding image and video sources download links:

Images (Label)

DatasetLink
LVIShttps://cocodataset.org/#download (train2017)
obj365https://www.objects365.org/overview.html
openimageshttps://storage.googleapis.com/openimages/web/index.html
PACOhttps://cocodataset.org/#download (train2017)
V3Dethttps://v3det.openxlab.org.cn/

Images (Caption)

DatasetLink
RefCOCOhttps://bvisionweb1.cs.unc.edu/licheng/referit/data/refcoco.zip
RefCOCO+https://bvisionweb1.cs.unc.edu/licheng/referit/data/refcoco+.zip
RefCOCOghttps://bvisionweb1.cs.unc.edu/licheng/referit/data/refcocog.zip
RefTexthttps://github.com/Buki2/STAN
Visual Genomehttps://homes.cs.washington.edu/~ranjay/visualgenome/index.html
GREShttps://cocodataset.org/#download (train2014)
Google_Refexphttps://cocodataset.org/#download (train2014)
Rexverse-2Mhttps://huggingface.co/datasets/IDEA-Research/Rexverse-2M

Videos

The table below shows the training annotations and their corresponding video sources download links. Note, for each video source (.mp4), please first refer to extract_mp4_frames.py to extract frames.

DatasetLink
A2Dhttps://kgavrilyuk.github.io/publication/actor_action/
BenSMOThttps://github.com/HengLan/SMOT
DAVIS17https://davischallenge.org/davis2017/code.html
HC-STVGhttps://github.com/tzhhhh123/HC-STVG
LV-VIShttps://github.com/haochenheheda/LVVIS
SA-Vhttps://ai.meta.com/datasets/segment-anything-video/
VidSTGhttps://github.com/Guaranteer/VidSTG-Dataset
YoutubeVOShttps://youtube-vos.org/