[ICLR 2025 ๐Ÿ”ฅ] TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval

February 13, 2025 ยท View on GitHub

MSRVTT

For MSRVTT, the official data and video links can be found in link.

For the convenience, the splits and captions can be found in sharing from CLIP4Clip,

wget https://github.com/ArrowLuo/CLIP4Clip/releases/download/v0.0/msrvtt_data.zip

Besides, the raw videos can be found in sharing from Frozen in Time, i.e.,

wget https://www.robots.ox.ac.uk/~maxbain/frozen-in-time/data/MSRVTT.zip

Train on MSR-VTT

We conduct experiments on 4 A100x40G GPUs on MSR-VTT, in 2.2.0+cu118 Pytorch.

bash scripts/MSRVTT.sh