FuseTeacher

November 26, 2024 ยท View on GitHub

Dataset

  • The YFCC15M-Cap dataset has been released: YFCC15M-Cap
  • The Union23M and Union65M datasets are coming soon.

Training

Train FuseTeacher on YFCC15M-Cap dataset:

sh train.sh

Citation

If you find this repository useful, please consider citing our paper:

@inproceedings{FuseTeacher2024,
  title = {FuseTeacher: Modality-fused Encoders are Strong Vision Supervisors},
  author = {Xie, Chen-Wei and Sun, Siyang and Zhao, Liming and Li, Pandeng and Ma, Shuailei and Zheng, Yun},
  booktitle = {ECCV},
  year = {2024}
}

Some code is borrowed from ALBEF, CLIP, and timm. Thanks a lot to them.