Regularized Mask Tuning: Uncovering Hidden Knowledge in Pre-trained Vision-Language Models

October 19, 2023 · View on GitHub

This repository provides a new type of tuning method, termed as regularized mask tuning, which masks the network parameters through a learnable selection.

timeline.jpg

Overview of methods supported by our method:

[Paper]

Code Coming Soon

Experiments:

MethodImageNetCaltech101FGVCAircraftStanfordCarsFlowers102OxfordPetsFood101DTDEuroSATUCF101SUN397AverageGain
Zero-shot CLIP66.7392.9424.7265.3271.3489.2186.0644.3947.6066.7562.5065.23-
R-AMT73.0797.0058.4785.9398.1793.8087.4774.5791.8086.9376.4083.96+18.73
CoOP72.0195.4743.2982.9196.9391.9284.3369.2186.0582.2574.5879.90-
CoOP+R-AMT73.3596.7056.3785.6397.8393.2086.1373.0390.2086.8775.4583.16+3.26
TIP-Adapter73.0895.6345.2083.0496.1592.6687.3171.5788.5384.2476.2181.24-
TIP-Adapter+R-AMT74.2896.9761.0786.2797.8094.0787.4374.7791.5086.9376.9784.37+3.13

BibTeX

@inproceedings{RMT2023,
  title   = {Regularized Mask Tuning: Uncovering Hidden Knowledge in Pre-trained Vision-Language Models},
  author  = {Zheng, Kecheng and Wu, Wei and Feng$, Ruili and Zhu Kai and Liu, Jiawei and Zhao, Deli and Zha Zheng-Jun and Chen Wei and Shen, Yujun},
  booktitle = {ICCV},
  year    = {2023}
}

License

The project is under MIT License.