SIMMIM.md

March 1, 2025 ยท View on GitHub

Experiments on SimMIM

Preparation

Download the teacher Swin-Small to <BASE_FOLDER>/checkpoints/swin_small_800e.pth.tar

Pre-Training

# 1024 batch-sizes on 8 GPUs:
cd projects/simmim
ssl_train -b 2048 -d 0-7 -e 400 -f ./simmim_exp.py --amp \
--exp-options exp_name=simmim/simmim_swin_tiny_400e

Pre-Training with Decoupled Distillation

# 1024 batch-sizes on 8 GPUs:
cd projects/simmim
ssl_train -b 2048 -d 0-7 -e 400 -f ./simmim_distill_exp.py --amp --exp-options \
teacher_ckpt_path="<BASE_FOLDER>/checkpoints/swin_small_800e.pth.tar" exp_name=simmim/simmim_swin_tiny_d2_distill_400e

Fine-Tuning

# 1024 batch-sizes on 8 GPUs:
cd projects/eval_tools
ssl_train -b 1024 -f finetuning_swin_exp.py --amp [--ckpt <checkpoint-path>] \
--exp-options pretrain_exp_name=simmim/simmim_swin_tiny_d2_distill_400e

Main Results

pre-train codepre-train
epochs
fine-tune recipefine-tune epochaccuracyckpt
simmim400---link
simmim400impr.30077.4link
simmim_d2_distill400---link
simmim_d2_distill400impr.30077.8link