Dataset Preparation for Evaluation

March 19, 2025

We provide scripts to download and prepare the datasets used for evaluation: DAVIS, DyCheck, ADT, and TUM-dynamics.

Note

The scripts provided here are for reference only. Please ensure you have obtained the necessary licenses from the original dataset providers before proceeding.

Download and Prepare Datasets

DAVIS

To download and prepare the DAVIS dataset, execute:

cd data
python download_prepare_davis.py
cd ..

DyCheck

Download the DyCheck dataset preprocessed by Shape of Motion into the data directory, then execute:

cd data
python prepare_iphone.py
cd ..

ADT

To download the ADT dataset, follow the TAPVid-3D instructions to set up the TAPVid environment, then execute:

cd data
conda activate TAPVid
python download_adt.py
cd ..

To prepare the ADT dataset, execute:

cd data
conda activate easi3r
python prepare_adt.py
cd ..

TUM-dynamics

To download the TUM-dynamics dataset, execute:

cd data
bash download_tum.sh
cd ..

To prepare the TUM-dynamics dataset, execute:

cd data
python prepare_tum.py
cd ..
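
The preparation steps above can be collected into one lookup, which makes the per-dataset order explicit. The following is a minimal sketch, not part of the repository: `prep_steps` is a hypothetical helper that only prints the commands (run from inside data) rather than executing them. It assumes `conda run -n <env>` is an acceptable substitute for `conda activate <env>` in the ADT steps, and that the remaining scripts run in whatever environment is already active.

```shell
# Hypothetical helper: print the preparation commands for one dataset.
# Nothing is executed; the commands are meant to be run from inside data/.
prep_steps() {
    case "$1" in
        davis)
            echo "python download_prepare_davis.py" ;;
        dycheck)
            # Assumes the Shape of Motion copy was already downloaded by hand.
            echo "python prepare_iphone.py" ;;
        adt)
            # Download and preparation use two different conda environments.
            echo "conda run -n TAPVid python download_adt.py"
            echo "conda run -n easi3r python prepare_adt.py" ;;
        tum)
            echo "bash download_tum.sh"
            echo "python prepare_tum.py" ;;
        *)
            echo "unknown dataset: $1" >&2
            return 1 ;;
    esac
}

# List every step, grouped by dataset.
for d in davis dycheck adt tum; do
    echo "# $d"
    prep_steps "$d"
done
```

Printing instead of executing keeps the sketch safe to try; once the list looks right, the lines can be copied into a terminal one at a time.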

Evaluation Script

DAVIS

To evaluate on the DAVIS dataset, execute:

CUDA_VISIBLE_DEVICES=4,5,6,7 torchrun --nproc_per_node=4 --master_port=29604 launch.py \
    --mode=eval_pose \
    --pretrained="checkpoints/DUSt3R_ViTLarge_BaseDecoder_512_dpt.pth"   \
    --eval_dataset=davis --output_dir="results/davis/easi3r_dust3r" \
    --use_atten_mask
# To change the backbone, use: --pretrained="checkpoints/MonST3R_PO-TA-S-W_ViTLarge_BaseDecoder_512_dpt.pth"
# To use SAM2, add: --sam2_mask_refine
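
The device indices 4,5,6,7 in the command above are specific to one machine, and --nproc_per_node should match the number of visible devices. As a minimal sketch, assuming a hypothetical helper `count_gpus` and a `GPUS` variable (neither is part of the repository), the process count can be derived from the device list:

```shell
# Hypothetical helper: count the entries in a comma-separated device list,
# e.g. "4,5,6,7" -> 4.
count_gpus() {
    if [ -z "$1" ]; then
        echo 0
        return
    fi
    # Keep only the commas; the number of devices is commas + 1.
    commas=$(printf '%s' "$1" | tr -cd ',')
    echo $(( ${#commas} + 1 ))
}

# GPUS is an assumed variable name; it defaults to the devices used above.
GPUS="${GPUS:-4,5,6,7}"
NPROC="$(count_gpus "$GPUS")"

# Print only the launcher prefix; append the flags from the command above.
echo "CUDA_VISIBLE_DEVICES=$GPUS torchrun --nproc_per_node=$NPROC --master_port=29604 launch.py"
```

For example, setting GPUS=0,1 before running the snippet prints a prefix with --nproc_per_node=2.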

If you only need the dynamic masks, execute:

CUDA_VISIBLE_DEVICES=4,5,6,7 torchrun --nproc_per_node=4 --master_port=29604 launch.py \
    --mode=eval_pose --n_iter 0 \
    --pretrained="checkpoints/MonST3R_PO-TA-S-W_ViTLarge_BaseDecoder_512_dpt.pth"   \
    --eval_dataset=davis --output_dir="results/davis/easi3r_monst3r_sam" \
    --use_atten_mask --sam2_mask_refine

The results are saved in the results/davis/easi3r_monst3r_sam folder. You can then run python mask_metric.py --results_path results/davis/easi3r_monst3r_sam to evaluate the masks, and python vis_attention.py --method_name easi3r_monst3r_sam --base_output_dir results/visualization to visualize the attention maps as shown on the project webpage.

DyCheck

To evaluate on the DyCheck dataset, execute:

CUDA_VISIBLE_DEVICES=4,5,6,7 torchrun --nproc_per_node=4 --master_port=29604 launch.py \
    --mode=eval_pose  --no_crop \
    --pretrained="checkpoints/DUSt3R_ViTLarge_BaseDecoder_512_dpt.pth"   \
    --eval_dataset=iphone --output_dir="results/iphone/easi3r_dust3r" \
    --use_atten_mask
# To change the backbone, use: --pretrained="checkpoints/MonST3R_PO-TA-S-W_ViTLarge_BaseDecoder_512_dpt.pth"
# To use SAM2, add: --sam2_mask_refine

The results are saved in the results/iphone/easi3r_dust3r folder. You can then run CUDA_VISIBLE_DEVICES=4 python point_metric.py --result_path results/iphone to evaluate the reconstruction results.

ADT

To evaluate on the ADT dataset, execute:

CUDA_VISIBLE_DEVICES=4,5,6,7 torchrun --nproc_per_node=4 --master_port=29604 launch.py \
    --mode=eval_pose \
    --pretrained="checkpoints/DUSt3R_ViTLarge_BaseDecoder_512_dpt.pth"   \
    --eval_dataset=adt --output_dir="results/adt/easi3r_dust3r" \
    --use_atten_mask
# To change the backbone, use: --pretrained="checkpoints/MonST3R_PO-TA-S-W_ViTLarge_BaseDecoder_512_dpt.pth"
# To use SAM2, add: --sam2_mask_refine

The results will be saved in the results/adt/easi3r_dust3r folder.

TUM-dynamics

To evaluate on the TUM-dynamics dataset, execute:

CUDA_VISIBLE_DEVICES=4,5,6,7 torchrun --nproc_per_node=4 --master_port=29604 launch.py \
    --mode=eval_pose \
    --pretrained="checkpoints/DUSt3R_ViTLarge_BaseDecoder_512_dpt.pth"   \
    --eval_dataset=tum --output_dir="results/tum/easi3r_dust3r" \
    --use_atten_mask
# To change the backbone, use: --pretrained="checkpoints/MonST3R_PO-TA-S-W_ViTLarge_BaseDecoder_512_dpt.pth"
# To use SAM2, add: --sam2_mask_refine

The results will be saved in the results/tum/easi3r_dust3r folder.
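
The four evaluation commands above differ only in the dataset name, the output directory, and the extra --no_crop flag for DyCheck (dataset name iphone). A minimal sketch of that pattern follows, assuming a hypothetical helper `build_eval_cmd` that is not part of the repository. It prints each command instead of running it, and omits the CUDA_VISIBLE_DEVICES prefix, which depends on your machine.

```shell
# Default backbone used in the commands above.
PRETRAINED="checkpoints/DUSt3R_ViTLarge_BaseDecoder_512_dpt.pth"

# Hypothetical helper: print the evaluation command for one dataset name
# (davis, iphone, adt, or tum). Nothing is executed.
build_eval_cmd() {
    dataset="$1"
    extra=""
    # Only the DyCheck run (dataset name "iphone") uses --no_crop above.
    if [ "$dataset" = "iphone" ]; then
        extra=" --no_crop"
    fi
    printf '%s\n' "torchrun --nproc_per_node=4 --master_port=29604 launch.py --mode=eval_pose${extra} --pretrained=\"$PRETRAINED\" --eval_dataset=$dataset --output_dir=\"results/$dataset/easi3r_dust3r\" --use_atten_mask"
}

# Print one command per dataset for review.
for d in davis iphone adt tum; do
    build_eval_cmd "$d"
done
```

Reviewing the printed lines before running them is a cheap way to catch a wrong checkpoint path or output directory.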