I3D, I3D Nonlocal

August 2, 2021 ยท View on GitHub

Use following model configs.

  • model_configs/i3d_resnet50.py
  • model_configs/i3dnonlocal_resnet50.py

Kinetics

Reference: facebookresearch/SlowFast

architecturedepthpretrainframe length x sample ratetop1top5modelconfig
I3DR50-8 x 873.590.8linkKinetics/c2/I3D_8x8_R50
I3D NLNR50-8 x 874.091.1linkKinetics/c2/I3D_NLN_8x8_R50

Reference: facebookresearch/video-nonlocal-net

scriptinput framesfreeze bn?3D conv?non-local?top1in papertop5modellogs
run_i3d_baseline_400k_32f.sh32-Yes-73.673.390.8linklink
run_i3d_nlnet_400k_32f.sh32-YesYes74.974.991.6linklink

HMDB-51

This model is trained with PyVideoAI.
Top1/5 accuracy is calculated using 1 spatial centre crop and 5 temporal crops.
dataset_configs/hmdb.py
model_configs/i3d_resnet50.py
exp_configs/hmdb/i3d_resnet50-crop224_lr0001_batch8_8x8_largejit_plateau_1scrop5tcrop_split1.py

architecturePretrainframe length x sampling strideTop1 (highest/last)Top5 (highest/last)config, logmodel (last)TensorBoard
I3D-ResNet50Kinetics8 x 873.20 / 72.9494.05 / 94.05linklinklink

ImageNet

Reference: facebookresearch/SlowFast

architecturedepthTop1Top5model
ResNetR5023.66.8link