STMixer

May 18, 2023 ยท View on GitHub

This repository gives the official PyTorch implementation of STMixer: A One-Stage Sparse Action Detector (CVPR 2023)

Installation

  • PyTorch == 1.8 or 1.12 (other versions are not tested)
  • tqdm
  • yacs
  • opencv-python
  • tensorboardX
  • SciPy
  • fvcore
  • timm
  • iopath

Data Preparation

Please refer to PySlowFast DATASET.md for AVA dataset preparation.

Model Zoo

BackboneConfigPre-train ModelFramesSampling RateModel
SlowOnly-R50cfgK400416Link
SlowFast-R50cfgK40088Link
SlowFast-R101-NLcfgK60088Link
ViT-B(VideoMAE)cfgK400164Link
ViT-B(VideoMAEv2)cfgK710+K400164Link

Training

python -m torch.distributed.launch --nproc_per_node=8 train_net.py --config-file "config_files/config_file.yaml" --transfer --no-head --use-tfboard

Validation

python -m torch.distributed.launch --nproc_per_node=8 test_net.py --config-file "config_files/config_file.yaml" MODEL.WEIGHT "/path/to/model"

Acknowledgements

We would like to thank Ligeng Chen for his help in drawing the figures in the paper and thank Lei Chen for her surpport in experiments. This project is built upon AlphaAction, AdaMixer and PySlowFast. We also reference and use some code from SparseR-CNN, WOO and VideoMAE. Very sincere thanks to the contributors to these excellent codebases.

Citation

If this project helps you in your research or project, please cite our paper:

@inproceedings{wu2023stmixer,
      title={STMixer: A One-Stage Sparse Action Detector}, 
      author={Tao Wu and Mengqi Cao and Ziteng Gao and Gangshan Wu and Limin Wang},
      booktitle={{CVPR}},
      year={2023}
}