TPT for VideoQA

May 19, 2022

This is the PyTorch implementation of our paper "Temporal Pyramid Transformer with Multimodal Interaction for Video Question Answering".

Reference

@article{peng2021,
     title={Temporal Pyramid Transformer with Multimodal Interaction for Video Question Answering},
     author={Peng, Min and Wang, Chongyang and Gao, Yuan and Shi, Yu and Zhou, Xiang-Dong},
     year={2021}
}