SVQA

July 2, 2018 ยท View on GitHub

The SVQA(Synthetic Video Question Answering) dataset contains 12000 videos and around 120k QA pairs. Videos and QA pairs are all generated automatically with minimal language biases and clearly defined question categories. The dataset can facilitate the analysis on models reasoning skills.

You can download the dataset from this link.

Video and QA Pair Examples

QA CategoryQuestionAnswerVideo(GIF)
Attribute Comparisonno
Count5
Queryblue
Integer Comparisonno
Existyes

Statistics of SVQA

Question CategorySub CategoryTrainValTest
Count1932027605520
Exist67209601920
QueryColor756010562160
Size756010562160
Action Type67209361920
Direction756010562160
Shape756010562160
Integer ComparisonMore2520600720
Equal2520600720
Less2520600720
Attribute ComparisonColor2520216720
Size2520216720
Action Type2520216720
Direction2520216720
Shape2520216720
Total QA pairs831601188023760
Total Videos840012002400