| iReason: Multimodal Commonsense Reasoning using Videos and Natural Language with Interpretability | Jun 25, 2021 | Bias DetectionQuestion Answering | —Unverified | 0 |
| Attend What You Need: Motion-Appearance Synergistic Networks for Video Question Answering | Jun 19, 2021 | AI AgentQuestion Answering | CodeCode Available | 0 |
| Learning to Rehearse in Long Sequence Memorization | Jun 2, 2021 | MemorizationQuestion Answering | —Unverified | 0 |
| Relation-aware Hierarchical Attention Framework for Video Question Answering | May 13, 2021 | Question AnsweringRelation | CodeCode Available | 0 |
| Bridge to Answer: Structure-aware Graph Interaction Network for Video Question Answering | Apr 29, 2021 | Question AnsweringVideo Question Answering | —Unverified | 0 |
| Object-Centric Representation Learning for Video Question Answering | Apr 12, 2021 | ObjectQuestion Answering | —Unverified | 0 |
| FIBER: Fill-in-the-Blanks as a Challenging Video Understanding Evaluation Framework | Apr 9, 2021 | Language ModellingMultiple-choice | CodeCode Available | 0 |
| Video Question Answering with Phrases via Semantic Roles | Apr 8, 2021 | Question AnsweringVideo Question Answering | —Unverified | 0 |
| CUPID: Adaptive Curation of Pre-training Data for Video-and-Language Representation Learning | Apr 1, 2021 | Question AnsweringRepresentation Learning | —Unverified | 0 |
| AGQA: A Benchmark for Compositional Spatio-Temporal Reasoning | Mar 30, 2021 | Question AnsweringVideo Question Answering | —Unverified | 0 |
| HySTER: A Hybrid Spatio-Temporal Event Reasoner | Jan 17, 2021 | Inductive logic programmingQuestion Answering | —Unverified | 0 |
| Recent Advances in Video Question Answering: A Review of Datasets and Methods | Jan 15, 2021 | Information RetrievalMachine Translation | —Unverified | 0 |
| End-to-End Video Question-Answer Generation with Generator-Pretester Network | Jan 5, 2021 | Answer GenerationQuestion-Answer-Generation | CodeCode Available | 0 |
| HAIR: Hierarchical Visual-Semantic Relational Reasoning for Video Question Answering | Jan 1, 2021 | Question AnsweringRelational Reasoning | —Unverified | 0 |
| Env-QA: A Video Question Answering Benchmark for Comprehensive Understanding of Dynamic Environments | Jan 1, 2021 | Question AnsweringVideo Question Answering | —Unverified | 0 |
| Video Question Answering Using Language-Guided Deep Compressed-Domain Video Feature | Jan 1, 2021 | Question AnsweringVideo Compression | —Unverified | 0 |
| Trying Bilinear Pooling in Video-QA | Dec 18, 2020 | Question AnsweringVideo Question Answering | —Unverified | 0 |
| On Modality Bias in the TVQA Dataset | Dec 18, 2020 | Question AnsweringVideo Question Answering | CodeCode Available | 0 |
| Open-Ended Multi-Modal Relational Reasoning for Video Question Answering | Dec 1, 2020 | Question AnsweringRelational Reasoning | CodeCode Available | 0 |
| iPerceive: Applying Common-Sense Reasoning to Multi-Modal Dense Video Captioning and Video Question Answering | Nov 16, 2020 | Common Sense ReasoningDense Video Captioning | —Unverified | 0 |
| ActBERT: Learning Global-Local Video-Text Representations | Nov 14, 2020 | Action SegmentationQuestion Answering | CodeCode Available | 0 |
| Co-attentional Transformers for Story-Based Video Understanding | Oct 27, 2020 | Question AnsweringVideo Question Answering | —Unverified | 0 |
| Hierarchical Conditional Relation Networks for Multimodal Video Question Answering | Oct 18, 2020 | Question AnsweringRelation | —Unverified | 0 |
| Self-supervised pre-training and contrastive representation learning for multiple-choice video QA | Sep 17, 2020 | Auxiliary LearningContrastive Learning | —Unverified | 0 |
| Data augmentation techniques for the Video Question Answering task | Aug 22, 2020 | Data AugmentationQuestion Answering | —Unverified | 0 |