| Video Question Answering for People with Visual Impairments Using an Egocentric 360-Degree Camera | May 30, 2024 | Question Answering, Video Question Answering | —Unverified | 0 | 0 |
| On Scaling Up a Multilingual Vision and Language Model | Jan 1, 2024 | document understanding, In-Context Learning | —Unverified | 0 | 0 |
| Video Question Answering on Screencast Tutorials | Aug 2, 2020 | Question Answering, Video Question Answering | —Unverified | 0 | 0 |
| Open-Ended Long-Form Video Question Answering via Hierarchical Convolutional Self-Attention Networks | Jun 28, 2019 | Answer Generation, Decoder | —Unverified | 0 | 0 |
| Video Question Answering Using CLIP-Guided Visual-Text Attention | Mar 6, 2023 | General Knowledge, Question Answering | —Unverified | 0 | 0 |
| CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions | Nov 16, 2021 | counterfactual, Descriptive | —Unverified | 0 | 0 |
| Overview of the MedVidQA 2022 Shared Task on Medical Video Question-Answering | May 1, 2022 | Question Answering, Video Classification | —Unverified | 0 | 0 |
| Overview of the NLPCC 2025 Shared Task 4: Multi-modal, Multilingual, and Multi-hop Medical Instructional Video Question Answering Challenge | May 11, 2025 | Multimodal Reasoning, Question Answering | —Unverified | 0 | 0 |
| Overview of TREC 2024 Medical Video Question Answering (MedVidQA) Track | Dec 15, 2024 | Image Captioning, Medical Question Answering | —Unverified | 0 | 0 |
| Video Question Answering Using Language-Guided Deep Compressed-Domain Video Feature | Jan 1, 2021 | Question Answering, Video Compression | —Unverified | 0 | 0 |
| Parameter-free Video Segmentation for Vision and Language Understanding | Mar 3, 2025 | Question Answering, Video Question Answering | —Unverified | 0 | 0 |
| Video Question Answering via Attribute-Augmented Attention Network Learning | Jul 20, 2017 | Attribute, Information Retrieval | —Unverified | 0 | 0 |
| Pegasus-v1 Technical Report | Apr 23, 2024 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries | Dec 26, 2024 | Question Answering, Video Question Answering | —Unverified | 0 | 0 |
| Contrastive Video-Language Learning with Fine-grained Frame Sampling | Oct 10, 2022 | Question Answering, Representation Learning | —Unverified | 0 | 0 |
| Perception Test 2023: A Summary of the First Challenge And Outcome | Dec 20, 2023 | Benchmarking, Grounded Video Question Answering | —Unverified | 0 | 0 |
| Perception Test 2024: Challenge Summary and a Novel Hour-Long VideoQA Benchmark | Nov 29, 2024 | Benchmarking, Grounded Video Question Answering | —Unverified | 0 | 0 |
| Continuous Perception Benchmark | Aug 15, 2024 | Question Answering, Video Question Answering | —Unverified | 0 | 0 |
| Composing Ensembles of Pre-trained Models via Iterative Consensus | Oct 20, 2022 | Arithmetic Reasoning, Image Generation | —Unverified | 0 | 0 |
| Commonsense Video Question Answering through Video-Grounded Entailment Tree Reasoning | Jan 9, 2025 | Benchmarking, Question Answering | —Unverified | 0 | 0 |
| PolySmart @ TRECVid 2024 Medical Video Question Answering | Dec 20, 2024 | Question Answering, Retrieval | —Unverified | 0 | 0 |
| Poze: Sports Technique Feedback under Data Constraints | Nov 8, 2024 | Pose Estimation, Question Answering | —Unverified | 0 | 0 |
| CogStream: Context-guided Streaming Video Question Answering | Jun 12, 2025 | Question Answering, Video Question Answering | —Unverified | 0 | 0 |
| Prompting Video-Language Foundation Models with Domain-specific Fine-grained Heuristics for Video Question Answering | Oct 12, 2024 | Question Answering, Video Question Answering | —Unverified | 0 | 0 |
| QTG-VQA: Question-Type-Guided Architectural for VideoQA Systems | Sep 14, 2024 | Question Answering, Video Question Answering | —Unverified | 0 | 0 |