SOTAVerified

Zero-Shot Video Question Answer

This task present the results of Zeroshot Question Answer results on TGIF-QA dataset for LLM powered Video Conversational Models.

Papers

Showing 6170 of 85 papers

TitleStatusHype
BIMBA: Selective-Scan Compression for Long-Range Video Question AnsweringCode1
Zero-Shot and Few-Shot Video Question Answering with Multi-Modal PromptsCode1
Self-Chained Image-Language Model for Video Localization and Question AnsweringCode1
MoReVQA: Exploring Modular Reasoning Models for Video Question Answering0
CinePile: A Long Video Question Answering Dataset and Benchmark0
DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs0
ENTER: Event Based Interpretable Reasoning for VideoQA0
GPT-4o System Card0
GPT-4o: Visual perception performance of multimodal large language models in piglet activity understanding0
0/1 Deep Neural Networks via Block Coordinate Descent0
Show:102550
← PrevPage 7 of 9Next →

No leaderboard results yet.