SOTAVerified

Zero-Shot Video Question Answer

This task presents zero-shot question answering results on the TGIF-QA dataset for LLM-powered video conversational models.

Papers

Showing 61-70 of 85 papers

Title | Status | Hype
OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data Generation | Code | 1
Agentic Keyframe Search for Video Question Answering | Code | 1
Self-Chained Image-Language Model for Video Localization and Question Answering | Code | 1
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function Approximation | Code | 0
ActivityNet-QA: A Dataset for Understanding Complex Web Videos via Question Answering | Code | 0
LLaVA-OneVision: Easy Visual Task Transfer | Code | 0
MVB: A Large-Scale Dataset for Baggage Re-Identification and Merged Siamese Networks | Code | 0
Question-Answering Dense Video Events | Code | 0
Question-Instructed Visual Descriptions for Zero-Shot Video Question Answering | Code | 0
TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering | Code | 0

No leaderboard results yet.