SOTAVerified

Zero-Shot Video Question Answer

This task presents zero-shot question answering results on the TGIF-QA dataset for LLM-powered video conversational models.

Papers

Showing 61–70 of 85 papers

Title | Status | Hype
OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data Generation | Code | 1
Self-Chained Image-Language Model for Video Localization and Question Answering | Code | 1
Zero-Shot Video Question Answering via Frozen Bidirectional Language Models | Code | 1
ENTER: Event Based Interpretable Reasoning for VideoQA | - | 0
VidCtx: Context-aware Video Question Answering with Image Models | Code | 0
GPT-4o System Card | - | 0
Video Instruction Tuning With Synthetic Data | - | 0
Question-Answering Dense Video Events | Code | 0
LLaVA-OneVision: Easy Visual Task Transfer | Code | 0
GPT-4o: Visual perception performance of multimodal large language models in piglet activity understanding | - | 0

No leaderboard results yet.