SOTAVerified

Zero-Shot Video Question Answer

This task present the results of Zeroshot Question Answer results on TGIF-QA dataset for LLM powered Video Conversational Models.

Papers

Showing 7685 of 85 papers

TitleStatusHype
Verbs in Action: Improving verb understanding in video-language modelsCode0
ViperGPT: Visual Inference via Python Execution for ReasoningCode3
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function ApproximationCode0
InternVideo: General Video Foundation Models via Generative and Discriminative LearningCode4
0/1 Deep Neural Networks via Block Coordinate Descent0
Zero-Shot Video Question Answering via Frozen Bidirectional Language ModelsCode1
Flamingo: a Visual Language Model for Few-Shot LearningCode4
MVB: A Large-Scale Dataset for Baggage Re-Identification and Merged Siamese NetworksCode0
ActivityNet-QA: A Dataset for Understanding Complex Web Videos via Question AnsweringCode0
TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question AnsweringCode0
Show:102550
← PrevPage 4 of 4Next →

No leaderboard results yet.