SOTAVerified

TGIF-Frame

Papers

Showing 1115 of 15 papers

TitleStatusHype
Zero-Shot Video Question Answering via Frozen Bidirectional Language ModelsCode1
Lightweight Recurrent Cross-modal Encoder for Video Question AnsweringCode0
VLAB: Enhancing Video Language Pre-training by Feature Adapting and Blending0
MuLTI: Efficient Video-and-Language Understanding with Text-Guided MultiWay-Sampler and Multiple Choice Modeling0
HiTeA: Hierarchical Temporal-Aware Video-Language Pre-training0
Show:102550
← PrevPage 2 of 2Next →

No leaderboard results yet.