SOTAVerified

TGIF-Action

Papers

Showing 1-7 of 7 papers

Title | Status | Hype
All in One: Exploring Unified Video-Language Pre-training | Code | 2
MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models | Code | 1
An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling | Code | 1
Clover: Towards A Unified Video-Language Alignment and Fusion Model | Code | 1
HiTeA: Hierarchical Temporal-Aware Video-Language Pre-training | | 0
MuLTI: Efficient Video-and-Language Understanding with Text-Guided MultiWay-Sampler and Multiple Choice Modeling | | 0
Lightweight Recurrent Cross-modal Encoder for Video Question Answering | Code | 0

No leaderboard results yet.