SOTAVerified

EgoSchema

Papers

Showing 21-30 of 40 papers

Title | Status | Hype
VideoINSTA: Zero-shot Long Video Understanding via Informative Spatial-Temporal Reasoning with LLMs | Code | 1
Coarse Correspondences Boost Spatial-Temporal Reasoning in Multimodal Language Model | - | 0
VDMA: Video Question Answering with Dynamically Generated Multi-Agents | - | 0
HCQA @ Ego4D EgoSchema Challenge 2024 | Code | 1
DrVideo: Document Retrieval Based Long Video Understanding | - | 0
Too Many Frames, Not All Useful: Efficient Strategies for Long-Form Video QA | Code | 1
VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos | Code | 2
TOPA: Extending Large Language Models for Video Understanding via Text-Only Pre-Alignment | Code | 1
MoReVQA: Exploring Modular Reasoning Models for Video Question Answering | - | 0
Language Repository for Long Video Understanding | Code | 1

No leaderboard results yet.