SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 231–240 of 1149 papers

Title | Status | Hype
Helping Hands: An Object-Aware Ego-Centric Video Recognition Model | Code | 1
BT-Adapter: Video Conversation is Feasible Without Video Instruction Tuning | Code | 1
Grounded Question-Answering in Long Egocentric Videos | Code | 1
Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video Understanding | Code | 1
Panoramic Vision Transformer for Saliency Detection in 360° Videos | Code | 1
NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions | Code | 1
Free Lunch for Surgical Video Understanding by Distilling Self-Supervisions | Code | 1
Event-Free Moving Object Segmentation from Moving Ego Vehicle | Code | 1
Compositional Video Understanding with Spatiotemporal Structure-based Transformers | Code | 1
From My View to Yours: Ego-Augmented Learning in Large Vision Language Models for Understanding Exocentric Daily Living Activities | Code | 1

No leaderboard results yet.