SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 841–850 of 1149 papers

Title | Status | Hype
Global Motion Understanding in Large-Scale Video Object Segmentation | | 0
Global Self-Attention Networks | | 0
Global Self-Attention Networks for Image Recognition | | 0
GPT-4o: Visual perception performance of multimodal large language models in piglet activity understanding | | 0
GPT4Video: A Unified Multimodal Large Language Model for Instruction-Followed Understanding and Safety-Aware Generation | | 0
Gradient Frequency Modulation for Visually Explaining Video Understanding Models | | 0
GraphVid: It Only Takes a Few Nodes to Understand a Video | | 0
Grounded Objects and Interactions for Video Captioning | | 0
Grounded Video Situation Recognition | | 0
Grounding Action Descriptions in Videos | | 0

No leaderboard results yet.