SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 731740 of 1149 papers

TitleStatusHype
Generative Frame Sampler for Long Video Understanding0
Geometry Guided Convolutional Neural Networks for Self-Supervised Video Representation Learning0
GEXIA: Granularity Expansion and Iterative Approximation for Scalable Multi-grained Video-language Learning0
Global Motion Understanding in Large-Scale Video Object Segmentation0
Global Self-Attention Networks0
Global Self-Attention Networks for Image Recognition0
GPT-4o: Visual perception performance of multimodal large language models in piglet activity understanding0
GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation0
Gradient Frequency Modulation for Visually Explaining Video Understanding Models0
GraphVid: It Only Takes a Few Nodes to Understand a Video0
Show:102550
← PrevPage 74 of 115Next →

No leaderboard results yet.