SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 431–440 of 1149 papers

Title | Status | Hype
Dr2Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning | | 0
Beyond Appearance: Geometric Cues for Robust Video Instance Segmentation | | 0
Learning Audio-guided Video Representation with Gated Attention for Video-Text Retrieval | | 0
Learning Dynamic MRI Reconstruction with Convolutional Network Assisted Reconstruction Swin Transformer | | 0
DPMix: Mixture of Depth and Point Cloud Video Experts for 4D Action Segmentation | | 0
BERT for Large-scale Video Segment Classification with Test-time Augmentation | | 0
AMEGO: Active Memory from long EGOcentric videos | | 0
Domain Adaptation of VLM for Soccer Video Understanding | | 0
Actor-Action Semantic Segmentation with Grouping Process Models | | 0
BEARCUBS: A benchmark for computer-using web agents | | 0

No leaderboard results yet.