SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 651675 of 1149 papers

TitleStatusHype
DOAD: Decoupled One Stage Action Detection Network0
DocVideoQA: Towards Comprehensive Understanding of Document-Centric Videos through Question Answering0
Domain Adaptation of VLM for Soccer Video Understanding0
DPMix: Mixture of Depth and Point Cloud Video Experts for 4D Action Segmentation0
Dr2Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning0
DriveGPT4: Interpretable End-to-end Autonomous Driving via Large Language Model0
DrVideo: Document Retrieval Based Long Video Understanding0
Dilated Temporal Relational Adversarial Network for Generic Video Summarization0
DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM0
DualX-VSR: Dual Axial SpatialTemporal Transformer for Real-World Video Super-Resolution without Motion Compensation0
DyMU: Dynamic Merging and Virtual Unmerging for Efficient VLMs0
Dynamic Appearance: A Video Representation for Action Recognition with Joint Training0
Dynamic Graph Modules for Modeling Object-Object Interactions in Activity Recognition0
Dynamic Multistep Reasoning based on Video Scene Graph for Video Question Answering0
DynFocus: Dynamic Cooperative Network Empowers LLMs with Video Understanding0
DynTok: Dynamic Compression of Visual Tokens for Efficient and Effective Video Understanding0
EAGLE: Egocentric AGgregated Language-video Engine0
Efficient Annotation and Learning for 3D Hand Pose Estimation: A Survey0
Efficient Modelling Across Time of Human Actions and Interactions0
Efficient Motion-Aware Video MLLM0
Efficient Video Understanding via Layered Multi Frame-Rate Analysis0
EgoEnv: Human-centric environment representations from egocentric video0
Egocentric Video Task Translation0
EgoDistill: Egocentric Head Motion Distillation for Efficient Video Understanding0
Egok360: A 360 Egocentric Kinetic Human Activity Video Dataset0
Show:102550
← PrevPage 27 of 46Next →

No leaderboard results yet.