SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 651660 of 1149 papers

TitleStatusHype
DOAD: Decoupled One Stage Action Detection Network0
DocVideoQA: Towards Comprehensive Understanding of Document-Centric Videos through Question Answering0
Domain Adaptation of VLM for Soccer Video Understanding0
DPMix: Mixture of Depth and Point Cloud Video Experts for 4D Action Segmentation0
Dr2Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning0
DriveGPT4: Interpretable End-to-end Autonomous Driving via Large Language Model0
DrVideo: Document Retrieval Based Long Video Understanding0
Dilated Temporal Relational Adversarial Network for Generic Video Summarization0
DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM0
DualX-VSR: Dual Axial SpatialTemporal Transformer for Real-World Video Super-Resolution without Motion Compensation0
Show:102550
← PrevPage 66 of 115Next →

No leaderboard results yet.