SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 876-900 of 1149 papers

Title | Status | Hype
Learning to Focus on the Foreground for Temporal Sentence Grounding | - | 0
In-the-Wild Video Question Answering | - | 0
Speeding Up Action Recognition Using Dynamic Accumulation of Residuals in Compressed Domain | - | 0
AVT: Audio-Video Transformer for Multimodal Action Recognition | - | 0
WildQA: In-the-Wild Video Question Answering | - | 0
Foundations and Trends in Multimodal Machine Learning: Principles, Challenges, and Open Questions | - | 0
Visual Subtitle Feature Enhanced Video Outline Generation | - | 0
Identifying Auxiliary or Adversarial Tasks Using Necessary Condition Analysis for Adversarial Multi-task Video Understanding | - | 0
Motion Sensitive Contrastive Learning for Self-supervised Video Representation | - | 0
Exploring Anchor-based Detection for Ego4D Natural Language Query | - | 0
SA-NET.v2: Real-time vehicle detection from oblique UAV images with use of uncertainty estimation in deep meta-learning | - | 0
Two-Stream Transformer Architecture for Long Video Understanding | - | 0
BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation | - | 0
EgoEnv: Human-centric environment representations from egocentric video | - | 0
Video Swin Transformers for Egocentric Video Understanding @ Ego4D Challenges 2022 | - | 0
AE-Net:Adjoint Enhancement Network for Efficient Action Recognition in Video Understanding | - | 0
An Efficient Spatio-Temporal Pyramid Transformer for Action Detection | - | 0
SVGraph: Learning Semantic Graphs from Instructional Videos | - | 0
GraphVid: It Only Takes a Few Nodes to Understand a Video | - | 0
Multimodal Intent Discovery from Livestream Videos | - | 0
(Un)likelihood Training for Interpretable Embedding | Code | 0
Dynamic Multistep Reasoning based on Video Scene Graph for Video Question Answering | - | 0
Submission to Generic Event Boundary Detection Challenge@CVPR 2022: Local Context Modeling and Global Boundary Decoding Approach | Code | 0
Technical Report for CVPR 2022 LOVEU AQTC Challenge | Code | 0
Multimodal Dialogue State Tracking | Code | 0
Page 36 of 46

No leaderboard results yet.