SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 776800 of 1149 papers

TitleStatusHype
BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation0
Point Primitive Transformer for Long-Term 4D Point Cloud Video UnderstandingCode1
Static and Dynamic Concepts for Self-supervised Video Representation LearningCode1
EgoEnv: Human-centric environment representations from egocentric video0
Video Swin Transformers for Egocentric Video Understanding @ Ego4D Challenges 20220
AE-Net:Adjoint Enhancement Network for Efficient Action Recognition in Video Understanding0
An Efficient Spatio-Temporal Pyramid Transformer for Action Detection0
Spotting Temporally Precise, Fine-Grained Events in VideoCode1
Clover: Towards A Unified Video-Language Alignment and Fusion ModelCode1
SVGraph: Learning Semantic Graphs from Instructional Videos0
Is Appearance Free Action Recognition Possible?Code1
Federated Self-supervised Learning for Video UnderstandingCode1
GraphVid: It Only Takes a Few Nodes to Understand a Video0
Dynamic Multistep Reasoning based on Video Scene Graph for Video Question Answering0
Multimodal Intent Discovery from Livestream Videos0
(Un)likelihood Training for Interpretable EmbeddingCode0
Submission to Generic Event Boundary Detection Challenge@CVPR 2022: Local Context Modeling and Global Boundary Decoding ApproachCode0
Technical Report for CVPR 2022 LOVEU AQTC ChallengeCode0
ST-Adapter: Parameter-Efficient Image-to-Video Transfer LearningCode1
REVECA -- Rich Encoder-decoder framework for Video Event CAptionerCode1
Multimodal Dialogue State TrackingCode0
Stand-Alone Inter-Frame Attention in Video ModelsCode1
Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens0
A Simple and Efficient Pipeline to Build an End-to-End Spatial-Temporal Action DetectorCode1
Efficient Annotation and Learning for 3D Hand Pose Estimation: A Survey0
Show:102550
← PrevPage 32 of 46Next →

No leaderboard results yet.