SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 791800 of 1149 papers

TitleStatusHype
(Un)likelihood Training for Interpretable EmbeddingCode0
Submission to Generic Event Boundary Detection Challenge@CVPR 2022: Local Context Modeling and Global Boundary Decoding ApproachCode0
Technical Report for CVPR 2022 LOVEU AQTC ChallengeCode0
ST-Adapter: Parameter-Efficient Image-to-Video Transfer LearningCode1
REVECA -- Rich Encoder-decoder framework for Video Event CAptionerCode1
Multimodal Dialogue State TrackingCode0
Stand-Alone Inter-Frame Attention in Video ModelsCode1
Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens0
A Simple and Efficient Pipeline to Build an End-to-End Spatial-Temporal Action DetectorCode1
Efficient Annotation and Learning for 3D Hand Pose Estimation: A Survey0
Show:102550
← PrevPage 80 of 115Next →

No leaderboard results yet.