SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 851900 of 1149 papers

TitleStatusHype
STPrivacy: Spatio-Temporal Privacy-Preserving Action Recognition0
EgoDistill: Egocentric Head Motion Distillation for Efficient Video Understanding0
PIDRo: Parallel Isomeric Attention with Dynamic Routing for Text-Video Retrieval0
Self-Supervised Object Detection from Egocentric Videos0
Relational Space-Time Query in Long-Form Videos0
Few-Shot Referring Relationships in VideosCode0
UniFormerV2: Unlocking the Potential of Image ViTs for Video Understanding0
Inverse Compositional Learning for Weakly-supervised Relation Grounding0
Multimodal High-order Relation Transformer for Scene Boundary Detection0
Joint Engagement Classification using Video Augmentation Techniques for Multi-person Human-robot Interaction0
Inductive Attention for Video Action Anticipation0
Egocentric Video Task Translation0
Contextual Explainable Video Representation: Human Perception-based UnderstandingCode0
PromptonomyViT: Multi-Task Prompt Learning Improves Video Transformers using Synthetic Scene Data0
Transition Is a Process: Pair-to-Video Change Detection Networks for Very High Resolution Remote Sensing Images0
Spatio-Temporal Crop Aggregation for Video Representation Learning0
Dynamic Appearance: A Video Representation for Action Recognition with Joint Training0
A Unified Model for Video Understanding and Knowledge Embedding with Heterogeneous Knowledge Graph Dataset0
Masked Autoencoders for Egocentric Video Understanding @ Ego4D Challenge 2022Code0
Exploring State Change Capture of Heterogeneous Backbones @ Ego4D Hands and Objects Challenge 20220
Grounded Video Situation Recognition0
How Would The Viewer Feel? Estimating Wellbeing From Video ScenariosCode0
Self-supervised video pretraining yields robust and more human-aligned visual representations0
Students taught by multimodal teachers are superior action recognizers0
Compressed Vision for Efficient Video Understanding0
Learning to Focus on the Foreground for Temporal Sentence Grounding0
In-the-Wild Video Question Answering0
Speeding Up Action Recognition Using Dynamic Accumulation of Residuals in Compressed Domain0
AVT: Audio-Video Transformer for Multimodal Action Recognition0
WildQA: In-the-Wild Video Question Answering0
Foundations and Trends in Multimodal Machine Learning: Principles, Challenges, and Open Questions0
Visual Subtitle Feature Enhanced Video Outline Generation0
Identifying Auxiliary or Adversarial Tasks Using Necessary Condition Analysis for Adversarial Multi-task Video Understanding0
Motion Sensitive Contrastive Learning for Self-supervised Video Representation0
Exploring Anchor-based Detection for Ego4D Natural Language Query0
SA-NET.v2: Real-time vehicle detection from oblique UAV images with use of uncertainty estimation in deep meta-learning0
Two-Stream Transformer Architecture for Long Video Understanding0
BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation0
EgoEnv: Human-centric environment representations from egocentric video0
Video Swin Transformers for Egocentric Video Understanding @ Ego4D Challenges 20220
AE-Net:Adjoint Enhancement Network for Efficient Action Recognition in Video Understanding0
An Efficient Spatio-Temporal Pyramid Transformer for Action Detection0
SVGraph: Learning Semantic Graphs from Instructional Videos0
GraphVid: It Only Takes a Few Nodes to Understand a Video0
Multimodal Intent Discovery from Livestream Videos0
(Un)likelihood Training for Interpretable EmbeddingCode0
Dynamic Multistep Reasoning based on Video Scene Graph for Video Question Answering0
Submission to Generic Event Boundary Detection Challenge@CVPR 2022: Local Context Modeling and Global Boundary Decoding ApproachCode0
Technical Report for CVPR 2022 LOVEU AQTC ChallengeCode0
Multimodal Dialogue State TrackingCode0
Show:102550
← PrevPage 18 of 23Next →

No leaderboard results yet.