SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 251-300 of 1149 papers

Title | Status | Hype
CAST: Cross-Attention in Space and Time for Video Action Recognition | Code | 1
Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties | Code | 1
Panoptic Video Scene Graph Generation | Code | 1
Side4Video: Spatial-Temporal Side Network for Memory-Efficient Image-to-Video Transfer Learning | Code | 1
Mug-STAN: Adapting Image-Language Pretrained Models for General Video Understanding | Code | 1
MM-VID: Advancing Video Understanding with GPT-4V(ision) | Code | 1
BT-Adapter: Video Conversation is Feasible Without Video Instruction Tuning | Code | 1
End-to-End Streaming Video Temporal Action Segmentation with Reinforce Learning | Code | 1
SoccerNet 2023 Challenges Results | Code | 1
CEFHRI: A Communication Efficient Federated Learning Framework for Recognizing Industrial Human-Robot Interaction | Code | 1
Spherical Vision Transformer for 360-degree Video Saliency Prediction | Code | 1
Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos | Code | 1
EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language Understanding | Code | 1
Helping Hands: An Object-Aware Ego-Centric Video Recognition Model | Code | 1
Multimodal Distillation for Egocentric Action Recognition | Code | 1
Self-Adaptive Sampling for Efficient Video Question-Answering on Image-Text Models | Code | 1
An overview on the evaluated video retrieval tasks at TRECVID 2022 | Code | 1
Multi-Granularity Hand Action Detection | Code | 1
EPIC Fields: Marrying 3D Geometry and Video Understanding | Code | 1
VideoLLM: Modeling Video Sequence with Large Language Models | Code | 1
Transformer-Based Model for Monocular Visual Odometry: A Video Understanding Approach | Code | 1
MH-DETR: Video Moment and Highlight Detection with Cross-modal Transformer | Code | 1
Event-Free Moving Object Segmentation from Moving Ego Vehicle | Code | 1
Leveraging triplet loss for unsupervised action segmentation | Code | 1
Procedure-Aware Pretraining for Instructional Video Understanding | Code | 1
Whether and When does Endoscopy Domain Pretraining Make Sense? | Code | 1
Streaming Video Model | Code | 1
TimeBalance: Temporally-Invariant and Temporally-Distinctive Video Representations for Semi-Supervised Action Recognition | Code | 1
Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos | Code | 1
Dual-path Adaptation from Image to Video Transformers | Code | 1
TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization | Code | 1
Localizing Moments in Long Video Via Multimodal Guidance | Code | 1
Test of Time: Instilling Video-Language Models with a Sense of Time | Code | 1
Boosting Single Image Super-Resolution via Partial Channel Shifting | Code | 1
Modeling Video As Stochastic Processes for Fine-Grained Video Representation Learning | Code | 1
Towards Smooth Video Composition | Code | 1
MOMA-LRG: Language-Refined Graphs for Multi-Object Multi-Actor Activity Parsing | Code | 1
Contrastive Masked Autoencoders for Self-Supervised Video Hashing | Code | 1
EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens | Code | 1
InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges | Code | 1
VTC: Improving Video-Text Retrieval with User Comments | Code | 1
EgoTaskQA: Understanding Human Tasks in Egocentric Videos | Code | 1
SoccerNet 2022 Challenges Results | Code | 1
Learning Transferable Spatiotemporal Representations from Natural Script Knowledge | Code | 1
Streaming Video Temporal Action Segmentation In Real Time | Code | 1
Panoramic Vision Transformer for Saliency Detection in 360° Videos | Code | 1
EchoCoTr: Estimation of the Left Ventricular Ejection Fraction from Spatiotemporal Echocardiography | Code | 1
DeepSportradar-v1: Computer Vision Dataset for Sports Understanding with High Quality Annotations | Code | 1
Point Primitive Transformer for Long-Term 4D Point Cloud Video Understanding | Code | 1
Static and Dynamic Concepts for Self-supervised Video Representation Learning | Code | 1
Page 6 of 23

No leaderboard results yet.