SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 451475 of 1149 papers

TitleStatusHype
HA-ViD: A Human Assembly Video Dataset for Comprehensive Assembly Knowledge UnderstandingCode0
Hallucination Mitigation Prompts Long-term Video UnderstandingCode0
Video action detection by learning graph-based spatio-temporal interactionsCode0
Spatio-Temporal Perturbations for Video AttributionCode0
4D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object UnderstandingCode0
SoccerNet 2024 Challenges ResultsCode0
Streaming Detection of Queried Event StartCode0
Situational Scene Graph for Structured Human-centric Situation UnderstandingCode0
Creative Flow+ DatasetCode0
ScVLM: Enhancing Vision-Language Model for Safety-Critical Event UnderstandingCode0
Screencast Tutorial Video UnderstandingCode0
ScaleLong: A Multi-Timescale Benchmark for Long Video UnderstandingCode0
SeriesBench: A Benchmark for Narrative-Driven Drama Series UnderstandingCode0
Snippet-Aware Transformer With Multiple Action Elements for Skeleton-Based Action SegmentationCode0
Gaussian Temporal Awareness Networks for Action LocalizationCode0
Relation-aware Hierarchical Attention Framework for Video Question AnsweringCode0
Re-ID-AR: Improved Person Re-identification in Video via Joint Weakly Supervised Action RecognitionCode0
Representation Flow for Action RecognitionCode0
Contextual Explainable Video Representation: Human Perception-based UnderstandingCode0
DriftNet: Aggressive Driving Behavior Classification using 3D EfficientNet ArchitectureCode0
Recurrent Space-time Graph Neural NetworksCode0
FriendsQA: A New Large-Scale Deep Video Understanding Dataset with Fine-grained Topic Categorization for Story VideosCode0
Constrained-size Tensorflow Models for YouTube-8M Video Understanding ChallengeCode0
VideoDG: Generalizing Temporal Relations in Videos to Novel DomainsCode0
SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game UnderstandingCode0
Show:102550
← PrevPage 19 of 46Next →

No leaderboard results yet.