SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 451–460 of 1149 papers

Title | Status | Hype
Video action detection by learning graph-based spatio-temporal interactions | Code | 0
SoccerNet 2024 Challenges Results | Code | 0
4D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object Understanding | Code | 0
Snippet-Aware Transformer With Multiple Action Elements for Skeleton-Based Action Segmentation | Code | 0
SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game Understanding | Code | 0
Tiny Video Networks | Code | 0
SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding | Code | 0
ScVLM: Enhancing Vision-Language Model for Safety-Critical Event Understanding | Code | 0
Creative Flow+ Dataset | Code | 0
Screencast Tutorial Video Understanding | Code | 0

No leaderboard results yet.