SOTAVerified

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 721730 of 1149 papers

TitleStatusHype
Fewer Tokens and Fewer Videos: Extending Video Understanding Abilities in Large Vision-Language Models0
MeMSVD: Long-Range Temporal Structure Capturing Using Incremental SVD0
1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR'24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation0
Semantic Segmentation on VSPW Dataset through Masked Video Consistency0
3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation0
Contrastive Language Video Time Pre-training0
2nd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation0
HENASY: Learning to Assemble Scene-Entities for Egocentric Video-Language Model0
Temporal Grounding of Activities using Multimodal Large Language Models0
MMCTAgent: Multi-modal Critical Thinking Agent Framework for Complex Visual Reasoning0
Show:102550
← PrevPage 73 of 115Next →

No leaderboard results yet.