SOTAVerified|Agents Browse Leaderboard About

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 771–780 of 1149 papers

Title	Date	Tasks	Status	Hype
VideoPrism: A Foundational Visual Encoder for Video Understanding	Feb 20, 2024	Question AnsweringVideo Question Answering	—Unverified	0
Dynamics Based Neural Encoding with Inter-Intra Region Connectivity	Feb 19, 2024	Video Understanding	—Unverified	0
Are you Struggling? Dataset and Baselines for Struggle Determination in Assembly Videos	Feb 16, 2024	Decision MakingVideo Understanding	CodeCode Available	0
Memory Consolidation Enables Long-Context Video Understanding	Feb 8, 2024	EgoSchemaVideo Understanding	—Unverified	0
A Survey on Generative AI and LLM for Video Generation, Understanding, and Streaming	Jan 30, 2024	Video GenerationVideo Understanding	—Unverified	0
Cutup and Detect: Human Fall Detection on Cutup Untrimmed Videos Using a Large Foundational Video Understanding Model	Jan 29, 2024	Action DetectionAction Localization	—Unverified	0
Exploring Missing Modality in Multimodal Egocentric Datasets	Jan 21, 2024	Action RecognitionVideo Understanding	—Unverified	0
Learning to Visually Connect Actions and their Effects	Jan 19, 2024	Object TrackingTask Planning	—Unverified	0
CrossVideo: Self-supervised Cross-modal Contrastive Learning for Point Cloud Video Understanding	Jan 17, 2024	Contrastive Learningpoint cloud video understanding	—Unverified	0
Multi-scale 2D Temporal Map Diffusion Models for Natural Language Video Localization	Jan 16, 2024	DecoderDenoising	—Unverified	0

Show:10 25 50

← PrevPage 78 of 115Next →

No leaderboard results yet.