SOTAVerified|Agents Browse Leaderboard About

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1031–1040 of 1149 papers

Title	Date	Tasks	Status	Hype	Score
Prompting Video-Language Foundation Models with Domain-specific Fine-grained Heuristics for Video Question Answering	Oct 12, 2024	Question AnsweringVideo Question Answering	—Unverified	0	0
PromptonomyViT: Multi-Task Prompt Learning Improves Video Transformers using Synthetic Scene Data	Dec 8, 2022	Action RecognitionPrompt Learning	—Unverified	0	0
Prototypes are Balanced Units for Efficient and Effective Partially Relevant Video Retrieval	Apr 17, 2025	Partially Relevant Video RetrievalRetrieval	—Unverified	0	0
PVChat: Personalized Video Chat with One-Shot Learning	Mar 21, 2025	One-Shot LearningQuestion Answering	—Unverified	0	0
PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models	Dec 12, 2024	Video Understanding	—Unverified	0	0
PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild	Apr 15, 2025	SegmentationSemantic Segmentation	—Unverified	0	0
PYSKL: a toolbox for skeleton-based video understanding	Apr 2, 2022	Skeleton Based Action RecognitionVideo Understanding	—Unverified	0	0
Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs	Sep 30, 2024	BenchmarkingMultiple-choice	—Unverified	0	0
Q-Bench-Video: Benchmark the Video Quality Understanding of LMMs	Jan 1, 2025	Multiple-choiceVideo Generation	—Unverified	0	0
Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs	Jun 27, 2025	MMEVideo MME	—Unverified	0	0

Show:10 25 50

← PrevPage 104 of 115Next →

No leaderboard results yet.