Video Summarization

Video summarization aims to generate a short synopsis of a video by selecting its most informative and important parts. The summary is usually composed of either a set of representative video frames (a.k.a. video key-frames) or video fragments (a.k.a. video key-fragments) stitched in chronological order to form a shorter video. The former type of summary is known as a video storyboard, and the latter as a video skim.

Source: Video Summarization Using Deep Neural Networks: A Survey
Image credit: iJRASET
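The two summary types described above can be illustrated with a minimal sketch. It assumes per-frame importance scores are already available (how they are produced is outside this example), and the function names `storyboard` and `skim`, the fragment boundaries, and the frame budget are all hypothetical. The greedy fragment selection is a simple stand-in chosen for brevity, not a method taken from the source.

```python
from typing import List, Tuple

Fragment = Tuple[int, int]  # (start, end) frame indices, end exclusive


def storyboard(scores: List[float], k: int) -> List[int]:
    """Return the k highest-scoring frame indices in chronological order."""
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    return sorted(top)


def skim(scores: List[float], fragments: List[Fragment], budget: int) -> List[Fragment]:
    """Greedily pick fragments by mean score without exceeding a frame budget.

    The chosen fragments are returned in chronological order, ready to be
    stitched into a shorter video.
    """
    def mean_score(frag: Fragment) -> float:
        start, end = frag
        return sum(scores[start:end]) / (end - start)

    chosen: List[Fragment] = []
    used = 0
    for frag in sorted(fragments, key=mean_score, reverse=True):
        length = frag[1] - frag[0]
        if used + length <= budget:
            chosen.append(frag)
            used += length
    return sorted(chosen)


# Hypothetical importance scores for an 8-frame video split into 4 fragments.
scores = [0.1, 0.9, 0.8, 0.6, 0.3, 0.2, 0.95, 0.5]
fragments = [(0, 2), (2, 4), (4, 6), (6, 8)]

print(storyboard(scores, k=3))             # key-frame indices
print(skim(scores, fragments, budget=4))   # key-fragments within 4 frames
```

Here `storyboard` yields isolated key-frames, while `skim` yields contiguous key-fragments; both outputs are sorted by time so the summary preserves the original order of events.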

Papers

Showing 1–10 of 280 papers

Title	Date	Tasks	Status
TRIM: A Self-Supervised Video Summarization Framework Maximizing Temporal Relative Information and Representativeness	Jun 25, 2025	Self-Supervised Learning, Supervised Video Summarization	Unverified
Prompts to Summaries: Zero-Shot Language-Guided Video Summarization	Jun 12, 2025	GPU, Query focused video summarization	Unverified
MF2Summ: Multimodal Fusion for Video Summarization with Temporal Alignment	Jun 12, 2025	Video Summarization	Unverified
Enhancing Video Memorability Prediction with Text-Motion Cross-modal Contrastive Loss and Its Application in Video Summarization	Jun 10, 2025	Prediction, Video Summarization	Unverified
TriPSS: A Tri-Modal Keyframe Extraction Framework Using Perceptual, Structural, and Semantic Representations	Jun 3, 2025	Retrieval, Video Summarization	Unverified
Unsupervised Transcript-assisted Video Summarization and Highlight Detection	May 29, 2025	Highlight Detection, Reinforcement Learning (RL)	Unverified
REGen: Multimodal Retrieval-Embedded Generation for Long-to-Short Video Editing	May 24, 2025	Language Modeling, Language Modelling	Unverified
SD-VSum: A Method and Dataset for Script-Driven Video Summarization	May 6, 2025	Video Summarization	Code Available
Video Summarization with Large Language Models	Apr 15, 2025	Large Language Model, Video Summarization	Unverified
Automatic Detection of Intro and Credits in Video using CLIP and Multihead Attention	Apr 13, 2025	CPU, Highlight Detection	Unverified

Datasets: SumMe, TvSum, Shot2Story20K, Query-Focused Video Summarization Dataset, Mr. HiSum, VideoXum

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	VTSUM-BLIP	1 shot Micro-F1	23.5	—	Unverified