Video Semantic Segmentation

The goal of video semantic segmentation is to assign a predefined class to each pixel in all frames of a video. This requires the model not only to predict accurate segmentation masks but also to ensure that these masks remain temporally consistent across frames. This task has broad applications in areas such as autonomous driving, medical video analysis, and AR/VR.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 601–625 of 895 papers

Title	Date	Tasks	Status
SeamSeg: Video Object Segmentation using Patch Seams	Jun 1, 2014	ObjectSemantic Segmentation	—Unverified
SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction	Jul 21, 2025	ObjectSegmentation	—Unverified
SegEQA: Video Segmentation Based Visual Attention for Embodied Question Answering	Oct 1, 2019	Embodied Question AnsweringQuestion Answering	—Unverified
SegGPT: Towards Segmenting Everything in Context	Jan 1, 2023	Few-Shot Semantic SegmentationIn-Context Learning	—Unverified
Segment Every Reference Object in Spatial and Temporal Spaces	Jan 1, 2023	Image SegmentationObject	—Unverified
Selective Video Object Cutout	Feb 28, 2017	Computational EfficiencyObject	—Unverified
Self-Occlusions and Disocclusions in Causal Video Object Segmentation	Dec 1, 2015	ObjectSemantic Segmentation	—Unverified
Self-supervised Motion Representation via Scattering Local Motion Cues	Aug 1, 2020	Action RecognitionOptical Flow Estimation	—Unverified
Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention	Jan 25, 2024	Knowledge DistillationObject	—Unverified
Self-supervised Video Object Segmentation by Motion Grouping	Apr 15, 2021	Motion SegmentationObject	—Unverified
Self-Supervised Video Object Segmentation via Cutout Prediction and Tagging	Apr 22, 2022	ObjectSegmentation	—Unverified
Semantically-Guided Video Object Segmentation	Apr 6, 2017	ObjectSegmentation	—Unverified
Semantically Video Coding: Instill Static-Dynamic Clues into Structured Bitstream for AI Tasks	Jan 25, 2022	Action RecognitionObject	—Unverified
Semantic and Sequential Alignment for Referring Video Object Segmentation	Jan 1, 2025	Instance SegmentationReferring Video Object Segmentation	—Unverified
Semantic Segmentation on VSPW Dataset through Aggregation of Transformer Models	Sep 3, 2021	Autonomous DrivingScene Parsing	—Unverified
Semantic Segmentation on VSPW Dataset through Contrastive Loss and Multi-dataset Training Approach	Jun 6, 2023	Scene ParsingSemantic Segmentation	—Unverified
Semantic Video Segmentation: A Review on Recent Approaches	Jun 16, 2018	SegmentationSemantic Segmentation	—Unverified
Semantic Video Segmentation by Gated Recurrent Flow Propagation	Dec 28, 2016	Optical Flow EstimationSegmentation	—Unverified
Semantic Video Segmentation for Intracytoplasmic Sperm Injection Procedures	Jan 4, 2021	GPUVideo Segmentation	—Unverified
Semantic video segmentation for autonomous driving	Oct 28, 2020	Autonomous DrivingSegmentation	—Unverified
Semi-Supervised Domain Adaptation for Weakly Labeled Semantic Video Object Segmentation	Jun 7, 2016	Domain AdaptationSegmentation	—Unverified
Semi-supervised Video Semantic Segmentation Using Unreliable Pseudo Labels for PVUW2024	Jun 2, 2024	Scene ParsingScene Understanding	—Unverified
Sequential Clique Optimization for Video Object Segmentation	Sep 1, 2018	Objectobject-detection	—Unverified
Shift and matching queries for video semantic segmentation	Oct 10, 2024	Image SegmentationSegmentation	—Unverified
Shifted Chunk Transformer for Spatio-Temporal Representational Learning	Aug 26, 2021	Action AnticipationAction Recognition	—Unverified

Show:10 25 50

← PrevPage 25 of 36Next →

All datasets Cityscapes val CamVid VSPW LaRS Multispectral Video Semantic Segmentation

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	mIoU	80.3	—	Unverified
2	TDNet-50 [9]	mIoU	79.9	—	Unverified
3	DeltaDist-DDRNet-39	mIoU	79.9	—	Unverified
4	PSPNet-101 [20]	mIoU	79.7	—	Unverified
5	PSPNet-50 [20]	mIoU	78.1	—	Unverified
6	LVS [12]	mIoU	76.8	—	Unverified
7	GRFP [15]	mIoU	73.6	—	Unverified
8	FCN-50 [14]	mIoU	70.1	—	Unverified
9	DFF [22]	mIoU	69.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	Mean IoU	76.5	—	Unverified
2	ETC-MobileNet	Mean IoU	76.3	—	Unverified
3	TDNet-50	Mean IoU	76.2	—	Unverified
4	PSPNet-50	Mean IoU	76	—	Unverified
5	Netwarp	Mean IoU	74.7	—	Unverified
6	GRFP	Mean IoU	67.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DVIS++(VIT-L)	mIoU	63.8	—	Unverified
2	UniVS(Swin-L)	mIoU	59.8	—	Unverified
3	Tube-Link(Swin-large)	mIoU	59.6	—	Unverified
4	MRCFA(MiT-B5)	mIoU	49.9	—	Unverified
5	CFFM(MiT-B5)	mIoU	49.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WaSR-T (ResNet-101)	Q	60.1	—	Unverified
2	TMANet (ResNet-50)	Q	57.5	—	Unverified
3	CSANet (ResNet-101)	Q	49.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MVNet(DeepLabV3)	mIoU	54.52	—	Unverified
2	MVNet(PSPNet)	mIoU	54.36	—	Unverified
3	MVNet(FCN)	mIoU	53.9	—	Unverified