Video Semantic Segmentation

The goal of video semantic segmentation is to assign a predefined class to each pixel in all frames of a video. This requires the model not only to predict accurate segmentation masks but also to ensure that these masks remain temporally consistent across frames. This task has broad applications in areas such as autonomous driving, medical video analysis, and AR/VR.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 401–450 of 895 papers

Title	Date	Tasks	Status	Score
UVid-Net: Enhanced Semantic Segmentation of UAV Aerial Videos by Embedding Temporal Information	Nov 29, 2020	Aerial Video Semantic SegmentationDecision Making	CodeCode Available	5
Anchor Diffusion for Unsupervised Video Object Segmentation	Oct 24, 2019	Image SegmentationObject	CodeCode Available	5
Weakly Supervised Energy-Based Learning for Action Segmentation	Sep 28, 2019	Action SegmentationSegmentation	CodeCode Available	5
Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes	Jan 27, 2024	Motion EstimationSegmentation	CodeCode Available	5
Temporal-aware Hierarchical Mask Classification for Video Semantic Segmentation	Sep 14, 2023	ClassificationDecoder	CodeCode Available	5
A Benchmark Dataset and Evaluation Methodology for Video Object Segmentation	Jun 1, 2016	SegmentationSemantic Segmentation	CodeCode Available	5
MoDA: Leveraging Motion Priors from Videos for Advancing Unsupervised Domain Adaptation in Semantic Segmentation	Sep 21, 2023	Domain AdaptationImage Segmentation	CodeCode Available	5
NewsNet: A Novel Dataset for Hierarchical Temporal Segmentation	Jan 1, 2023	Video SegmentationVideo Semantic Segmentation	CodeCode Available	5
Exploiting Temporality for Semi-Supervised Video Segmentation	Aug 29, 2019	DecoderImage Segmentation	CodeCode Available	5
An Image Processing Pipeline for Camera Trap Time-Lapse Recordings	Jun 10, 2022	BIG-bench Machine LearningVideo Segmentation	CodeCode Available	5
Video Semantic Segmentation with Inter-Frame Feature Fusion and Inner-Frame Feature Refinement	Jan 10, 2023	Optical Flow EstimationSemantic Segmentation	CodeCode Available	5
Video State-Changing Object Segmentation	Jan 1, 2023	ObjectRepresentation Learning	CodeCode Available	5
MHP-VOS: Multiple Hypotheses Propagation for Video Object Segmentation	Apr 17, 2019	Decision MakingObject	CodeCode Available	5
Temporal Transductive Inference for Few-Shot Video Object Segmentation	Mar 27, 2022	Meta-LearningObject	CodeCode Available	5
CLVOS23: A Long Video Object Segmentation Dataset for Continual Learning	Apr 9, 2023	Continual LearningSemantic Segmentation	CodeCode Available	5
Multiscale Memory Comparator Transformer for Few-Shot Video Segmentation	Jul 15, 2023	DecoderSegmentation	CodeCode Available	5
Strike the Balance: On-the-Fly Uncertainty based User Interactions for Long-Term Video Object Segmentation	Jul 31, 2024	ObjectSegmentation	CodeCode Available	5
When SAM2 Meets Video Shadow and Mirror Detection	Dec 26, 2024	Image SegmentationMirror Detection	CodeCode Available	5
DTOS: Dynamic Time Object Sensing with Large Multimodal Model	Jan 1, 2025	Moment RetrievalReferring Video Object Segmentation	CodeCode Available	5
Borrowing from yourself: Faster future video segmentation with partial channel update	Feb 11, 2022	Future predictionSemantic Segmentation	CodeCode Available	5
Fully Convolutional Networks for Semantic Segmentation	May 20, 2016	Real-Time Semantic SegmentationScene Segmentation	CodeCode Available	5
Fast Interactive Video Object Segmentation with Graph Neural Networks	Mar 5, 2021	Graph Neural NetworkInteractive Video Object Segmentation	CodeCode Available	5
Learning Unsupervised Video Object Segmentation Through Visual Attention	Jun 1, 2019	ObjectSegmentation	CodeCode Available	5
UAVid: A Semantic Segmentation Dataset for UAV Imagery	Oct 24, 2018	4kAutonomous Driving	CodeCode Available	5
ThinkVideo: High-Quality Reasoning Video Segmentation with Chain of Thoughts	May 24, 2025	Image SegmentationInstance Segmentation	CodeCode Available	5
Video Decomposition Prior: A Methodology to Decompose Videos into Layers	Dec 6, 2024	Semantic SegmentationVideo Editing	—Unverified	0
VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos	Nov 7, 2024	DecoderLanguage Modeling	—Unverified	0
VideoGLaMM : A Large Multimodal Model for Pixel-Level Visual Grounding in Videos	Jan 1, 2025	Large Language ModelVideo Segmentation	—Unverified	0
Video Human Segmentation using Fuzzy Object Models and its Application to Body Pose Estimation of Toddlers for Behavior Studies	May 29, 2013	ObjectPose Estimation	—Unverified	0
Video Instance Segmentation Tracking With a Modified VAE Architecture	Jun 1, 2020	Instance Segmentationobject-detection	—Unverified	0
Video-kMaX: A Simple Unified Approach for Online and Near-Online Video Panoptic Segmentation	Apr 10, 2023	Panoptic SegmentationScene Understanding	—Unverified	0
VideoMatch: Matching based Video Object Segmentation	Sep 4, 2018	MemorizationObject	—Unverified	0
Video Object of Interest Segmentation	Dec 6, 2022	DecoderObject	—Unverified	0
Video Object Segmentation and Tracking: A Survey	Apr 19, 2019	Autonomous VehiclesObject	—Unverified	0
Video Object Segmentation by Learning Location-Sensitive Embeddings	Sep 1, 2018	ObjectSemantic Segmentation	—Unverified	0
Video Object Segmentation through Spatially Accurate and Temporally Dense Extraction of Primary Object Regions	Jun 1, 2013	ObjectOptical Flow Estimation	—Unverified	0
Video Object Segmentation Using Global and Instance Embedding Learning	Jun 19, 2021	ObjectRelation	—Unverified	0
Video Object Segmentation using Tracked Object Proposals	Jul 20, 2017	Objectobject-detection	—Unverified	0
Video Object Segmentation via SAM 2: The 4th Solution for LSVOS Challenge VOS Track	Aug 19, 2024	ObjectSegmentation	—Unverified	0
Video Object Segmentation with Joint Re-identification and Attention-Aware Mask Propagation	Mar 12, 2018	Semantic SegmentationVideo Object Segmentation	—Unverified	0
Video Object Segmentation with Language Referring Expressions	Mar 21, 2018	ObjectReferring Expression Segmentation	—Unverified	0
Video Object Segmentation Without Temporal Information	Sep 18, 2017	Foreground SegmentationObject	—Unverified	0
Video Propagation Networks	Dec 16, 2016	SegmentationSemantic Segmentation	—Unverified	0
Video Salient Object Detection Using Spatiotemporal Deep Features	Aug 4, 2017	Objectobject-detection	—Unverified	0
Video Salient Object Detection via Contrastive Features and Attention Modules	Nov 3, 2021	Contrastive LearningObject	—Unverified	0
VideoSAM: Open-World Video Segmentation	Oct 11, 2024	Autonomous DrivingDecoder	—Unverified	0
Video Segmentation Learning Using Cascade Residual Convolutional Neural Network	Dec 20, 2022	Action RecognitionAnomaly Detection	—Unverified	0
Video Segmentation via Diffusion Bases	May 1, 2013	SegmentationVideo Segmentation	—Unverified	0
Video Segmentation via Multiple Granularity Analysis	Jul 1, 2017	Multiple Instance LearningSegmentation	—Unverified	0
Video Segmentation via Object Flow	Jun 1, 2016	ObjectOptical Flow Estimation	—Unverified	0

Show:10 25 50

← PrevPage 9 of 18Next →

All datasets Cityscapes val CamVid VSPW LaRS Multispectral Video Semantic Segmentation

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	mIoU	80.3	—	Unverified
2	DeltaDist-DDRNet-39	mIoU	79.9	—	Unverified
3	TDNet-50 [9]	mIoU	79.9	—	Unverified
4	PSPNet-101 [20]	mIoU	79.7	—	Unverified
5	PSPNet-50 [20]	mIoU	78.1	—	Unverified
6	LVS [12]	mIoU	76.8	—	Unverified
7	GRFP [15]	mIoU	73.6	—	Unverified
8	FCN-50 [14]	mIoU	70.1	—	Unverified
9	DFF [22]	mIoU	69.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	Mean IoU	76.5	—	Unverified
2	ETC-MobileNet	Mean IoU	76.3	—	Unverified
3	TDNet-50	Mean IoU	76.2	—	Unverified
4	PSPNet-50	Mean IoU	76	—	Unverified
5	Netwarp	Mean IoU	74.7	—	Unverified
6	GRFP	Mean IoU	67.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DVIS++(VIT-L)	mIoU	63.8	—	Unverified
2	UniVS(Swin-L)	mIoU	59.8	—	Unverified
3	Tube-Link(Swin-large)	mIoU	59.6	—	Unverified
4	MRCFA(MiT-B5)	mIoU	49.9	—	Unverified
5	CFFM(MiT-B5)	mIoU	49.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WaSR-T (ResNet-101)	Q	60.1	—	Unverified
2	TMANet (ResNet-50)	Q	57.5	—	Unverified
3	CSANet (ResNet-101)	Q	49.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MVNet(DeepLabV3)	mIoU	54.52	—	Unverified
2	MVNet(PSPNet)	mIoU	54.36	—	Unverified
3	MVNet(FCN)	mIoU	53.9	—	Unverified