Video Semantic Segmentation

The goal of video semantic segmentation is to assign a predefined class to each pixel in all frames of a video. This requires the model not only to predict accurate segmentation masks but also to ensure that these masks remain temporally consistent across frames. This task has broad applications in areas such as autonomous driving, medical video analysis, and AR/VR.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 576–600 of 895 papers

Title	Date	Tasks	Status
Novel adaptation of video segmentation to 3D MRI: efficient zero-shot knee segmentation with SAM2	Aug 8, 2024	Image SegmentationMedical Image Analysis	—Unverified
Novel tile segmentation scheme for omnidirectional video	Mar 10, 2021	Video SegmentationVideo Semantic Segmentation	—Unverified
Object Detection, Tracking, and Motion Segmentation for Object-level Video Segmentation	Aug 10, 2016	Motion SegmentationObject	—Unverified
Object Segmentation Tracking from Generic Video Cues	Oct 5, 2019	ObjectOptical Flow Estimation	—Unverified
Object Segmentation with Audio Context	Jan 4, 2023	audio-visual learningDecoder	—Unverified
OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation	Mar 10, 2025	Pseudo LabelSemantic Segmentation	—Unverified
One-shot Training for Video Object Segmentation	May 22, 2024	ObjectSemantic Segmentation	—Unverified
One-Shot Video Inpainting	Feb 28, 2023	ObjectSegmentation	—Unverified
One-Shot Weakly Supervised Video Object Segmentation	Dec 18, 2019	ObjectSegmentation	—Unverified
OneVOS: Unifying Video Object Segmentation with All-in-One Transformer Framework	Mar 13, 2024	AllManagement	—Unverified
On guiding video object segmentation	Apr 25, 2019	Foreground SegmentationObject	—Unverified
Online Adaptation of Convolutional Neural Networks for Video Object Segmentation	Jun 28, 2017	ObjectSegmentation	—Unverified
Online Reasoning Video Segmentation with Just-in-Time Digital Twins	Mar 27, 2025	Reasoning SegmentationVideo Segmentation	—Unverified
Online Video Object Segmentation via Convolutional Trident Network	Jul 1, 2017	ObjectOptical Flow Estimation	—Unverified
Open-World Skill Discovery from Unsegmented Demonstrations	Mar 11, 2025	Boundary DetectionEvent Segmentation	—Unverified
OVSNet : Towards One-Pass Real-Time Video Object Segmentation	May 24, 2019	Objectobject-detection	—Unverified
Parameter-free Video Segmentation for Vision and Language Understanding	Mar 3, 2025	Question AnsweringVideo Question Answering	—Unverified
Saliency-Aware Geodesic Video Object Segmentation	Jun 1, 2015	ObjectSegmentation	—Unverified
Saliency Detection in Educational Videos: Analyzing the Performance of Current Models, Identifying Limitations and Advancement Directions	Aug 8, 2024	Information RetrievalSaliency Detection	—Unverified
Saliency-Motion Guided Trunk-Collateral Network for Unsupervised Video Object Segmentation	Apr 8, 2025	Optical Flow EstimationSalient Object Detection	—Unverified
SAM2 for Image and Video Segmentation: A Comprehensive Survey	Mar 17, 2025	Autonomous DrivingImage Segmentation	—Unverified
SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation	Aug 8, 2024	DecoderInteractive Segmentation	—Unverified
SANPO: A Scene Understanding, Accessibility and Human Navigation Dataset	Sep 21, 2023	Autonomous VehiclesDepth Estimation	—Unverified
Scalable Video Object Segmentation with Simplified Framework	Aug 19, 2023	ObjectSemantic Segmentation	—Unverified
ScribbleBox: Interactive Annotation Framework for Video Object Segmentation	Aug 22, 2020	ObjectSegmentation	—Unverified

Show:10 25 50

← PrevPage 24 of 36Next →

All datasets Cityscapes val CamVid VSPW LaRS Multispectral Video Semantic Segmentation

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	mIoU	80.3	—	Unverified
2	TDNet-50 [9]	mIoU	79.9	—	Unverified
3	DeltaDist-DDRNet-39	mIoU	79.9	—	Unverified
4	PSPNet-101 [20]	mIoU	79.7	—	Unverified
5	PSPNet-50 [20]	mIoU	78.1	—	Unverified
6	LVS [12]	mIoU	76.8	—	Unverified
7	GRFP [15]	mIoU	73.6	—	Unverified
8	FCN-50 [14]	mIoU	70.1	—	Unverified
9	DFF [22]	mIoU	69.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	Mean IoU	76.5	—	Unverified
2	ETC-MobileNet	Mean IoU	76.3	—	Unverified
3	TDNet-50	Mean IoU	76.2	—	Unverified
4	PSPNet-50	Mean IoU	76	—	Unverified
5	Netwarp	Mean IoU	74.7	—	Unverified
6	GRFP	Mean IoU	67.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DVIS++(VIT-L)	mIoU	63.8	—	Unverified
2	UniVS(Swin-L)	mIoU	59.8	—	Unverified
3	Tube-Link(Swin-large)	mIoU	59.6	—	Unverified
4	MRCFA(MiT-B5)	mIoU	49.9	—	Unverified
5	CFFM(MiT-B5)	mIoU	49.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WaSR-T (ResNet-101)	Q	60.1	—	Unverified
2	TMANet (ResNet-50)	Q	57.5	—	Unverified
3	CSANet (ResNet-101)	Q	49.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MVNet(DeepLabV3)	mIoU	54.52	—	Unverified
2	MVNet(PSPNet)	mIoU	54.36	—	Unverified
3	MVNet(FCN)	mIoU	53.9	—	Unverified