Video Semantic Segmentation

The goal of video semantic segmentation is to assign a predefined class to each pixel in all frames of a video. This requires the model not only to predict accurate segmentation masks but also to ensure that these masks remain temporally consistent across frames. This task has broad applications in areas such as autonomous driving, medical video analysis, and AR/VR.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 551–600 of 895 papers

Title	Date	Tasks	Status
Motion-Corrected Moving Average: Including Post-Hoc Temporal Information for Improved Video Segmentation	Mar 5, 2024	Optical Flow EstimationSegmentation	—Unverified
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level	Nov 15, 2024	Benchmarkingcounterfactual	—Unverified
Motion-Guided Cascaded Refinement Network for Video Object Segmentation	Jun 1, 2018	ObjectOptical Flow Estimation	—Unverified
Motion-inductive Self-supervised Object Discovery in Videos	Oct 1, 2022	ObjectObject Discovery	—Unverified
Motion Prediction in Visual Object Tracking	Jul 1, 2020	Autonomous Drivingmotion prediction	—Unverified
Motion-state Alignment for Video Semantic Segmentation	Apr 18, 2023	Semantic SegmentationVideo Semantic Segmentation	—Unverified
Moving Object Proposals with Deep Learned Optical Flow for Video Object Segmentation	Feb 14, 2024	DecoderObject	—Unverified
Moving Object Segmentation in Jittery Videos by Stabilizing Trajectories Modeled in Kendall's Shape Space	Aug 14, 2018	ClusteringObject	—Unverified
MSU-Net: Multiscale Statistical U-Net for Real-time 3D Cardiac MRI Video Segmentation	Sep 15, 2019	SegmentationVideo Segmentation	—Unverified
Multiclass Semantic Video Segmentation With Object-Level Active Inference	Jun 1, 2015	ObjectSegmentation	—Unverified
Multi-class Video Co-segmentation with a Generative Multi-video Model	Jun 1, 2013	SegmentationVideo Segmentation	—Unverified
Multi-Cue Structure Preserving MRF for Unconstrained Video Segmentation	Jun 30, 2015	SegmentationSuperpixels	—Unverified
Multi-Level Representation Learning With Semantic Alignment for Referring Video Object Segmentation	Jan 1, 2022	ObjectReferring Expression Segmentation	—Unverified
Multi-modal Capsule Routing for Actor and Action Video Segmentation Conditioned on Natural Language Queries	Dec 2, 2018	Action LocalizationNatural Language Queries	—Unverified
Multimodal Segmentation for Vocal Tract Modeling	Jun 22, 2024	SegmentationVideo Segmentation	—Unverified
Multi-Object Tracking and Segmentation with a Space-Time Memory Network	Oct 21, 2021	Instance SegmentationMulti-Object Tracking	—Unverified
Multi-person Physics-based Pose Estimation for Combat Sports	Apr 11, 2025	3D Human Pose Estimation3D Multi-Person Pose Estimation	—Unverified
Multiresolution hierarchy co-clustering for semantic segmentation in sequences with small variations	Oct 16, 2015	Boundary DetectionClustering	—Unverified
Multi-stream CNN based Video Semantic Segmentation for Automated Driving	Jan 8, 2019	DecoderSemantic Segmentation	—Unverified
MUNet: Motion Uncertainty-aware Semi-supervised Video Object Segmentation	Nov 29, 2021	ObjectSemantic Segmentation	—Unverified
MUVOD: A Novel Multi-view Video Object Segmentation Dataset and A Benchmark for 3D Segmentation	Jul 10, 2025	NeRFObject	—Unverified
NeuralNetwork-Viterbi: A Framework for Weakly Supervised Video Learning	May 17, 2018	Action SegmentationIncremental Learning	—Unverified
Noisy-LSTM: Improving Temporal Awareness for Video Semantic Segmentation	Oct 19, 2020	Semantic SegmentationVideo Segmentation	—Unverified
No More Shortcuts: Realizing the Potential of Temporal Self-Supervision	Dec 20, 2023	Action ClassificationAttribute	—Unverified
Non-parametric Contextual Relationship Learning for Semantic Video Object Segmentation	Jul 8, 2024	Semantic SegmentationVideo Object Segmentation	—Unverified
Novel adaptation of video segmentation to 3D MRI: efficient zero-shot knee segmentation with SAM2	Aug 8, 2024	Image SegmentationMedical Image Analysis	—Unverified
Novel tile segmentation scheme for omnidirectional video	Mar 10, 2021	Video SegmentationVideo Semantic Segmentation	—Unverified
Object Detection, Tracking, and Motion Segmentation for Object-level Video Segmentation	Aug 10, 2016	Motion SegmentationObject	—Unverified
Object Segmentation Tracking from Generic Video Cues	Oct 5, 2019	ObjectOptical Flow Estimation	—Unverified
Object Segmentation with Audio Context	Jan 4, 2023	audio-visual learningDecoder	—Unverified
OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation	Mar 10, 2025	Pseudo LabelSemantic Segmentation	—Unverified
One-shot Training for Video Object Segmentation	May 22, 2024	ObjectSemantic Segmentation	—Unverified
One-Shot Video Inpainting	Feb 28, 2023	ObjectSegmentation	—Unverified
One-Shot Weakly Supervised Video Object Segmentation	Dec 18, 2019	ObjectSegmentation	—Unverified
OneVOS: Unifying Video Object Segmentation with All-in-One Transformer Framework	Mar 13, 2024	AllManagement	—Unverified
On guiding video object segmentation	Apr 25, 2019	Foreground SegmentationObject	—Unverified
Online Adaptation of Convolutional Neural Networks for Video Object Segmentation	Jun 28, 2017	ObjectSegmentation	—Unverified
Online Reasoning Video Segmentation with Just-in-Time Digital Twins	Mar 27, 2025	Reasoning SegmentationVideo Segmentation	—Unverified
Online Video Object Segmentation via Convolutional Trident Network	Jul 1, 2017	ObjectOptical Flow Estimation	—Unverified
Open-World Skill Discovery from Unsegmented Demonstrations	Mar 11, 2025	Boundary DetectionEvent Segmentation	—Unverified
OVSNet : Towards One-Pass Real-Time Video Object Segmentation	May 24, 2019	Objectobject-detection	—Unverified
Parameter-free Video Segmentation for Vision and Language Understanding	Mar 3, 2025	Question AnsweringVideo Question Answering	—Unverified
Saliency-Aware Geodesic Video Object Segmentation	Jun 1, 2015	ObjectSegmentation	—Unverified
Saliency Detection in Educational Videos: Analyzing the Performance of Current Models, Identifying Limitations and Advancement Directions	Aug 8, 2024	Information RetrievalSaliency Detection	—Unverified
Saliency-Motion Guided Trunk-Collateral Network for Unsupervised Video Object Segmentation	Apr 8, 2025	Optical Flow EstimationSalient Object Detection	—Unverified
SAM2 for Image and Video Segmentation: A Comprehensive Survey	Mar 17, 2025	Autonomous DrivingImage Segmentation	—Unverified
SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation	Aug 8, 2024	DecoderInteractive Segmentation	—Unverified
SANPO: A Scene Understanding, Accessibility and Human Navigation Dataset	Sep 21, 2023	Autonomous VehiclesDepth Estimation	—Unverified
Scalable Video Object Segmentation with Simplified Framework	Aug 19, 2023	ObjectSemantic Segmentation	—Unverified
ScribbleBox: Interactive Annotation Framework for Video Object Segmentation	Aug 22, 2020	ObjectSegmentation	—Unverified

Show:10 25 50

← PrevPage 12 of 18Next →

All datasets Cityscapes val CamVid VSPW LaRS Multispectral Video Semantic Segmentation

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	mIoU	80.3	—	Unverified
2	TDNet-50 [9]	mIoU	79.9	—	Unverified
3	DeltaDist-DDRNet-39	mIoU	79.9	—	Unverified
4	PSPNet-101 [20]	mIoU	79.7	—	Unverified
5	PSPNet-50 [20]	mIoU	78.1	—	Unverified
6	LVS [12]	mIoU	76.8	—	Unverified
7	GRFP [15]	mIoU	73.6	—	Unverified
8	FCN-50 [14]	mIoU	70.1	—	Unverified
9	DFF [22]	mIoU	69.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	Mean IoU	76.5	—	Unverified
2	ETC-MobileNet	Mean IoU	76.3	—	Unverified
3	TDNet-50	Mean IoU	76.2	—	Unverified
4	PSPNet-50	Mean IoU	76	—	Unverified
5	Netwarp	Mean IoU	74.7	—	Unverified
6	GRFP	Mean IoU	67.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DVIS++(VIT-L)	mIoU	63.8	—	Unverified
2	UniVS(Swin-L)	mIoU	59.8	—	Unverified
3	Tube-Link(Swin-large)	mIoU	59.6	—	Unverified
4	MRCFA(MiT-B5)	mIoU	49.9	—	Unverified
5	CFFM(MiT-B5)	mIoU	49.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WaSR-T (ResNet-101)	Q	60.1	—	Unverified
2	TMANet (ResNet-50)	Q	57.5	—	Unverified
3	CSANet (ResNet-101)	Q	49.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MVNet(DeepLabV3)	mIoU	54.52	—	Unverified
2	MVNet(PSPNet)	mIoU	54.36	—	Unverified
3	MVNet(FCN)	mIoU	53.9	—	Unverified