Video Semantic Segmentation

The goal of video semantic segmentation is to assign a predefined class to each pixel in all frames of a video. This requires the model not only to predict accurate segmentation masks but also to ensure that these masks remain temporally consistent across frames. This task has broad applications in areas such as autonomous driving, medical video analysis, and AR/VR.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 701–750 of 895 papers

Title	Date	Tasks	Status
Memory-Efficient Continual Learning Object Segmentation for Long Video	Sep 26, 2023	Continual LearningObject	—Unverified
Memory Matching is not Enough: Jointly Improving Memory Matching and Decoding for Video Object Segmentation	Sep 22, 2024	Semantic SegmentationSemi-Supervised Video Object Segmentation	—Unverified
Memory Selection Network for Video Propagation	Aug 1, 2020	ColorizationSemantic Segmentation	—Unverified
MeNToS: Tracklets Association with a Space-Time Memory Network	Jul 15, 2021	Instance SegmentationMulti-Object Tracking	—Unverified
Meta Learning with Differentiable Closed-form Solver for Fast Video Object Segmentation	Sep 28, 2019	FormMeta-Learning	—Unverified
Mining Minimal Map-Segments for Visual Place Classifiers	Sep 15, 2019	SegmentationVideo Segmentation	—Unverified
MissionGNN: Hierarchical Multimodal GNN-based Weakly Supervised Video Anomaly Recognition with Mission-Specific Knowledge Graph Generation	Jun 27, 2024	Anomaly DetectionGraph Generation	—Unverified
MobileVOS: Real-Time Video Object Segmentation Contrastive Learning meets Knowledge Distillation	Mar 14, 2023	Contrastive LearningKnowledge Distillation	—Unverified
MoNet: Deep Motion Exploitation for Video Object Segmentation	Jun 1, 2018	ObjectOptical Flow Estimation	—Unverified
MoSAM: Motion-Guided Segment Anything Model with Spatial-Temporal Memory Selection	Apr 30, 2025	Instance SegmentationInteractive Segmentation	—Unverified
Motion-Corrected Moving Average: Including Post-Hoc Temporal Information for Improved Video Segmentation	Mar 5, 2024	Optical Flow EstimationSegmentation	—Unverified
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level	Nov 15, 2024	Benchmarkingcounterfactual	—Unverified
Motion-Guided Cascaded Refinement Network for Video Object Segmentation	Jun 1, 2018	ObjectOptical Flow Estimation	—Unverified
Motion-inductive Self-supervised Object Discovery in Videos	Oct 1, 2022	ObjectObject Discovery	—Unverified
Motion Prediction in Visual Object Tracking	Jul 1, 2020	Autonomous Drivingmotion prediction	—Unverified
Motion-state Alignment for Video Semantic Segmentation	Apr 18, 2023	Semantic SegmentationVideo Semantic Segmentation	—Unverified
Moving Object Proposals with Deep Learned Optical Flow for Video Object Segmentation	Feb 14, 2024	DecoderObject	—Unverified
Moving Object Segmentation in Jittery Videos by Stabilizing Trajectories Modeled in Kendall's Shape Space	Aug 14, 2018	ClusteringObject	—Unverified
MSU-Net: Multiscale Statistical U-Net for Real-time 3D Cardiac MRI Video Segmentation	Sep 15, 2019	SegmentationVideo Segmentation	—Unverified
Multiclass Semantic Video Segmentation With Object-Level Active Inference	Jun 1, 2015	ObjectSegmentation	—Unverified
Multi-class Video Co-segmentation with a Generative Multi-video Model	Jun 1, 2013	SegmentationVideo Segmentation	—Unverified
Multi-Cue Structure Preserving MRF for Unconstrained Video Segmentation	Jun 30, 2015	SegmentationSuperpixels	—Unverified
Multi-Level Representation Learning With Semantic Alignment for Referring Video Object Segmentation	Jan 1, 2022	ObjectReferring Expression Segmentation	—Unverified
Multi-modal Capsule Routing for Actor and Action Video Segmentation Conditioned on Natural Language Queries	Dec 2, 2018	Action LocalizationNatural Language Queries	—Unverified
Multimodal Segmentation for Vocal Tract Modeling	Jun 22, 2024	SegmentationVideo Segmentation	—Unverified
Multi-Object Tracking and Segmentation with a Space-Time Memory Network	Oct 21, 2021	Instance SegmentationMulti-Object Tracking	—Unverified
Multi-person Physics-based Pose Estimation for Combat Sports	Apr 11, 2025	3D Human Pose Estimation3D Multi-Person Pose Estimation	—Unverified
Multiresolution hierarchy co-clustering for semantic segmentation in sequences with small variations	Oct 16, 2015	Boundary DetectionClustering	—Unverified
Multi-stream CNN based Video Semantic Segmentation for Automated Driving	Jan 8, 2019	DecoderSemantic Segmentation	—Unverified
MUNet: Motion Uncertainty-aware Semi-supervised Video Object Segmentation	Nov 29, 2021	ObjectSemantic Segmentation	—Unverified
MUVOD: A Novel Multi-view Video Object Segmentation Dataset and A Benchmark for 3D Segmentation	Jul 10, 2025	NeRFObject	—Unverified
NeuralNetwork-Viterbi: A Framework for Weakly Supervised Video Learning	May 17, 2018	Action SegmentationIncremental Learning	—Unverified
Noisy-LSTM: Improving Temporal Awareness for Video Semantic Segmentation	Oct 19, 2020	Semantic SegmentationVideo Segmentation	—Unverified
No More Shortcuts: Realizing the Potential of Temporal Self-Supervision	Dec 20, 2023	Action ClassificationAttribute	—Unverified
Non-parametric Contextual Relationship Learning for Semantic Video Object Segmentation	Jul 8, 2024	Semantic SegmentationVideo Object Segmentation	—Unverified
Novel adaptation of video segmentation to 3D MRI: efficient zero-shot knee segmentation with SAM2	Aug 8, 2024	Image SegmentationMedical Image Analysis	—Unverified
Novel tile segmentation scheme for omnidirectional video	Mar 10, 2021	Video SegmentationVideo Semantic Segmentation	—Unverified
Object Detection, Tracking, and Motion Segmentation for Object-level Video Segmentation	Aug 10, 2016	Motion SegmentationObject	—Unverified
Object Segmentation Tracking from Generic Video Cues	Oct 5, 2019	ObjectOptical Flow Estimation	—Unverified
Object Segmentation with Audio Context	Jan 4, 2023	audio-visual learningDecoder	—Unverified
OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation	Mar 10, 2025	Pseudo LabelSemantic Segmentation	—Unverified
One-shot Training for Video Object Segmentation	May 22, 2024	ObjectSemantic Segmentation	—Unverified
One-Shot Video Inpainting	Feb 28, 2023	ObjectSegmentation	—Unverified
One-Shot Weakly Supervised Video Object Segmentation	Dec 18, 2019	ObjectSegmentation	—Unverified
OneVOS: Unifying Video Object Segmentation with All-in-One Transformer Framework	Mar 13, 2024	AllManagement	—Unverified
On guiding video object segmentation	Apr 25, 2019	Foreground SegmentationObject	—Unverified
Online Adaptation of Convolutional Neural Networks for Video Object Segmentation	Jun 28, 2017	ObjectSegmentation	—Unverified
Online Reasoning Video Segmentation with Just-in-Time Digital Twins	Mar 27, 2025	Reasoning SegmentationVideo Segmentation	—Unverified
Online Video Object Segmentation via Convolutional Trident Network	Jul 1, 2017	ObjectOptical Flow Estimation	—Unverified
Open-World Skill Discovery from Unsegmented Demonstrations	Mar 11, 2025	Boundary DetectionEvent Segmentation	—Unverified

Show:10 25 50

← PrevPage 15 of 18Next →

All datasets Cityscapes val CamVid VSPW LaRS Multispectral Video Semantic Segmentation

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	mIoU	80.3	—	Unverified
2	TDNet-50 [9]	mIoU	79.9	—	Unverified
3	DeltaDist-DDRNet-39	mIoU	79.9	—	Unverified
4	PSPNet-101 [20]	mIoU	79.7	—	Unverified
5	PSPNet-50 [20]	mIoU	78.1	—	Unverified
6	LVS [12]	mIoU	76.8	—	Unverified
7	GRFP [15]	mIoU	73.6	—	Unverified
8	FCN-50 [14]	mIoU	70.1	—	Unverified
9	DFF [22]	mIoU	69.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	Mean IoU	76.5	—	Unverified
2	ETC-MobileNet	Mean IoU	76.3	—	Unverified
3	TDNet-50	Mean IoU	76.2	—	Unverified
4	PSPNet-50	Mean IoU	76	—	Unverified
5	Netwarp	Mean IoU	74.7	—	Unverified
6	GRFP	Mean IoU	67.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DVIS++(VIT-L)	mIoU	63.8	—	Unverified
2	UniVS(Swin-L)	mIoU	59.8	—	Unverified
3	Tube-Link(Swin-large)	mIoU	59.6	—	Unverified
4	MRCFA(MiT-B5)	mIoU	49.9	—	Unverified
5	CFFM(MiT-B5)	mIoU	49.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WaSR-T (ResNet-101)	Q	60.1	—	Unverified
2	TMANet (ResNet-50)	Q	57.5	—	Unverified
3	CSANet (ResNet-101)	Q	49.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MVNet(DeepLabV3)	mIoU	54.52	—	Unverified
2	MVNet(PSPNet)	mIoU	54.36	—	Unverified
3	MVNet(FCN)	mIoU	53.9	—	Unverified