Video Semantic Segmentation

The goal of video semantic segmentation is to assign a predefined class to each pixel in all frames of a video. This requires the model not only to predict accurate segmentation masks but also to ensure that these masks remain temporally consistent across frames. This task has broad applications in areas such as autonomous driving, medical video analysis, and AR/VR.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 601–650 of 895 papers

Title	Date	Tasks	Status
FODVid: Flow-guided Object Discovery in Videos	Jul 10, 2023	ObjectObject Discovery	—Unverified
FOMTrace: Interactive Video Segmentation By Image Graphs and Fuzzy Object Models	Jun 10, 2016	ObjectObject Tracking	—Unverified
FoodMem: Near Real-time and Precise Food Video Segmentation	Jul 16, 2024	SegmentationSemantic Segmentation	—Unverified
Fully Automated 2D and 3D Convolutional Neural Networks Pipeline for Video Segmentation and Myocardial Infarction Detection in Echocardiography	Mar 26, 2021	Binary ClassificationMyocardial infarction detection	—Unverified
Fully Connected Object Proposals for Video Segmentation	Dec 1, 2015	ObjectSegmentation	—Unverified
Fully Hyperbolic Convolutional Neural Networks	May 24, 2019	Depth EstimationGeneral Classification	—Unverified
Fully Transformer-Equipped Architecture for End-to-End Referring Video Object Segmentation	Sep 21, 2023	ObjectReferring Video Object Segmentation	—Unverified
FusionSeg: Learning to combine motion and appearance for fully automatic segmention of generic objects in videos	Jan 19, 2017	SegmentationStructured Prediction	—Unverified
FusionSeg: Learning to Combine Motion and Appearance for Fully Automatic Segmentation of Generic Objects in Videos	Jul 1, 2017	SegmentationStructured Prediction	—Unverified
FVOS for MOSE Track of 4th PVUW Challenge: 3rd Place Solution	Apr 13, 2025	SegmentationSemantic Segmentation	—Unverified
Gamifying Video Object Segmentation	Jan 5, 2016	Interactive Video Object SegmentationObject	—Unverified
GaussianCut: Interactive segmentation via graph cut for 3D Gaussian Splatting	Nov 12, 2024	3DGSgraph construction	—Unverified
GenDeF: Learning Generative Deformation Field for Video Generation	Dec 7, 2023	DisentanglementVideo Editing	—Unverified
Generalized Product-of-Experts for Learning Multimodal Representations in Noisy Environments	Nov 7, 2022	3D Hand Pose EstimationHand Pose Estimation	—Unverified
Generating Masks from Boxes by Mining Spatio-Temporal Consistencies in Videos	Jan 6, 2021	ObjectSegmentation	—Unverified
Generative Video Propagation	Dec 27, 2024	Image to Video GenerationVideo Generation	—Unverified
Geodesic Distance Histogram Feature for Video Segmentation	Mar 31, 2017	SegmentationSuperpixels	—Unverified
Geometric Algebra Planes: Convex Implicit Neural Volumes	Nov 20, 2024	DecoderVideo Segmentation	—Unverified
Geometric Context from Videos	Oct 25, 2015	SegmentationVideo Segmentation	—Unverified
Global Motion Understanding in Large-Scale Video Object Segmentation	May 11, 2024	Instance SegmentationOptical Flow Estimation	—Unverified
Global Optimality Guarantees for Nonconvex Unsupervised Video Segmentation	Jul 9, 2019	ObjectSegmentation	—Unverified
GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation	Jun 18, 2024	Contrastive LearningObject	—Unverified
Grouping-Based Low-Rank Trajectory Completion and 3D Reconstruction	Dec 1, 2014	3D ReconstructionClustering	—Unverified
Guess What Moves: Unsupervised Video and Image Segmentation by Anticipating Motion	May 16, 2022	Image SegmentationOptical Flow Estimation	—Unverified
HD-EPIC: A Highly-Detailed Egocentric Video Dataset	Feb 6, 2025	Action RecognitionNutrition	—Unverified
Hierarchical interaction network for video object segmentation from referring expressions	Nov 22, 2021	Optical Flow EstimationReferring Expression Segmentation	—Unverified
Hierarchical Reinforcement Learning Based Video Semantic Coding for Segmentation	Aug 24, 2022	Hierarchical Reinforcement Learningreinforcement-learning	—Unverified
Hierarchical Spatiotemporal Transformers for Video Object Segmentation	Jul 17, 2023	Inductive BiasObject	—Unverified
Hierarchical Video Representation with Trajectory Binary Partition Tree	Jun 1, 2013	Video SegmentationVideo Semantic Segmentation	—Unverified
High Fidelity Interactive Video Segmentation Using Tensor Decomposition Boundary Loss Convolutional Tessellations and Context Aware Skip Connections	Nov 23, 2020	Interactive SegmentationSegmentation	—Unverified
Highway Driving Dataset for Semantic Video Segmentation	Nov 2, 2020	Autonomous DrivingImage Segmentation	—Unverified
Tamed Warping Network for High-Resolution Semantic Video Segmentation	May 4, 2020	Motion EstimationReal-Time Semantic Segmentation	—Unverified
HTML: Hybrid Temporal-scale Multimodal Learning Framework for Referring Video Object Segmentation	Jan 1, 2023	multimodal interactionObject	—Unverified
Human Instance Segmentation and Tracking via Data Association and Single-stage Detector	Mar 31, 2022	Human Instance SegmentationInstance Segmentation	—Unverified
Image Segmentation by Uniform Color Clustering Approach and Benchmark Results	Jun 3, 2005	ClusteringImage Retrieval	—Unverified
I-MPN: Inductive Message Passing Network for Efficient Human-in-the-Loop Annotation of Mobile Eye Tracking Data	Jun 10, 2024	NavigateObject	—Unverified
Improved Image Boundaries for Better Video Segmentation	May 12, 2016	graph partitioningSegmentation	—Unverified
Improving Streaming Video Segmentation with Early and Mid-Level Visual Processing	Feb 14, 2014	Motion SegmentationSegmentation	—Unverified
Improving Unsupervised Video Object Segmentation with Motion-Appearance Synergy	Dec 17, 2022	MisconceptionsObject	—Unverified
Improving Unsupervised Video Object Segmentation via Fake Flow Generation	Jul 16, 2024	Objectobject-detection	—Unverified
In defense of OSVOS	Aug 19, 2019	Depth EstimationObject	—Unverified
Instance Embedding Transfer to Unsupervised Video Object Segmentation	Jan 3, 2018	ObjectOptical Flow Estimation	—Unverified
Instance-Level Video Segmentation From Object Tracks	Jun 1, 2016	ClusteringObject	—Unverified
Monocular Instance Motion Segmentation for Autonomous Driving: KITTI InstanceMotSeg Dataset and Multi-task Baseline	Aug 16, 2020	Autonomous DrivingAutonomous Vehicles	—Unverified
Interactive Video Object Segmentation in the Wild	Dec 31, 2017	Image SegmentationInteractive Video Object Segmentation	—Unverified
InterRVOS: Interaction-aware Referring Video Object Segmentation	Jun 3, 2025	8kObject	—Unverified
Investigation of Frame Differences as Motion Cues for Video Object Segmentation	Mar 12, 2025	Optical Flow EstimationSegmentation	—Unverified
ISAR: A Benchmark for Single- and Few-Shot Object Instance Segmentation and Re-Identification	Nov 5, 2023	Instance SegmentationMulti-Object Tracking	—Unverified
ISEC: Iterative over-Segmentation via Edge Clustering	Feb 16, 2018	ClusteringSegmentation	—Unverified
Is SAM 2 Better than SAM in Medical Image Segmentation?	Aug 8, 2024	Image SegmentationMedical Image Segmentation	—Unverified

Show:10 25 50

← PrevPage 13 of 18Next →

All datasets Cityscapes val CamVid VSPW LaRS Multispectral Video Semantic Segmentation

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	mIoU	80.3	—	Unverified
2	TDNet-50 [9]	mIoU	79.9	—	Unverified
3	DeltaDist-DDRNet-39	mIoU	79.9	—	Unverified
4	PSPNet-101 [20]	mIoU	79.7	—	Unverified
5	PSPNet-50 [20]	mIoU	78.1	—	Unverified
6	LVS [12]	mIoU	76.8	—	Unverified
7	GRFP [15]	mIoU	73.6	—	Unverified
8	FCN-50 [14]	mIoU	70.1	—	Unverified
9	DFF [22]	mIoU	69.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	Mean IoU	76.5	—	Unverified
2	ETC-MobileNet	Mean IoU	76.3	—	Unverified
3	TDNet-50	Mean IoU	76.2	—	Unverified
4	PSPNet-50	Mean IoU	76	—	Unverified
5	Netwarp	Mean IoU	74.7	—	Unverified
6	GRFP	Mean IoU	67.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DVIS++(VIT-L)	mIoU	63.8	—	Unverified
2	UniVS(Swin-L)	mIoU	59.8	—	Unverified
3	Tube-Link(Swin-large)	mIoU	59.6	—	Unverified
4	MRCFA(MiT-B5)	mIoU	49.9	—	Unverified
5	CFFM(MiT-B5)	mIoU	49.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WaSR-T (ResNet-101)	Q	60.1	—	Unverified
2	TMANet (ResNet-50)	Q	57.5	—	Unverified
3	CSANet (ResNet-101)	Q	49.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MVNet(DeepLabV3)	mIoU	54.52	—	Unverified
2	MVNet(PSPNet)	mIoU	54.36	—	Unverified
3	MVNet(FCN)	mIoU	53.9	—	Unverified