Video Semantic Segmentation

The goal of video semantic segmentation is to assign a predefined class to each pixel in all frames of a video. This requires the model not only to predict accurate segmentation masks but also to ensure that these masks remain temporally consistent across frames. This task has broad applications in areas such as autonomous driving, medical video analysis, and AR/VR.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 251–275 of 895 papers

Title	Date	Tasks	Status	Hype
DeepPyramid+: Medical Image Segmentation using Pyramid View Fusion and Deformable Pyramid Reception	Dec 6, 2023	Image SegmentationMedical Image Segmentation	—Unverified	0
Efficient Multimodal Semantic Segmentation via Dual-Prompt Learning	Dec 1, 2023	Decoderobject-detection	CodeCode Available	1
SimulFlow: Simultaneously Extracting Feature and Identifying Target for Unsupervised Video Object Segmentation	Nov 30, 2023	Objectobject-detection	—Unverified	0
VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models	Nov 30, 2023	Semantic SegmentationVideo Editing	—Unverified	0
A Simple Video Segmenter by Tracking Objects Along Axial Trajectories	Nov 30, 2023	GPUObject	CodeCode Available	1
Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation	Nov 29, 2023	ClusteringObject	CodeCode Available	1
SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation	Nov 24, 2023	Meta-LearningOne-Shot Segmentation	CodeCode Available	1
Unified Domain Adaptive Semantic Segmentation	Nov 22, 2023	Data AugmentationOptical Flow Estimation	CodeCode Available	1
DatasetNeRF: Efficient 3D-aware Data Factory with Generative Radiance Fields	Nov 18, 2023	DecoderPoint Cloud Segmentation	CodeCode Available	0
Correlation-aware active learning for surgery video segmentation	Nov 15, 2023	Active LearningContrastive Learning	—Unverified	0
Sketch-based Video Object Segmentation: Benchmark and Analysis	Nov 13, 2023	ObjectSegmentation	—Unverified	0
Learning the What and How of Annotation in Video Object Segmentation	Nov 8, 2023	SegmentationSemantic Segmentation	—Unverified	0
ISAR: A Benchmark for Single- and Few-Shot Object Instance Segmentation and Re-Identification	Nov 5, 2023	Instance SegmentationMulti-Object Tracking	—Unverified	0
Concatenated Masked Autoencoders as Spatial-Temporal Learner	Nov 2, 2023	Action RecognitionData Augmentation	CodeCode Available	1
Mask Propagation for Efficient Video Semantic Segmentation	Oct 29, 2023	Semantic SegmentationVideo Semantic Segmentation	CodeCode Available	1
SpVOS: Efficient Video Object Segmentation with Triple Sparse Convolution	Oct 23, 2023	ObjectSemantic Segmentation	—Unverified	0
Putting the Object Back into Video Object Segmentation	Oct 19, 2023	ObjectSegmentation	CodeCode Available	3
Understanding Video Transformers for Segmentation: A Survey of Application and Interpretability	Oct 18, 2023	SegmentationVideo Segmentation	—Unverified	0
Zero-Shot Open-Vocabulary Tracking with Large Pre-Trained Models	Oct 10, 2023	ObjectObject Tracking	—Unverified	0
Sub-token ViT Embedding via Stochastic Resonance Transformers	Oct 6, 2023	Depth EstimationDepth Prediction	CodeCode Available	0
CoralVOS: Dataset and Benchmark for Coral Video Segmentation	Oct 3, 2023	SegmentationSemantic Segmentation	—Unverified	0
SimLVSeg: Simplifying Left Ventricular Segmentation in 2D+Time Echocardiograms with Self- and Weakly-Supervised Learning	Sep 30, 2023	Left Ventricle SegmentationLV Segmentation	CodeCode Available	0
Memory-Efficient Continual Learning Object Segmentation for Long Video	Sep 26, 2023	Continual LearningObject	—Unverified	0
Treating Motion as Option with Output Selection for Unsupervised Video Object Segmentation	Sep 26, 2023	ObjectOptical Flow Estimation	CodeCode Available	1
Adversarial Attacks on Video Object Segmentation with Hard Region Discovery	Sep 25, 2023	Autonomous DrivingObject	—Unverified	0

Show:10 25 50

← PrevPage 11 of 36Next →

All datasets Cityscapes val CamVid VSPW LaRS Multispectral Video Semantic Segmentation

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	mIoU	80.3	—	Unverified
2	TDNet-50 [9]	mIoU	79.9	—	Unverified
3	DeltaDist-DDRNet-39	mIoU	79.9	—	Unverified
4	PSPNet-101 [20]	mIoU	79.7	—	Unverified
5	PSPNet-50 [20]	mIoU	78.1	—	Unverified
6	LVS [12]	mIoU	76.8	—	Unverified
7	GRFP [15]	mIoU	73.6	—	Unverified
8	FCN-50 [14]	mIoU	70.1	—	Unverified
9	DFF [22]	mIoU	69.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	Mean IoU	76.5	—	Unverified
2	ETC-MobileNet	Mean IoU	76.3	—	Unverified
3	TDNet-50	Mean IoU	76.2	—	Unverified
4	PSPNet-50	Mean IoU	76	—	Unverified
5	Netwarp	Mean IoU	74.7	—	Unverified
6	GRFP	Mean IoU	67.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DVIS++(VIT-L)	mIoU	63.8	—	Unverified
2	UniVS(Swin-L)	mIoU	59.8	—	Unverified
3	Tube-Link(Swin-large)	mIoU	59.6	—	Unverified
4	MRCFA(MiT-B5)	mIoU	49.9	—	Unverified
5	CFFM(MiT-B5)	mIoU	49.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WaSR-T (ResNet-101)	Q	60.1	—	Unverified
2	TMANet (ResNet-50)	Q	57.5	—	Unverified
3	CSANet (ResNet-101)	Q	49.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MVNet(DeepLabV3)	mIoU	54.52	—	Unverified
2	MVNet(PSPNet)	mIoU	54.36	—	Unverified
3	MVNet(FCN)	mIoU	53.9	—	Unverified