Video Semantic Segmentation

The goal of video semantic segmentation is to assign a predefined class to each pixel in all frames of a video. This requires the model not only to predict accurate segmentation masks but also to ensure that these masks remain temporally consistent across frames. This task has broad applications in areas such as autonomous driving, medical video analysis, and AR/VR.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 376–400 of 895 papers

Title	Date	Tasks	Status	Hype
TarViS: A Unified Approach for Target-based Video Segmentation	Jan 6, 2023	Instance SegmentationPanoptic Segmentation	CodeCode Available	1
Object Segmentation with Audio Context	Jan 4, 2023	audio-visual learningDecoder	—Unverified	0
SegGPT: Towards Segmenting Everything in Context	Jan 1, 2023	Few-Shot Semantic SegmentationIn-Context Learning	—Unverified	0
Unsupervised Video Object Segmentation with Online Adversarial Self-Tuning	Jan 1, 2023	ObjectPseudo Label	—Unverified	0
Video State-Changing Object Segmentation	Jan 1, 2023	ObjectRepresentation Learning	CodeCode Available	0
Robust Referring Video Object Segmentation with Cyclic Structural Consensus	Jan 1, 2023	ObjectReferring Video Object Segmentation	—Unverified	0
HTML: Hybrid Temporal-scale Multimodal Learning Framework for Referring Video Object Segmentation	Jan 1, 2023	multimodal interactionObject	—Unverified	0
Video Object Segmentation-aware Video Frame Interpolation	Jan 1, 2023	ObjectPose Estimation	CodeCode Available	1
Alignment Before Aggregation: Trajectory Memory Retrieval Network for Video Object Segmentation	Jan 1, 2023	RetrievalSemantic Segmentation	—Unverified	0
Segment Every Reference Object in Spatial and Temporal Spaces	Jan 1, 2023	Image SegmentationObject	—Unverified	0
End-to-End Video Matting With Trimap Propagation	Jan 1, 2023	Image MattingSegmentation	CodeCode Available	1
Multispectral Video Semantic Segmentation: A Benchmark Dataset and Baseline	Jan 1, 2023	SegmentationSemantic Segmentation	CodeCode Available	1
Simultaneously Short- and Long-Term Temporal Modeling for Semi-Supervised Video Semantic Segmentation	Jan 1, 2023	Pseudo LabelSemantic Segmentation	—Unverified	0
Context-Aware Relative Object Queries To Unify Video Instance and Panoptic Segmentation	Jan 1, 2023	Instance SegmentationMulti-Object Tracking	CodeCode Available	1
NewsNet: A Novel Dataset for Hierarchical Temporal Segmentation	Jan 1, 2023	Video SegmentationVideo Semantic Segmentation	CodeCode Available	0
A Class-wise Non-salient Region Generalized Framework for Video Semantic Segmentation	Dec 29, 2022	Domain GeneralizationSegmentation	—Unverified	0
1st Place Solution for YouTubeVOS Challenge 2022: Referring Video Object Segmentation	Dec 27, 2022	ObjectReferring Video Object Segmentation	CodeCode Available	1
Video Segmentation Learning Using Cascade Residual Convolutional Neural Network	Dec 20, 2022	Action RecognitionAnomaly Detection	—Unverified	0
Improving Unsupervised Video Object Segmentation with Motion-Appearance Synergy	Dec 17, 2022	MisconceptionsObject	—Unverified	0
Learning a Fast 3D Spectral Approach to Object Segmentation and Tracking over Space and Time	Dec 15, 2022	ClusteringGPU	—Unverified	0
Look Before You Match: Instance Understanding Matters in Video Object Segmentation	Dec 13, 2022	Instance SegmentationSegmentation	—Unverified	0
Breaking the "Object" in Video Object Segmentation	Dec 12, 2022	ObjectSemantic Segmentation	—Unverified	0
Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene Segmentation	Dec 9, 2022	Multi-Label ClassificationMUlTI-LABEL-ClASSIFICATION	—Unverified	0
Video Object of Interest Segmentation	Dec 6, 2022	DecoderObject	—Unverified	0
Learning to Learn Better for Video Object Segmentation	Dec 5, 2022	Inductive LearningObject	CodeCode Available	1

Show:10 25 50

← PrevPage 16 of 36Next →

All datasets Cityscapes val CamVid VSPW LaRS Multispectral Video Semantic Segmentation

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	mIoU	80.3	—	Unverified
2	TDNet-50 [9]	mIoU	79.9	—	Unverified
3	DeltaDist-DDRNet-39	mIoU	79.9	—	Unverified
4	PSPNet-101 [20]	mIoU	79.7	—	Unverified
5	PSPNet-50 [20]	mIoU	78.1	—	Unverified
6	LVS [12]	mIoU	76.8	—	Unverified
7	GRFP [15]	mIoU	73.6	—	Unverified
8	FCN-50 [14]	mIoU	70.1	—	Unverified
9	DFF [22]	mIoU	69.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	Mean IoU	76.5	—	Unverified
2	ETC-MobileNet	Mean IoU	76.3	—	Unverified
3	TDNet-50	Mean IoU	76.2	—	Unverified
4	PSPNet-50	Mean IoU	76	—	Unverified
5	Netwarp	Mean IoU	74.7	—	Unverified
6	GRFP	Mean IoU	67.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DVIS++(VIT-L)	mIoU	63.8	—	Unverified
2	UniVS(Swin-L)	mIoU	59.8	—	Unverified
3	Tube-Link(Swin-large)	mIoU	59.6	—	Unverified
4	MRCFA(MiT-B5)	mIoU	49.9	—	Unverified
5	CFFM(MiT-B5)	mIoU	49.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WaSR-T (ResNet-101)	Q	60.1	—	Unverified
2	TMANet (ResNet-50)	Q	57.5	—	Unverified
3	CSANet (ResNet-101)	Q	49.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MVNet(DeepLabV3)	mIoU	54.52	—	Unverified
2	MVNet(PSPNet)	mIoU	54.36	—	Unverified
3	MVNet(FCN)	mIoU	53.9	—	Unverified