Video Semantic Segmentation

The goal of video semantic segmentation is to assign a predefined class to each pixel in all frames of a video. This requires the model not only to predict accurate segmentation masks but also to ensure that these masks remain temporally consistent across frames. This task has broad applications in areas such as autonomous driving, medical video analysis, and AR/VR.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 401–450 of 895 papers

Title	Date	Tasks	Status	Hype
Dual Prototype Attention for Unsupervised Video Object Segmentation	Nov 22, 2022	ObjectSemantic Segmentation	CodeCode Available	1
LVOS: A Benchmark for Long-term Video Object Segmentation	Nov 18, 2022	ObjectSemantic Segmentation	CodeCode Available	1
Robust Online Video Instance Segmentation with Track Queries	Nov 16, 2022	Image SegmentationInstance Segmentation	CodeCode Available	0
Visual Semantic Segmentation Based on Few/Zero-Shot Learning: An Overview	Nov 13, 2022	SegmentationSemantic Segmentation	—Unverified	0
Efficient Unsupervised Video Object Segmentation Network Based on Motion Guidance	Nov 10, 2022	object-detectionObject Detection	—Unverified	0
Generalized Product-of-Experts for Learning Multimodal Representations in Noisy Environments	Nov 7, 2022	3D Hand Pose EstimationHand Pose Estimation	—Unverified	0
Domain Adaptive Video Semantic Segmentation via Cross-Domain Moving Object Mixing	Nov 4, 2022	Domain AdaptationSemantic Segmentation	CodeCode Available	1
Quantifying and Learning Static vs. Dynamic Information in Deep Spatiotemporal Networks	Nov 3, 2022	Action RecognitionInstance Segmentation	—Unverified	0
Two-Level Temporal Relation Model for Online Video Instance Segmentation	Oct 30, 2022	Graph Neural NetworkInstance Segmentation	CodeCode Available	0
Self-supervised Amodal Video Object Segmentation	Oct 23, 2022	ObjectSegmentation	CodeCode Available	0
Decoupling Features in Hierarchical Propagation for Video Object Segmentation	Oct 18, 2022	ObjectSemantic Segmentation	CodeCode Available	2
EISeg: An Efficient Interactive Segmentation Tool based on PaddlePaddle	Oct 17, 2022	Image SegmentationInteractive Segmentation	—Unverified	0
Global Spectral Filter Memory Network for Video Object Segmentation	Oct 11, 2022	AttributeDecoder	CodeCode Available	1
Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders	Oct 9, 2022	Representation LearningSemantic Segmentation	CodeCode Available	1
Motion-inductive Self-supervised Object Discovery in Videos	Oct 1, 2022	ObjectObject Discovery	—Unverified	0
EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations	Sep 26, 2022	ObjectSegmentation	CodeCode Available	1
BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in Video	Sep 25, 2022	Long-tail Video Object SegmentationMulti-Object Tracking	CodeCode Available	1
Multi-modal Segment Assemblage Network for Ad Video Editing with Importance-Coherence Reward	Sep 25, 2022	DecoderVideo Editing	CodeCode Available	1
A Simple and Powerful Global Optimization for Unsupervised Video Object Segmentation	Sep 19, 2022	Clusteringglobal-optimization	CodeCode Available	1
MCIBI++: Soft Mining Contextual Information Beyond Image for Semantic Segmentation	Sep 9, 2022	SegmentationSemantic Segmentation	CodeCode Available	2
Unsupervised Video Object Segmentation via Prototype Memory Network	Sep 8, 2022	ObjectOptical Flow Estimation	CodeCode Available	1
Pixel-Level Equalized Matching for Video Object Segmentation	Sep 4, 2022	ObjectSemantic Segmentation	—Unverified	0
Treating Motion as Option to Reduce Motion Dependency in Unsupervised Video Object Segmentation	Sep 4, 2022	Optical Flow EstimationSemantic Segmentation	CodeCode Available	1
TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut	Sep 1, 2022	Object DiscoverySaliency Detection	—Unverified	0
Hierarchical Reinforcement Learning Based Video Semantic Coding for Segmentation	Aug 24, 2022	Hierarchical Reinforcement Learningreinforcement-learning	—Unverified	0
Visual Subtitle Feature Enhanced Video Outline Generation	Aug 24, 2022	ArticlesHeadline Generation	—Unverified	0
Efficient Heterogeneous Video Segmentation at the Edge	Aug 24, 2022	CPUGPU	—Unverified	0
SWEM: Towards Real-Time Video Object Segmentation with Sequential Weighted Expectation-Maximization	Aug 22, 2022	Semantic SegmentationSemi-Supervised Video Object Segmentation	CodeCode Available	1
Two-Stream Networks for Object Segmentation in Videos	Aug 8, 2022	ObjectRetrieval	—Unverified	0
Per-Clip Video Object Segmentation	Aug 3, 2022	ObjectSegmentation	CodeCode Available	1
BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation	Aug 1, 2022	ObjectOptical Flow Estimation	—Unverified	0
Multi-Attention Network for Compressed Video Referring Object Segmentation	Jul 26, 2022	ObjectReferring Expression Segmentation	CodeCode Available	1
Region Aware Video Object Segmentation with Deep Motion Modeling	Jul 21, 2022	DecoderObject	—Unverified	0
Mining Relations among Cross-Frame Affinities for Video Semantic Segmentation	Jul 21, 2022	Optical Flow EstimationSemantic Segmentation	CodeCode Available	1
Semantic-Aware Fine-Grained Correspondence	Jul 21, 2022	Pose TrackingSelf-Supervised Learning	CodeCode Available	1
In Defense of Online Models for Video Instance Segmentation	Jul 21, 2022	Contrastive LearningInstance Segmentation	CodeCode Available	2
Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations	Jul 18, 2022	object-detectionObject Detection	CodeCode Available	1
Hierarchical Feature Alignment Network for Unsupervised Video Object Segmentation	Jul 18, 2022	ObjectOptical Flow Estimation	CodeCode Available	1
Personalized PCA: Decoupling Shared and Unique Features	Jul 17, 2022	Video SegmentationVideo Semantic Segmentation	CodeCode Available	0
Learning Quality-aware Dynamic Memory for Video Object Segmentation	Jul 16, 2022	SegmentationSemantic Segmentation	CodeCode Available	1
MAC-DO: An Efficient Output-Stationary GEMM Accelerator for CNNs Using DRAM Technology	Jul 16, 2022	speech-recognitionSpeech Recognition	—Unverified	0
Tackling Background Distraction in Video Object Segmentation	Jul 14, 2022	ObjectSemantic Segmentation	CodeCode Available	1
XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model	Jul 14, 2022	2D Human Pose Estimation2D Object Detection	CodeCode Available	3
Domain Adaptive Video Segmentation via Temporal Pseudo Supervision	Jul 6, 2022	SegmentationSemantic Segmentation	CodeCode Available	1
SiamMask: A Framework for Fast Online Object Tracking and Segmentation	Jul 5, 2022	Multiple Object TrackingObject	CodeCode Available	4
Towards Robust Referring Video Object Segmentation with Cyclic Relational Consensus	Jul 4, 2022	Referring Expression SegmentationReferring Video Object Segmentation	CodeCode Available	1
Towards Robust Video Object Segmentation with Adaptive Object Calibration	Jul 2, 2022	ObjectSegmentation	CodeCode Available	1
The Second Place Solution for The 4th Large-scale Video Object Segmentation Challenge--Track 3: Referring Video Object Segmentation	Jun 24, 2022	Objectobject-detection	—Unverified	0
Distortion-Aware Network Pruning and Feature Reuse for Real-time Video Segmentation	Jun 20, 2022	Autonomous DrivingNetwork Pruning	—Unverified	0
5th Place Solution for YouTube-VOS Challenge 2022: Video Object Segmentation	Jun 20, 2022	ObjectSegmentation	—Unverified	0

Show:10 25 50

← PrevPage 9 of 18Next →

All datasets Cityscapes val CamVid VSPW LaRS Multispectral Video Semantic Segmentation

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	mIoU	80.3	—	Unverified
2	DeltaDist-DDRNet-39	mIoU	79.9	—	Unverified
3	TDNet-50 [9]	mIoU	79.9	—	Unverified
4	PSPNet-101 [20]	mIoU	79.7	—	Unverified
5	PSPNet-50 [20]	mIoU	78.1	—	Unverified
6	LVS [12]	mIoU	76.8	—	Unverified
7	GRFP [15]	mIoU	73.6	—	Unverified
8	FCN-50 [14]	mIoU	70.1	—	Unverified
9	DFF [22]	mIoU	69.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	Mean IoU	76.5	—	Unverified
2	ETC-MobileNet	Mean IoU	76.3	—	Unverified
3	TDNet-50	Mean IoU	76.2	—	Unverified
4	PSPNet-50	Mean IoU	76	—	Unverified
5	Netwarp	Mean IoU	74.7	—	Unverified
6	GRFP	Mean IoU	67.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DVIS++(VIT-L)	mIoU	63.8	—	Unverified
2	UniVS(Swin-L)	mIoU	59.8	—	Unverified
3	Tube-Link(Swin-large)	mIoU	59.6	—	Unverified
4	MRCFA(MiT-B5)	mIoU	49.9	—	Unverified
5	CFFM(MiT-B5)	mIoU	49.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WaSR-T (ResNet-101)	Q	60.1	—	Unverified
2	TMANet (ResNet-50)	Q	57.5	—	Unverified
3	CSANet (ResNet-101)	Q	49.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MVNet(DeepLabV3)	mIoU	54.52	—	Unverified
2	MVNet(PSPNet)	mIoU	54.36	—	Unverified
3	MVNet(FCN)	mIoU	53.9	—	Unverified