Video Semantic Segmentation

The goal of video semantic segmentation is to assign a predefined class to each pixel in all frames of a video. This requires the model not only to predict accurate segmentation masks but also to ensure that these masks remain temporally consistent across frames. This task has broad applications in areas such as autonomous driving, medical video analysis, and AR/VR.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 501–550 of 895 papers

Title	Date	Tasks	Status
Robust and Efficient Memory Network for Video Object Segmentation	Apr 24, 2023	ObjectSemantic Segmentation	—Unverified
Automatic Interaction and Activity Recognition from Videos of Human Manual Demonstrations with Application to Anomaly Detection	Apr 19, 2023	Activity RecognitionAnomaly Detection	—Unverified
Motion-state Alignment for Video Semantic Segmentation	Apr 18, 2023	Semantic SegmentationVideo Semantic Segmentation	—Unverified
MED-VT++: Unifying Multimodal Learning with a Multiscale Encoder-Decoder Video Transformer	Apr 12, 2023	Action SegmentationDecoder	—Unverified
Video-kMaX: A Simple Unified Approach for Online and Near-Online Video Panoptic Segmentation	Apr 10, 2023	Panoptic SegmentationScene Understanding	—Unverified
CLVOS23: A Long Video Object Segmentation Dataset for Continual Learning	Apr 9, 2023	Continual LearningSemantic Segmentation	CodeCode Available
Co-attention Propagation Network for Zero-Shot Video Object Segmentation	Apr 8, 2023	DecoderOptical Flow Estimation	CodeCode Available
Spatio-Temporal Pixel-Level Contrastive Learning-based Source-Free Domain Adaptation for Video Semantic Segmentation	Mar 25, 2023	Contrastive LearningDomain Adaptation	CodeCode Available
Unified Mask Embedding and Correspondence Learning for Self-Supervised Video Segmentation	Mar 17, 2023	SegmentationSelf-Supervised Learning	CodeCode Available
MobileVOS: Real-Time Video Object Segmentation Contrastive Learning meets Knowledge Distillation	Mar 14, 2023	Contrastive LearningKnowledge Distillation	—Unverified
A Threefold Review on Deep Semantic Segmentation: Efficiency-oriented, Temporal and Depth-aware design	Mar 8, 2023	Autonomous DrivingAutonomous Vehicles	—Unverified
Tsanet: Temporal and Scale Alignment for Unsupervised Video Object Segmentation	Mar 8, 2023	DecoderObject	—Unverified
Learning to Adapt to Online Streams with Distribution Shifts	Mar 2, 2023	BenchmarkingMeta-Learning	—Unverified
One-Shot Video Inpainting	Feb 28, 2023	ObjectSegmentation	—Unverified
Approximating DTW with a convolutional neural network on EEG data	Jan 30, 2023	Anomaly DetectionComputational Efficiency	—Unverified
Maximal Cliques on Multi-Frame Proposal Graph for Unsupervised Video Object Segmentation	Jan 29, 2023	Instance SegmentationObject	—Unverified
Flow-guided Semi-supervised Video Object Segmentation	Jan 25, 2023	DecoderObject	—Unverified
A Comprehensive Review of Modern Object Segmentation Approaches	Jan 13, 2023	Image SegmentationObject	—Unverified
Video Semantic Segmentation with Inter-Frame Feature Fusion and Inner-Frame Feature Refinement	Jan 10, 2023	Optical Flow EstimationSemantic Segmentation	CodeCode Available
Object Segmentation with Audio Context	Jan 4, 2023	audio-visual learningDecoder	—Unverified
Robust Referring Video Object Segmentation with Cyclic Structural Consensus	Jan 1, 2023	ObjectReferring Video Object Segmentation	—Unverified
NewsNet: A Novel Dataset for Hierarchical Temporal Segmentation	Jan 1, 2023	Video SegmentationVideo Semantic Segmentation	CodeCode Available
Simultaneously Short- and Long-Term Temporal Modeling for Semi-Supervised Video Semantic Segmentation	Jan 1, 2023	Pseudo LabelSemantic Segmentation	—Unverified
HTML: Hybrid Temporal-scale Multimodal Learning Framework for Referring Video Object Segmentation	Jan 1, 2023	multimodal interactionObject	—Unverified
Alignment Before Aggregation: Trajectory Memory Retrieval Network for Video Object Segmentation	Jan 1, 2023	RetrievalSemantic Segmentation	—Unverified
Unsupervised Video Object Segmentation with Online Adversarial Self-Tuning	Jan 1, 2023	ObjectPseudo Label	—Unverified
Segment Every Reference Object in Spatial and Temporal Spaces	Jan 1, 2023	Image SegmentationObject	—Unverified
Video State-Changing Object Segmentation	Jan 1, 2023	ObjectRepresentation Learning	CodeCode Available
SegGPT: Towards Segmenting Everything in Context	Jan 1, 2023	Few-Shot Semantic SegmentationIn-Context Learning	—Unverified
A Class-wise Non-salient Region Generalized Framework for Video Semantic Segmentation	Dec 29, 2022	Domain GeneralizationSegmentation	—Unverified
Video Segmentation Learning Using Cascade Residual Convolutional Neural Network	Dec 20, 2022	Action RecognitionAnomaly Detection	—Unverified
Improving Unsupervised Video Object Segmentation with Motion-Appearance Synergy	Dec 17, 2022	MisconceptionsObject	—Unverified
Learning a Fast 3D Spectral Approach to Object Segmentation and Tracking over Space and Time	Dec 15, 2022	ClusteringGPU	—Unverified
Look Before You Match: Instance Understanding Matters in Video Object Segmentation	Dec 13, 2022	Instance SegmentationSegmentation	—Unverified
Breaking the "Object" in Video Object Segmentation	Dec 12, 2022	ObjectSemantic Segmentation	—Unverified
Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene Segmentation	Dec 9, 2022	Multi-Label ClassificationMUlTI-LABEL-ClASSIFICATION	—Unverified
Video Object of Interest Segmentation	Dec 6, 2022	DecoderObject	—Unverified
Robust Online Video Instance Segmentation with Track Queries	Nov 16, 2022	Image SegmentationInstance Segmentation	CodeCode Available
Visual Semantic Segmentation Based on Few/Zero-Shot Learning: An Overview	Nov 13, 2022	SegmentationSemantic Segmentation	—Unverified
Efficient Unsupervised Video Object Segmentation Network Based on Motion Guidance	Nov 10, 2022	object-detectionObject Detection	—Unverified
Generalized Product-of-Experts for Learning Multimodal Representations in Noisy Environments	Nov 7, 2022	3D Hand Pose EstimationHand Pose Estimation	—Unverified
Quantifying and Learning Static vs. Dynamic Information in Deep Spatiotemporal Networks	Nov 3, 2022	Action RecognitionInstance Segmentation	—Unverified
Two-Level Temporal Relation Model for Online Video Instance Segmentation	Oct 30, 2022	Graph Neural NetworkInstance Segmentation	CodeCode Available
Self-supervised Amodal Video Object Segmentation	Oct 23, 2022	ObjectSegmentation	CodeCode Available
EISeg: An Efficient Interactive Segmentation Tool based on PaddlePaddle	Oct 17, 2022	Image SegmentationInteractive Segmentation	—Unverified
Motion-inductive Self-supervised Object Discovery in Videos	Oct 1, 2022	ObjectObject Discovery	—Unverified
Pixel-Level Equalized Matching for Video Object Segmentation	Sep 4, 2022	ObjectSemantic Segmentation	—Unverified
TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut	Sep 1, 2022	Object DiscoverySaliency Detection	—Unverified
Hierarchical Reinforcement Learning Based Video Semantic Coding for Segmentation	Aug 24, 2022	Hierarchical Reinforcement Learningreinforcement-learning	—Unverified
Efficient Heterogeneous Video Segmentation at the Edge	Aug 24, 2022	CPUGPU	—Unverified

Show:10 25 50

← PrevPage 11 of 18Next →

All datasets Cityscapes val CamVid VSPW LaRS Multispectral Video Semantic Segmentation

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	mIoU	80.3	—	Unverified
2	TDNet-50 [9]	mIoU	79.9	—	Unverified
3	DeltaDist-DDRNet-39	mIoU	79.9	—	Unverified
4	PSPNet-101 [20]	mIoU	79.7	—	Unverified
5	PSPNet-50 [20]	mIoU	78.1	—	Unverified
6	LVS [12]	mIoU	76.8	—	Unverified
7	GRFP [15]	mIoU	73.6	—	Unverified
8	FCN-50 [14]	mIoU	70.1	—	Unverified
9	DFF [22]	mIoU	69.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	Mean IoU	76.5	—	Unverified
2	ETC-MobileNet	Mean IoU	76.3	—	Unverified
3	TDNet-50	Mean IoU	76.2	—	Unverified
4	PSPNet-50	Mean IoU	76	—	Unverified
5	Netwarp	Mean IoU	74.7	—	Unverified
6	GRFP	Mean IoU	67.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DVIS++(VIT-L)	mIoU	63.8	—	Unverified
2	UniVS(Swin-L)	mIoU	59.8	—	Unverified
3	Tube-Link(Swin-large)	mIoU	59.6	—	Unverified
4	MRCFA(MiT-B5)	mIoU	49.9	—	Unverified
5	CFFM(MiT-B5)	mIoU	49.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WaSR-T (ResNet-101)	Q	60.1	—	Unverified
2	TMANet (ResNet-50)	Q	57.5	—	Unverified
3	CSANet (ResNet-101)	Q	49.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MVNet(DeepLabV3)	mIoU	54.52	—	Unverified
2	MVNet(PSPNet)	mIoU	54.36	—	Unverified
3	MVNet(FCN)	mIoU	53.9	—	Unverified