Video Semantic Segmentation

The goal of video semantic segmentation is to assign a predefined class to each pixel in all frames of a video. This requires the model not only to predict accurate segmentation masks but also to ensure that these masks remain temporally consistent across frames. This task has broad applications in areas such as autonomous driving, medical video analysis, and AR/VR.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 651–700 of 895 papers

Title	Date	Tasks	Status	Hype
Learning Video Object Segmentation from Unlabeled Videos	Mar 10, 2020	ObjectRepresentation Learning	CodeCode Available	1
Motion-Attentive Transition for Zero-Shot Video Object Segmentation	Mar 9, 2020	DecoderObject	CodeCode Available	1
State-Aware Tracker for Real-Time Video Object Segmentation	Mar 1, 2020	SegmentationSemantic Segmentation	CodeCode Available	1
Learning Fast and Robust Target Models for Video Object Segmentation	Feb 27, 2020	One-shot visual object segmentationSegmentation	CodeCode Available	1
Unsupervised Temporal Video Segmentation as an Auxiliary Task for Predicting the Remaining Surgery Duration	Feb 26, 2020	Auxiliary LearningSurgical phase recognition	—Unverified	0
Efficient Semantic Video Segmentation with Per-frame Inference	Feb 26, 2020	Knowledge DistillationOptical Flow Estimation	CodeCode Available	1
MAST: A Memory-Augmented Self-supervised Tracker	Feb 18, 2020	Semantic SegmentationSemi-Supervised Video Object Segmentation	CodeCode Available	1
Directional Deep Embedding and Appearance Learning for Fast Video Object Segmentation	Feb 17, 2020	GPUOne-shot visual object segmentation	CodeCode Available	1
CRVOS: Clue Refining Network for Video Object Segmentation	Feb 10, 2020	DecoderObject	CodeCode Available	0
Fast Video Object Segmentation using the Global Context Module	Jan 30, 2020	ObjectSegmentation	CodeCode Available	1
Weakly Supervised Few-shot Object Segmentation using Co-Attention with Visual and Semantic Embeddings	Jan 26, 2020	Few-Shot LearningObject	—Unverified	0
Zero-Shot Video Object Segmentation via Attentive Graph Neural Networks	Jan 19, 2020	Graph Neural NetworkSegmentation	CodeCode Available	1
See More, Know More: Unsupervised Video Object Segmentation with Co-Attention Siamese Networks	Jan 19, 2020	Semantic SegmentationUnsupervised Video Object Segmentation	CodeCode Available	1
UnOVOST: Unsupervised Offline Video Object Segmentation and Tracking	Jan 15, 2020	ObjectSegmentation	CodeCode Available	1
Efficient Video Semantic Segmentation with Labels Propagation and Refinement	Dec 26, 2019	CPUGPU	—Unverified	0
One-Shot Weakly Supervised Video Object Segmentation	Dec 18, 2019	ObjectSegmentation	—Unverified	0
Symmetric block-low-rank layers for fully reversible multilevel neural networks	Dec 14, 2019	Video SegmentationVideo Semantic Segmentation	—Unverified	0
Automatic Video Object Segmentation via Motion-Appearance-Stream Fusion and Instance-aware Segmentation	Dec 3, 2019	Foreground SegmentationInstance Segmentation	—Unverified	0
Every Frame Counts: Joint Learning of Video Segmentation and Optical Flow	Nov 28, 2019	Optical Flow EstimationSegmentation	—Unverified	0
D3S -- A Discriminative Single Shot Segmentation Tracker	Nov 20, 2019	ObjectObject Tracking	CodeCode Available	0
Sequential image processing methods for improving semantic video segmentation algorithms	Oct 29, 2019	Autonomous DrivingObject	—Unverified	0
Learning to Track Any Object	Oct 25, 2019	Instance SegmentationObject	—Unverified	0
Anchor Diffusion for Unsupervised Video Object Segmentation	Oct 24, 2019	Image SegmentationObject	CodeCode Available	0
Object Segmentation Tracking from Generic Video Cues	Oct 5, 2019	ObjectOptical Flow Estimation	—Unverified	0
SegEQA: Video Segmentation Based Visual Attention for Embodied Question Answering	Oct 1, 2019	Embodied Question AnsweringQuestion Answering	—Unverified	0
AGSS-VOS: Attention Guided Single-Shot Video Object Segmentation	Oct 1, 2019	DecoderObject	CodeCode Available	0
AdvIT: Adversarial Frames Identifier Based on Temporal Consistency in Videos	Oct 1, 2019	Action RecognitionAutonomous Driving	—Unverified	0
Asymmetric Cross-Guided Attention Network for Actor and Action Video Segmentation From Natural Language Query	Oct 1, 2019	Referring Expression SegmentationSegmentation	CodeCode Available	0
Fast Video Object Segmentation via Dynamic Targeting Network	Oct 1, 2019	ObjectSegmentation	—Unverified	0
Towards Good Practices for Video Object Segmentation	Sep 30, 2019	BIG-bench Machine LearningObject	—Unverified	0
CapsuleVOS: Semi-Supervised Video Object Segmentation Using Capsule Routing	Sep 30, 2019	ObjectOne-shot visual object segmentation	CodeCode Available	0
LIP: Learning Instance Propagation for Video Object Segmentation	Sep 30, 2019	Data AugmentationInstance Segmentation	—Unverified	0
RPM-Net: Robust Pixel-Level Matching Networks for Self-Supervised Video Object Segmentation	Sep 29, 2019	ObjectSegmentation	—Unverified	0
Weakly Supervised Energy-Based Learning for Action Segmentation	Sep 28, 2019	Action SegmentationSegmentation	CodeCode Available	0
Meta Learning with Differentiable Closed-form Solver for Fast Video Object Segmentation	Sep 28, 2019	FormMeta-Learning	—Unverified	0
DMM-Net: Differentiable Mask-Matching Network for Video Object Segmentation	Sep 27, 2019	ObjectOne-shot visual object segmentation	CodeCode Available	0
Adaptive ROI Generation for Video Object Segmentation Using Reinforcement Learning	Sep 27, 2019	reinforcement-learningReinforcement Learning	CodeCode Available	0
Mining Minimal Map-Segments for Visual Place Classifiers	Sep 15, 2019	SegmentationVideo Segmentation	—Unverified	0
MSU-Net: Multiscale Statistical U-Net for Real-time 3D Cardiac MRI Video Segmentation	Sep 15, 2019	SegmentationVideo Segmentation	—Unverified	0
Exploiting Temporality for Semi-Supervised Video Segmentation	Aug 29, 2019	DecoderImage Segmentation	CodeCode Available	0
Fast Video Object Segmentation via Mask Transfer Network	Aug 28, 2019	ObjectSemantic Segmentation	—Unverified	0
In defense of OSVOS	Aug 19, 2019	Depth EstimationObject	—Unverified	0
RANet: Ranking Attention Network for Fast Video Object Segmentation	Aug 19, 2019	DecoderObject	CodeCode Available	0
An Empirical Study of Propagation-based Methods for Video Object Segmentation	Jul 30, 2019	ObjectSemantic Segmentation	—Unverified	0
An Efficient 3D CNN for Action/Object Segmentation in Video	Jul 21, 2019	Action SegmentationDecoder	—Unverified	0
Separable Convolutional LSTMs for Faster Video Segmentation	Jul 16, 2019	GPUImage Segmentation	CodeCode Available	1
Global Optimality Guarantees for Nonconvex Unsupervised Video Segmentation	Jul 9, 2019	ObjectSegmentation	—Unverified	0
Spacetime Graph Optimization for Video Object Segmentation	Jul 7, 2019	ClusteringObject	—Unverified	0
Proposal, Tracking and Segmentation (PTS): A Cascaded Network for Video Object Segmentation	Jul 2, 2019	ObjectObject Tracking	CodeCode Available	0
Dynamic Face Video Segmentation via Reinforcement Learning	Jul 2, 2019	Deep Reinforcement Learningreinforcement-learning	—Unverified	0

Show:10 25 50

← PrevPage 14 of 18Next →

All datasets Cityscapes val CamVid VSPW LaRS Multispectral Video Semantic Segmentation

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	mIoU	80.3	—	Unverified
2	TDNet-50 [9]	mIoU	79.9	—	Unverified
3	DeltaDist-DDRNet-39	mIoU	79.9	—	Unverified
4	PSPNet-101 [20]	mIoU	79.7	—	Unverified
5	PSPNet-50 [20]	mIoU	78.1	—	Unverified
6	LVS [12]	mIoU	76.8	—	Unverified
7	GRFP [15]	mIoU	73.6	—	Unverified
8	FCN-50 [14]	mIoU	70.1	—	Unverified
9	DFF [22]	mIoU	69.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	Mean IoU	76.5	—	Unverified
2	ETC-MobileNet	Mean IoU	76.3	—	Unverified
3	TDNet-50	Mean IoU	76.2	—	Unverified
4	PSPNet-50	Mean IoU	76	—	Unverified
5	Netwarp	Mean IoU	74.7	—	Unverified
6	GRFP	Mean IoU	67.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DVIS++(VIT-L)	mIoU	63.8	—	Unverified
2	UniVS(Swin-L)	mIoU	59.8	—	Unverified
3	Tube-Link(Swin-large)	mIoU	59.6	—	Unverified
4	MRCFA(MiT-B5)	mIoU	49.9	—	Unverified
5	CFFM(MiT-B5)	mIoU	49.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WaSR-T (ResNet-101)	Q	60.1	—	Unverified
2	TMANet (ResNet-50)	Q	57.5	—	Unverified
3	CSANet (ResNet-101)	Q	49.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MVNet(DeepLabV3)	mIoU	54.52	—	Unverified
2	MVNet(PSPNet)	mIoU	54.36	—	Unverified
3	MVNet(FCN)	mIoU	53.9	—	Unverified