Video Semantic Segmentation

The goal of video semantic segmentation is to assign a predefined class to each pixel in all frames of a video. This requires the model not only to predict accurate segmentation masks but also to ensure that these masks remain temporally consistent across frames. This task has broad applications in areas such as autonomous driving, medical video analysis, and AR/VR.
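
To make the two requirements above concrete, here is a minimal NumPy sketch, not taken from any paper listed on this page. It scores a clip of label maps with mean IoU (the metric used in most of the benchmark tables) and with a deliberately naive temporal-consistency measure. The function names `mean_iou` and `naive_temporal_consistency` and the toy arrays are illustrative assumptions. Published temporal-consistency metrics usually warp the previous prediction with optical flow before comparing frames, and that step is omitted here.

```python
import numpy as np


def mean_iou(pred, target, num_classes):
    """Mean IoU over a clip; pred and target are int arrays of shape (T, H, W)."""
    ious = []
    for c in range(num_classes):
        p, t = pred == c, target == c
        union = np.logical_or(p, t).sum()
        if union == 0:
            continue  # class absent from both prediction and ground truth
        ious.append(np.logical_and(p, t).sum() / union)
    return float(np.mean(ious)) if ious else float("nan")


def naive_temporal_consistency(pred):
    """Fraction of pixels whose label is unchanged between consecutive frames.

    Assumption: no motion compensation, so this is only meaningful for
    near-static scenes; it is a simplification, not a standard metric.
    """
    if pred.shape[0] < 2:
        return 1.0
    return float((pred[1:] == pred[:-1]).mean())


# Toy clip: 2 frames of 2x2 pixels, 2 classes (hypothetical data).
target = np.array([[[0, 1], [1, 1]],
                   [[0, 1], [1, 1]]])
pred = np.array([[[0, 1], [1, 1]],
                 [[0, 0], [1, 1]]])  # one pixel flips to class 0 in frame 2

print("mIoU:", mean_iou(pred, target, num_classes=2))
print("consistency:", naive_temporal_consistency(pred))
```

In the toy clip a single mislabelled pixel lowers both scores. A model can also score well on per-frame mIoU while flickering between frames, which is why the task asks for both.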

Papers

Showing 701–750 of 895 papers

Title	Date	Tasks	Status
Fast Video Object Segmentation via Mask Transfer Network	Aug 28, 2019	Object, Semantic Segmentation	Unverified
In defense of OSVOS	Aug 19, 2019	Depth Estimation, Object	Unverified
RANet: Ranking Attention Network for Fast Video Object Segmentation	Aug 19, 2019	Decoder, Object	Code Available
An Empirical Study of Propagation-based Methods for Video Object Segmentation	Jul 30, 2019	Object, Semantic Segmentation	Unverified
An Efficient 3D CNN for Action/Object Segmentation in Video	Jul 21, 2019	Action Segmentation, Decoder	Unverified
Global Optimality Guarantees for Nonconvex Unsupervised Video Segmentation	Jul 9, 2019	Object, Segmentation	Unverified
Spacetime Graph Optimization for Video Object Segmentation	Jul 7, 2019	Clustering, Object	Unverified
Dynamic Face Video Segmentation via Reinforcement Learning	Jul 2, 2019	Deep Reinforcement Learning, reinforcement-learning	Unverified
Proposal, Tracking and Segmentation (PTS): A Cascaded Network for Video Object Segmentation	Jul 2, 2019	Object, Object Tracking	Code Available
Key Instance Selection for Unsupervised Video Object Segmentation	Jun 18, 2019	Object, Segmentation	Unverified
Learning Unsupervised Video Object Segmentation Through Visual Attention	Jun 1, 2019	Object, Segmentation	Code Available
SAIL-VOS: Semantic Amodal Instance Level Video Object Segmentation - A Synthetic Dataset and Baselines	Jun 1, 2019	Segmentation, Semantic Segmentation	Unverified
Object Instance Annotation With Deep Extreme Level Set Evolution	Jun 1, 2019	Object, Segmentation	Code Available
OVSNet : Towards One-Pass Real-Time Video Object Segmentation	May 24, 2019	Object, object-detection	Unverified
Fully Hyperbolic Convolutional Neural Networks	May 24, 2019	Depth Estimation, General Classification	Unverified
U-Net Based Multi-instance Video Object Segmentation	May 19, 2019	Instance Segmentation, Object	Unverified
Self-supervised Learning for Video Correspondence Flow	May 2, 2019	Self-Supervised Learning, Semi-Supervised Video Object Segmentation	Code Available
The 2019 DAVIS Challenge on VOS: Unsupervised Multi-Object Segmentation	May 2, 2019	Object, Segmentation	Unverified
On guiding video object segmentation	Apr 25, 2019	Foreground Segmentation, Object	Unverified
Fast User-Guided Video Object Segmentation by Interaction-and-Propagation Networks	Apr 22, 2019	Interactive Video Object Segmentation, Object	Code Available
Video Object Segmentation and Tracking: A Survey	Apr 19, 2019	Autonomous Vehicles, Object	Unverified
Discriminative Online Learning for Fast Video Object Segmentation	Apr 18, 2019	Object, One-shot visual object segmentation	Unverified
MHP-VOS: Multiple Hypotheses Propagation for Video Object Segmentation	Apr 17, 2019	Decision Making, Object	Code Available
VORNet: Spatio-temporally Consistent Video Inpainting for Object Removal	Apr 14, 2019	Image Inpainting, Object	Unverified
MAIN: Multi-Attention Instance Network for Video Segmentation	Apr 11, 2019	One-shot visual object segmentation, Segmentation	Unverified
BoLTVOS: Box-Level Tracking for Video Object Segmentation	Apr 9, 2019	Object, One-shot visual object segmentation	Unverified
Prediction-Tracking-Segmentation	Apr 5, 2019	Prediction, Segmentation	Unverified
Spatiotemporal CNN for Video Object Segmentation	Apr 4, 2019	Object, Segmentation	Code Available
Architecture Search of Dynamic Cells for Semantic Video Segmentation	Apr 4, 2019	GPU, Neural Architecture Search	Unverified
Patchwork: A Patch-wise Attention Network for Efficient Object Detection and Segmentation in Video Streams	Apr 3, 2019	Hard Attention, object-detection	Unverified
Multigrid Predictive Filter Flow for Unsupervised Learning on Videos	Apr 2, 2019	Optical Flow Estimation, Pose Tracking	Code Available
Fast video object segmentation with Spatio-Temporal GANs	Mar 28, 2019	Descriptive, Object	Unverified
BubbleNets: Learning to Select the Guidance Frame in Video Object Segmentation by Deep Sorting Frames	Mar 28, 2019	Segmentation, Semantic Segmentation	Code Available
Rethinking the Evaluation of Video Summaries	Mar 27, 2019	Video Segmentation, Video Semantic Segmentation	Code Available
Value of Temporal Dynamics Information in Driving Scene Segmentation	Mar 21, 2019	Scene Segmentation, Segmentation	Unverified
Learning Correspondence from the Cycle-Consistency of Time	Mar 18, 2019	Optical Flow Estimation, Semantic Segmentation	Code Available
FEELVOS: Fast End-to-End Embedding Learning for Video Object Segmentation	Feb 25, 2019	Object, Segmentation	Code Available
Adaptive Masked Proxies for Few-Shot Segmentation	Feb 19, 2019	Continual Learning, Few-Shot Semantic Segmentation	Code Available
Towards Segmenting Anything That Moves	Feb 11, 2019	Action Detection, Instance Segmentation	Code Available
Multi-stream CNN based Video Semantic Segmentation for Automated Driving	Jan 8, 2019	Decoder, Semantic Segmentation	Unverified
Unsupervised Video Object Segmentation with Distractor-Aware Online Adaptation	Dec 19, 2018	Instance Segmentation, Object	Unverified
End to End Video Segmentation for Driving : Lane Detection For Autonomous Car	Dec 13, 2018	Autonomous Driving, Lane Detection	Unverified
Design Pseudo Ground Truth with Motion Cue for Unsupervised Video Object Segmentation	Dec 13, 2018	Instance Segmentation, Object	Unverified
Meta Learning Deep Visual Words for Fast Video Object Segmentation	Dec 4, 2018	Meta-Learning, Object	Code Available
Multi-modal Capsule Routing for Actor and Action Video Segmentation Conditioned on Natural Language Queries	Dec 2, 2018	Action Localization, Natural Language Queries	Unverified
CCNet: Criss-Cross Attention for Semantic Segmentation	Nov 28, 2018	Computational Efficiency, GPU	Code Available
A Generative Appearance Model for End-to-end Video Object Segmentation	Nov 28, 2018	GPU, One-shot visual object segmentation	Code Available
Complementary Segmentation of Primary Video Objects with Reversible Flows	Nov 23, 2018	Superpixels, Video Semantic Segmentation	Unverified
Creatures great and SMAL: Recovering the shape and motion of animals from video	Nov 14, 2018	Video Segmentation, Video Semantic Segmentation	Unverified
Unsupervised RGBD Video Object Segmentation Using GANs	Nov 5, 2018	Object, Segmentation	Unverified


Datasets: Cityscapes val, CamVid, VSPW, LaRS, Multispectral Video Semantic Segmentation

Benchmark Results

Dataset: Cityscapes val
#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	mIoU	80.3	—	Unverified
2	TDNet-50 [9]	mIoU	79.9	—	Unverified
3	DeltaDist-DDRNet-39	mIoU	79.9	—	Unverified
4	PSPNet-101 [20]	mIoU	79.7	—	Unverified
5	PSPNet-50 [20]	mIoU	78.1	—	Unverified
6	LVS [12]	mIoU	76.8	—	Unverified
7	GRFP [15]	mIoU	73.6	—	Unverified
8	FCN-50 [14]	mIoU	70.1	—	Unverified
9	DFF [22]	mIoU	69.2	—	Unverified

Dataset: CamVid
#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	Mean IoU	76.5	—	Unverified
2	ETC-MobileNet	Mean IoU	76.3	—	Unverified
3	TDNet-50	Mean IoU	76.2	—	Unverified
4	PSPNet-50	Mean IoU	76	—	Unverified
5	Netwarp	Mean IoU	74.7	—	Unverified
6	GRFP	Mean IoU	67.1	—	Unverified

Dataset: VSPW
#	Model	Metric	Claimed	Verified	Status
1	DVIS++(VIT-L)	mIoU	63.8	—	Unverified
2	UniVS(Swin-L)	mIoU	59.8	—	Unverified
3	Tube-Link(Swin-large)	mIoU	59.6	—	Unverified
4	MRCFA(MiT-B5)	mIoU	49.9	—	Unverified
5	CFFM(MiT-B5)	mIoU	49.3	—	Unverified

Dataset: LaRS
#	Model	Metric	Claimed	Verified	Status
1	WaSR-T (ResNet-101)	Q	60.1	—	Unverified
2	TMANet (ResNet-50)	Q	57.5	—	Unverified
3	CSANet (ResNet-101)	Q	49.1	—	Unverified

Dataset: Multispectral Video Semantic Segmentation
#	Model	Metric	Claimed	Verified	Status
1	MVNet(DeepLabV3)	mIoU	54.52	—	Unverified
2	MVNet(PSPNet)	mIoU	54.36	—	Unverified
3	MVNet(FCN)	mIoU	53.9	—	Unverified