Video Semantic Segmentation

The goal of video semantic segmentation is to assign a predefined class to each pixel in all frames of a video. This requires the model not only to predict accurate segmentation masks but also to ensure that these masks remain temporally consistent across frames. This task has broad applications in areas such as autonomous driving, medical video analysis, and AR/VR.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 601–650 of 895 papers

Title	Date	Tasks	Status	Hype
Learning Spatio-Appearance Memory Network for High-Performance Visual Tracking	Sep 21, 2020	Object TrackingSegmentation	CodeCode Available	1
PMVOS: Pixel-Level Matching-Based Video Object Segmentation	Sep 18, 2020	ObjectOne-shot visual object segmentation	—Unverified	0
Ground-truth or DAER: Selective Re-query of Secondary Information	Sep 16, 2020	Object TrackingScene Classification	CodeCode Available	0
LSMVOS: Long-Short-Term Similarity Matching for Video Object	Sep 2, 2020	ObjectOptical Flow Estimation	CodeCode Available	0
Making a Case for 3D Convolutions for Object Segmentation in Videos	Aug 26, 2020	DecoderSegmentation	CodeCode Available	1
ScribbleBox: Interactive Annotation Framework for Video Object Segmentation	Aug 22, 2020	ObjectSegmentation	—Unverified	0
MATNet: Motion-Attentive Transition Network for Zero-Shot Video Object Segmentation	Aug 20, 2020	ObjectSemantic Segmentation	CodeCode Available	1
DyStaB: Unsupervised Object Segmentation via Dynamic-Static Bootstrapping	Aug 16, 2020	Continual LearningObject	—Unverified	0
Monocular Instance Motion Segmentation for Autonomous Driving: KITTI InstanceMotSeg Dataset and Multi-task Baseline	Aug 16, 2020	Autonomous DrivingAutonomous Vehicles	—Unverified	0
Curriculum Learning for Recurrent Video Object Segmentation	Aug 15, 2020	ObjectSemantic Segmentation	CodeCode Available	0
Learning Discriminative Feature with CRF for Unsupervised Video Object Segmentation	Aug 4, 2020	RGB Salient Object DetectionSemantic Segmentation	—Unverified	0
Self-supervised Object Tracking with Cycle-consistent Siamese Networks	Aug 3, 2020	ObjectObject Tracking	CodeCode Available	1
Unsupervised Video Object Segmentation with Joint Hotspot Tracking	Aug 1, 2020	Gaze EstimationObject	—Unverified	0
Self-supervised Motion Representation via Scattering Local Motion Cues	Aug 1, 2020	Action RecognitionOptical Flow Estimation	—Unverified	0
URVOS: Unified Referring Video Object Segmentation Network with a Large-Scale Benchmark	Aug 1, 2020	ObjectOne-shot visual object segmentation	CodeCode Available	1
Memory Selection Network for Video Propagation	Aug 1, 2020	ColorizationSemantic Segmentation	—Unverified	0
Interactive Video Object Segmentation Using Global and Local Transfer Modules	Jul 16, 2020	DecoderInteractive Video Object Segmentation	CodeCode Available	1
Kernelized Memory Network for Video Object Segmentation	Jul 16, 2020	ObjectSemantic Segmentation	CodeCode Available	1
Video Object Segmentation with Episodic Graph Memory Networks	Jul 14, 2020	ObjectSegmentation	CodeCode Available	1
DeU-Net: Deformable U-Net for 3D Cardiac MRI Video Segmentation	Jul 13, 2020	Video SegmentationVideo Semantic Segmentation	—Unverified	0
Fast Video Object Segmentation With Temporal Aggregation Network and Dynamic Template Matching	Jul 11, 2020	ObjectObject Tracking	—Unverified	0
Learning Object Depth from Camera Motion and Video Object Segmentation	Jul 11, 2020	ObjectSegmentation	CodeCode Available	1
Motion Prediction in Visual Object Tracking	Jul 1, 2020	Autonomous Drivingmotion prediction	—Unverified	0
Robust Semantic Segmentation in Adverse Weather Conditions by means of Fast Video-Sequence Segmentation	Jul 1, 2020	Image SegmentationSegmentation	CodeCode Available	1
Self-supervised Video Object Segmentation	Jun 22, 2020	ObjectOne-shot visual object segmentation	CodeCode Available	0
Video Panoptic Segmentation	Jun 19, 2020	Instance SegmentationPanoptic Segmentation	CodeCode Available	1
Video Semantic Segmentation with Distortion-Aware Feature Correction	Jun 18, 2020	Image SegmentationOptical Flow Estimation	CodeCode Available	1
Real-Time Video Inference on Edge Devices via Adaptive Model Streaming	Jun 11, 2020	Knowledge DistillationSemantic Segmentation	CodeCode Available	1
Video Instance Segmentation Tracking With a Modified VAE Architecture	Jun 1, 2020	Instance Segmentationobject-detection	—Unverified	0
D3S - A Discriminative Single Shot Segmentation Tracker	Jun 1, 2020	ObjectObject Tracking	—Unverified	0
Visual-Textual Capsule Routing for Text-Based Video Segmentation	Jun 1, 2020	Action LocalizationReferring Expression Segmentation	—Unverified	0
Temporal Aggregate Representations for Long-Range Video Understanding	Jun 1, 2020	Action AnticipationAction Recognition	CodeCode Available	1
ALBA : Reinforcement Learning for Video Object Segmentation	May 26, 2020	ObjectOne-shot visual object segmentation	CodeCode Available	0
Tamed Warping Network for High-Resolution Semantic Video Segmentation	May 4, 2020	Motion EstimationReal-Time Semantic Segmentation	—Unverified	0
MEDIAPI-SKEL - A 2D-Skeleton Video Database of French Sign Language With Aligned French Subtitles	May 1, 2020	Cross-Modal RetrievalRetrieval	—Unverified	0
Physarum Powered Differentiable Linear Programming Layers and Applications	Apr 30, 2020	Few-Shot LearningMeta-Learning	CodeCode Available	1
Revisiting Sequence-to-Sequence Video Object Segmentation with Multi-Task Loss and Skip-Memory	Apr 25, 2020	DecoderObject	CodeCode Available	0
LSM: Learning Subspace Minimization for Low-level Vision	Apr 20, 2020	Image SegmentationOptical Flow Estimation	—Unverified	0
Fast Template Matching and Update for Video Object Tracking and Segmentation	Apr 16, 2020	Object Trackingreinforcement-learning	CodeCode Available	1
A Transductive Approach for Video Object Segmentation	Apr 15, 2020	Instance SegmentationObject	CodeCode Available	1
Real-Time Segmentation Networks should be Latency Aware	Apr 6, 2020	Autonomous VehiclesScene Segmentation	—Unverified	0
Context Modulated Dynamic Networks for Actor and Action Video Segmentation with Language Queries	Apr 3, 2020	Referring Expression SegmentationVideo Segmentation	—Unverified	0
Temporally Distributed Networks for Fast Video Semantic Segmentation	Apr 3, 2020	Knowledge DistillationReal-Time Semantic Segmentation	CodeCode Available	1
Memory Aggregation Networks for Efficient Interactive Video Object Segmentation	Mar 30, 2020	Interactive Video Object SegmentationObject	—Unverified	0
TapLab: A Fast Framework for Semantic Video Segmentation Tapping into Compressed-Domain Knowledge	Mar 30, 2020	GPUImage Segmentation	CodeCode Available	1
Learning a Weakly-Supervised Video Actor-Action Segmentation Model with a Wise Selection	Mar 29, 2020	Action SegmentationSegmentation	—Unverified	0
Coronary Artery Segmentation in Angiographic Videos Using A 3D-2D CE-Net	Mar 26, 2020	Coronary Artery SegmentationSegmentation	—Unverified	0
Learning What to Learn for Video Object Segmentation	Mar 25, 2020	Few-Shot LearningObject	CodeCode Available	1
Collaborative Video Object Segmentation by Foreground-Background Integration	Mar 18, 2020	ObjectOne-shot visual object segmentation	CodeCode Available	1
Dual Temporal Memory Network for Efficient Video Object Segmentation	Mar 13, 2020	ObjectOne-shot visual object segmentation	—Unverified	0

Show:10 25 50

← PrevPage 13 of 18Next →

All datasets Cityscapes val CamVid VSPW LaRS Multispectral Video Semantic Segmentation

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	mIoU	80.3	—	Unverified
2	TDNet-50 [9]	mIoU	79.9	—	Unverified
3	DeltaDist-DDRNet-39	mIoU	79.9	—	Unverified
4	PSPNet-101 [20]	mIoU	79.7	—	Unverified
5	PSPNet-50 [20]	mIoU	78.1	—	Unverified
6	LVS [12]	mIoU	76.8	—	Unverified
7	GRFP [15]	mIoU	73.6	—	Unverified
8	FCN-50 [14]	mIoU	70.1	—	Unverified
9	DFF [22]	mIoU	69.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	Mean IoU	76.5	—	Unverified
2	ETC-MobileNet	Mean IoU	76.3	—	Unverified
3	TDNet-50	Mean IoU	76.2	—	Unverified
4	PSPNet-50	Mean IoU	76	—	Unverified
5	Netwarp	Mean IoU	74.7	—	Unverified
6	GRFP	Mean IoU	67.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DVIS++(VIT-L)	mIoU	63.8	—	Unverified
2	UniVS(Swin-L)	mIoU	59.8	—	Unverified
3	Tube-Link(Swin-large)	mIoU	59.6	—	Unverified
4	MRCFA(MiT-B5)	mIoU	49.9	—	Unverified
5	CFFM(MiT-B5)	mIoU	49.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WaSR-T (ResNet-101)	Q	60.1	—	Unverified
2	TMANet (ResNet-50)	Q	57.5	—	Unverified
3	CSANet (ResNet-101)	Q	49.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MVNet(DeepLabV3)	mIoU	54.52	—	Unverified
2	MVNet(PSPNet)	mIoU	54.36	—	Unverified
3	MVNet(FCN)	mIoU	53.9	—	Unverified