SOTAVerified

Video Semantic Segmentation

The goal of video semantic segmentation is to assign a predefined class to each pixel in all frames of a video. This requires the model not only to predict accurate segmentation masks but also to ensure that these masks remain temporally consistent across frames. This task has broad applications in areas such as autonomous driving, medical video analysis, and AR/VR.

Papers

Showing 301350 of 895 papers

TitleStatusHype
Event-guided Low-light Video Semantic Segmentation0
Click Carving: Segmenting Objects in Video with Point Clicks0
ClawCraneNet: Leveraging Object-level Relation for Text-based Video Segmentation0
An Empirical Study of Propagation-based Methods for Video Object Segmentation0
EntitySAM: Segment Everything in Video0
End to End Video Segmentation for Driving : Lane Detection For Autonomous Car0
Classifier Based Graph Construction for Video Segmentation0
An Efficient 3D CNN for Action/Object Segmentation in Video0
An Analysis of Data Transformation Effects on Segment Anything 20
Leveraging Motion Information for Better Self-Supervised Video Correspondence Learning0
EISeg: An Efficient Interactive Segmentation Tool based on PaddlePaddle0
Causal Video Object Segmentation From Persistence of Occlusions0
Efficient Video Semantic Segmentation with Labels Propagation and Refinement0
Efficient Video Segmentation Using Parametric Graph Partitioning0
Learning Video Object Segmentation with Visual Memory0
Efficient Video Segmentation Models with Per-frame Inference0
Can Ground Truth Label Propagation from Video help Semantic Segmentation?0
Learning to Track Any Object0
Lifelong Learning Using a Dynamically Growing Tree of Sub-networks for Domain Generalization in Video Object Segmentation0
LSVOS Challenge 3rd Place Report: SAM2 and Cutie based VOS0
Efficient Unsupervised Video Object Segmentation Network Based on Motion Guidance0
CamoSAM2: Motion-Appearance Induced Auto-Refining Prompts for Video Camouflaged Object Detection0
A Machine Learning-based Segmentation Approach for Measuring Similarity between Sign Languages0
Efficient Portrait Matte Creation With Layer Diffusion and Connectivity Priors0
Efficient MRF Energy Propagation for Video Segmentation via Bilateral Filters0
Budget-Aware Deep Semantic Video Segmentation0
Alignment Before Aggregation: Trajectory Memory Retrieval Network for Video Object Segmentation0
6D Pose Estimation on Spoons and Hands0
Efficient Long-Short Temporal Attention Network for Unsupervised Video Object Segmentation0
Efficient Heterogeneous Video Segmentation at the Edge0
1st Place Solution for MOSE Track in CVPR 2024 PVUW Workshop: Complex Video Object Segmentation0
Learning To Segment Dominant Object Motion From Watching Videos0
Learning to Segment Human by Watching YouTube0
Dynamic Video Segmentation Network0
Bringing Background into the Foreground: Making All Classes Equal in Weakly-supervised Video Semantic Segmentation0
Dynamic Face Video Segmentation via Reinforcement Learning0
A Semi-Self-Supervised Approach for Dense-Pattern Video Object Segmentation0
Breaking the "Object" in Video Object Segmentation0
5th Place Solution for YouTube-VOS Challenge 2022: Video Object Segmentation0
Learning to Segment Moving Objects in Videos0
Learning to Segment Referred Objects from Narrated Egocentric Videos0
Breaking The Ice: Video Segmentation for Close-Range Ice-Covered Waters0
Dual Temporal Memory Network for Efficient Video Object Segmentation0
A Fast Two Pass Multi-Value Segmentation Algorithm based on Connected Component Analysis0
Learning to Adapt to Online Streams with Distribution Shifts0
3rd Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation0
Harnessing Vision-Language Pretrained Models with Temporal-Aware Adaptation for Referring Video Object Segmentation0
Learning the What and How of Annotation in Video Object Segmentation0
Learning to Better Segment Objects from Unseen Classes with Unlabeled Videos0
DNN-Driven Compressive Offloading for Edge-Assisted Semantic Video Segmentation0
Show:102550
← PrevPage 7 of 18Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1TMANet-50mIoU80.3Unverified
2TDNet-50 [9]mIoU79.9Unverified
3DeltaDist-DDRNet-39mIoU79.9Unverified
4PSPNet-101 [20]mIoU79.7Unverified
5PSPNet-50 [20]mIoU78.1Unverified
6LVS [12]mIoU76.8Unverified
7GRFP [15]mIoU73.6Unverified
8FCN-50 [14]mIoU70.1Unverified
9DFF [22]mIoU69.2Unverified
#ModelMetricClaimedVerifiedStatus
1TMANet-50Mean IoU76.5Unverified
2ETC-MobileNetMean IoU76.3Unverified
3TDNet-50Mean IoU76.2Unverified
4PSPNet-50Mean IoU76Unverified
5NetwarpMean IoU74.7Unverified
6GRFPMean IoU67.1Unverified
#ModelMetricClaimedVerifiedStatus
1DVIS++(VIT-L)mIoU63.8Unverified
2UniVS(Swin-L)mIoU59.8Unverified
3Tube-Link(Swin-large)mIoU59.6Unverified
4MRCFA(MiT-B5)mIoU49.9Unverified
5CFFM(MiT-B5)mIoU49.3Unverified
#ModelMetricClaimedVerifiedStatus
1WaSR-T (ResNet-101)Q60.1Unverified
2TMANet (ResNet-50)Q57.5Unverified
3CSANet (ResNet-101)Q49.1Unverified
#ModelMetricClaimedVerifiedStatus
1MVNet(DeepLabV3)mIoU54.52Unverified
2MVNet(PSPNet)mIoU54.36Unverified
3MVNet(FCN)mIoU53.9Unverified