SOTAVerified

Scene Segmentation

Scene segmentation is the task of splitting a scene into its various object components.

Image adapted from Temporally coherent 4D reconstruction of complex dynamic scenes.

Papers

Showing 76-100 of 283 papers

Title | Status | Hype
Elastic Interaction Energy-Informed Real-Time Traffic Scene Perception | - | 0
APNet: Urban-level Scene Segmentation of Aerial Images and Point Clouds | Code | 1
MA-SAM: Modality-agnostic SAM Adaptation for 3D Medical Image Segmentation | Code | 1
Double Domain Guided Real-Time Low-Light Image Enhancement for Ultra-High-Definition Transportation Surveillance | Code | 1
Self-Supervised Pre-Training Boosts Semantic Scene Segmentation on LiDAR Data | Code | 0
FOR-instance: a UAV laser scanning benchmark dataset for semantic and instance segmentation of individual trees | - | 0
Robotic Scene Segmentation with Memory Network for Runtime Surgical Context Inference | Code | 0
MEGA: Multimodal Alignment Aggregation and Distillation For Cinematic Video Segmentation | - | 0
AdaptiveSAM: Towards Efficient Tuning of SAM for Surgical Scene Segmentation | Code | 1
Unmasking Anomalies in Road-Scene Segmentation | Code | 1
Learning Content-enhanced Mask Transformer for Domain Generalized Urban-Scene Segmentation | Code | 0
MOVES: Movable and Moving LiDAR Scene Segmentation in Label-Free settings using Static Reconstruction | - | 0
Cross-CBAM: A Lightweight network for Scene Segmentation | - | 0
VSTAR: A Video-grounded Dialogue Dataset for Situated Semantic Understanding with Scene and Topic Transitions | Code | 1
SSS3D: Fast Neural Architecture Search For Efficient Three-Dimensional Semantic Segmentation | - | 0
SurgicalGPT: End-to-End Language-Vision GPT for Visual Question Answering in Surgery | Code | 1
CROVIA: Seeing Drone Scenes from Car Perspective via Cross-View Adaptation | - | 0
FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding | Code | 0
Self-positioning Point-based Transformer for Point Cloud Understanding | Code | 1
Domain Adaptive Semantic Segmentation by Optimal Transport | - | 0
Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies | Code | 1
Semantic segmentation of surgical hyperspectral images under geometric domain shifts | Code | 0
Neural Implicit Vision-Language Feature Fields | Code | 1
Implicit Ray-Transformers for Multi-view Remote Sensing Image Segmentation | - | 0
Towards Surgical Context Inference and Translation to Gestures | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ICM | Mean IoU | 50.6 | - | Unverified
2 | Index Network | Mean IoU | 33.48 | - | Unverified
3 | DeepLab-LargeFOV | Mean IoU | 32.08 | - | Unverified
4 | SegNet | Mean IoU | 31.84 | - | Unverified
5 | FCN | Mean IoU | 27.39 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | 3DMV | Average Accuracy | 75 | - | Unverified
2 | KPConv | 3DIoU | 68.6 | - | Unverified
3 | PointNet++ | Average Accuracy | 60.2 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Mask2Anomaly | Open-mIoU | 59.8 | - | Unverified
2 | LDN121-RPL | Open-mIoU | 56.3 | - | Unverified
3 | LDN121-DenseHybrid | Open-mIoU | 45.8 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | NeighborNet | AP | 71.9 | - | Unverified
2 | TranS4mer | AP | 60.78 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | UNetFormer | Category mIoU | 67.8 | - | Unverified