SOTAVerified

Scene Segmentation

Scene segmentation is the task of splitting a scene into its various object components.

Image adapted from Temporally coherent 4D reconstruction of complex dynamic scenes.

Papers

Showing 150 of 283 papers

TitleStatusHype
No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene SegmentationCode4
BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View RepresentationCode4
Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian SplattingCode2
BEVCar: Camera-Radar Fusion for BEV Map and Object SegmentationCode2
Improving Nighttime Driving-Scene Segmentation via Dual Image-adaptive Learnable FiltersCode2
Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene SegmentationCode2
UNetFormer: A UNet-like Transformer for Efficient Semantic Segmentation of Remote Sensing Urban Scene ImageryCode2
Simplifying Object Segmentation with PixelLib LibraryCode2
ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term TrackingCode1
The Coralscapes Dataset: Semantic Scene Understanding in Coral ReefsCode1
MammAlps: A multi-view video behavior monitoring dataset of wild mammals in the Swiss AlpsCode1
ROAD-Waymo: Action Awareness at Scale for Autonomous DrivingCode1
BitQ: Tailoring Block Floating Point Precision for Improved DNN Efficiency on Resource-Constrained DevicesCode1
Query3D: LLM-Powered Open-Vocabulary Scene Segmentation with Language Embedded 3D GaussianCode1
SMPISD-MTPNet: Scene Semantic Prior-Assisted Infrared Ship Detection Using Multi-Task Perception NetworksCode1
GTA-HDR: A Large-Scale Synthetic Dataset for HDR Image ReconstructionCode1
Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive ReviewCode1
Fourier Prompt Tuning for Modality-Incomplete Scene SegmentationCode1
Neighbor Relations Matter in Video Scene DetectionCode1
The Endoscapes Dataset for Surgical Scene Segmentation, Object Detection, and Critical View of Safety Assessment: Official Splits and BenchmarkCode1
SAMPro3D: Locating SAM Prompts in 3D for Zero-Shot Scene SegmentationCode1
Transferring to Real-World Layouts: A Depth-aware Framework for Scene AdaptationCode1
CD-COCO: A Versatile Complex Distorted COCO Database for Scene-Context-Aware Computer VisionCode1
Uncertainty Estimation for Safety-critical Scene Segmentation via Fine-grained Reward MaximizationCode1
GNeSF: Generalizable Neural Semantic FieldsCode1
APNet: Urban-level Scene Segmentation of Aerial Images and Point CloudsCode1
MA-SAM: Modality-agnostic SAM Adaptation for 3D Medical Image SegmentationCode1
Double Domain Guided Real-Time Low-Light Image Enhancement for Ultra-High-Definition Transportation SurveillanceCode1
AdaptiveSAM: Towards Efficient Tuning of SAM for Surgical Scene SegmentationCode1
Unmasking Anomalies in Road-Scene SegmentationCode1
VSTAR: A Video-grounded Dialogue Dataset for Situated Semantic Understanding with Scene and Topic TransitionsCode1
SurgicalGPT: End-to-End Language-Vision GPT for Visual Question Answering in SurgeryCode1
Self-positioning Point-based Transformer for Point Cloud UnderstandingCode1
Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in MoviesCode1
Neural Implicit Vision-Language Feature FieldsCode1
Paced-Curriculum Distillation with Prediction and Label Uncertainty for Image SegmentationCode1
Uni-3D: A Universal Model for Panoptic 3D Scene ReconstructionCode1
Efficient Movie Scene Detection using State-Space TransformersCode1
Push-the-Boundary: Boundary-aware Feature Propagation for Semantic Segmentation of 3D Point CloudsCode1
Residual Pattern Learning for Pixel-wise Out-of-Distribution Detection in Semantic SegmentationCode1
Unsupervised RGB-to-Thermal Domain Adaptation via Multi-Domain Attention NetworkCode1
Diffusion Unit: Interpretable Edge Enhancement and Suppression Learning for 3D Point Cloud SegmentationCode1
DenseHybrid: Hybrid Anomaly Detection for Dense Open-set RecognitionCode1
IBISCape: A Simulated Benchmark for multi-modal SLAM Systems Evaluation in Large-scale Dynamic EnvironmentsCode1
Rethinking Surgical Instrument Segmentation: A Background Image Can Be All You NeedCode1
Scene Consistency Representation Learning for Video Scene SegmentationCode1
FIFO: Learning Fog-invariant Features for Foggy Scene SegmentationCode1
Exploring Intra- and Inter-Video Relation for Surgical Semantic Scene SegmentationCode1
Test-time Adaptation with Slot-Centric ModelsCode1
Boundary-aware Self-supervised Learning for Video Scene SegmentationCode1
Show:102550
← PrevPage 1 of 6Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ICMMean IoU50.6Unverified
2Index NetworkMean IoU33.48Unverified
3DeepLab-LargeFOVMean IoU32.08Unverified
4SegNetMean IoU31.84Unverified
5FCNMean IoU27.39Unverified
#ModelMetricClaimedVerifiedStatus
13DMVAverage Accuracy75Unverified
2KPConv3DIoU68.6Unverified
3PointNet++Average Accuracy60.2Unverified
#ModelMetricClaimedVerifiedStatus
1Mask2AnomalyOpen-mIoU59.8Unverified
2LDN121-RPLOpen-mIoU56.3Unverified
3LDN121-DenseHybridOpen-mIoU45.8Unverified
#ModelMetricClaimedVerifiedStatus
1NeighborNetAP71.9Unverified
2TranS4merAP60.78Unverified
#ModelMetricClaimedVerifiedStatus
1UNetFormerCategory mIoU67.8Unverified