SOTAVerified

Semantic Segmentation

Papers

Showing 32513300 of 14763 papers

TitleStatusHype
Long-Tailed Classification by Keeping the Good and Removing the Bad Momentum Causal EffectCode1
Deep Multimodal Fusion by Channel ExchangingCode1
Cylinder3D: An Effective 3D Framework for Driving-scene LiDAR Semantic SegmentationCode1
Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR-based PerceptionCode1
Hybrid guiding: A multi-resolution refinement approach for semantic segmentation of gigapixel histopathological imagesCode1
Hybrid Open-set Segmentation with Synthetic Negative DataCode1
HybridMIM: A Hybrid Masked Image Modeling Framework for 3D Medical Image SegmentationCode1
D2A U-Net: Automatic Segmentation of COVID-19 Lesions from CT Slices with Dilated Convolution and Dual Attention MechanismCode1
D^2Conv3D: Dynamic Dilated Convolutions for Object Segmentation in VideosCode1
D2Conv3D: Dynamic Dilated Convolutions for Object Segmentation in VideosCode1
D2Det: Towards High Quality Object Detection and Instance SegmentationCode1
LoG-VMamba: Local-Global Vision Mamba for Medical Image SegmentationCode1
Long-tailed Distribution AdaptationCode1
HyperionSolarNet: Solar Panel Detection from Aerial ImagesCode1
D3RM: A Discrete Denoising Diffusion Refinement Model for Piano TranscriptionCode1
Rethinking Decoder Design: Improving Biomarker Segmentation Using Depth-to-Space Restoration and Residual Linear AttentionCode1
Looking Outside the Window: Wide-Context Transformer for the Semantic Segmentation of High-Resolution Remote Sensing ImagesCode1
DAAIN: Detection of Anomalous and Adversarial Input using Normalizing FlowsCode1
Deep Metric Learning for Open World Semantic SegmentationCode1
BuildingNet: Learning to Label 3D BuildingsCode1
Local Temperature Scaling for Probability CalibrationCode1
DACS: Domain Adaptation via Cross-domain Mixed SamplingCode1
DeepMIM: Deep Supervision for Masked Image ModelingCode1
Rethinking Local Perception in Lightweight Vision TransformerCode1
Deeply supervised salient object detection with short connectionsCode1
Locally Enhanced Self-Attention: Combining Self-Attention and Convolution as Local and Context TermsCode1
Location-Sensitive Visual Recognition with Cross-IOU LossCode1
DAFormer: Improving Network Architectures and Training Strategies for Domain-Adaptive Semantic SegmentationCode1
Building Extraction from Remote Sensing Images via an Uncertainty-Aware NetworkCode1
Local-Global Context Aware Transformer for Language-Guided Video SegmentationCode1
Revisiting the Encoding of Satellite Image Time SeriesCode1
Local Intensity Order Transformation for Robust Curvilinear Object SegmentationCode1
Deep-learning in the bioimaging wild: Handling ambiguous data with deepflash2Code1
Zero-Shot Semantic SegmentationCode1
Retina U-Net: Embarrassingly Simple Exploitation of Segmentation Supervision for Medical Object DetectionCode1
RE-TRIP : Reflectivity Instance Augmented Triangle Descriptor for 3D Place RecognitionCode1
LM-Net: A Light-weight and Multi-scale Network for Medical Image SegmentationCode1
LogoSP: Local-global Grouping of Superpoints for Unsupervised Semantic Segmentation of 3D Point CloudsCode1
IFSeg: Image-free Semantic Segmentation via Vision-Language ModelCode1
Look-into-Object: Self-supervised Structure Modeling for Object RecognitionCode1
LVIS: A Dataset for Large Vocabulary Instance SegmentationCode1
Building a Strong Pre-Training Baseline for Universal 3D Large-Scale PerceptionCode1
LiVOS: Light Video Object Segmentation with Gated Linear MatchingCode1
Illumination-based Transformations Improve Skin Lesion Segmentation in Dermoscopic ImagesCode1
LIVECell—A large-scale dataset for label-free live cell segmentationCode1
Lite Vision Transformer with Enhanced Self-AttentionCode1
Livelayer: A Semi-Automatic Software Program for Segmentation of Layers and Diabetic Macular Edema in Optical Coherence Tomography ImagesCode1
Image-Based Relocalization and Alignment for Long-Term Monitoring of Dynamic Underwater EnvironmentsCode1
LKCell: Efficient Cell Nuclei Instance Segmentation with Large Convolution KernelsCode1
LinkNet: Exploiting Encoder Representations for Efficient Semantic SegmentationCode1
Show:102550
← PrevPage 66 of 296Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4InternImage-HValidation mIoU62.9Unverified
5M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified