SOTAVerified

Semantic Segmentation

Papers

Showing 701750 of 14763 papers

TitleStatusHype
ClassWise-SAM-Adapter: Parameter Efficient Fine-tuning Adapts Segment Anything to SAR Domain for Semantic SegmentationCode2
Open-Set Domain Adaptation for Semantic SegmentationCode2
4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural NetworksCode2
DDP: Diffusion Model for Dense Visual PredictionCode2
ORFD: A Dataset and Benchmark for Off-Road Freespace DetectionCode2
Deep Covariance Alignment for Domain Adaptive Remote Sensing Image SegmentationCode2
Bidirectional Copy-Paste for Semi-Supervised Medical Image SegmentationCode2
DAMamba: Vision State Space Model with Dynamic Adaptive ScanCode2
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense PredictionCode2
Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene SegmentationCode2
DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion ModelsCode2
Customized Segment Anything Model for Medical Image SegmentationCode2
Asymmetric Non-local Neural Networks for Semantic SegmentationCode2
PEM: Prototype-based Efficient MaskFormer for Image SegmentationCode2
2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image ClassificationCode2
Dataset QuantizationCode2
Cross Language Image Matching for Weakly Supervised Semantic SegmentationCode2
PMFSNet: Polarized Multi-scale Feature Self-attention Network For Lightweight Medical Image SegmentationCode2
Cross-Image Relational Knowledge Distillation for Semantic SegmentationCode2
Cross-Modal Interactive Perception Network with Mamba for Lung Tumor Segmentation in PET-CT ImagesCode2
A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic CorrespondenceCode2
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training ParadigmCode2
AgileFormer: Spatially Agile Transformer UNet for Medical Image SegmentationCode2
Atlas: End-to-End 3D Scene Reconstruction from Posed ImagesCode2
CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic SegmentationCode2
CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale AttentionCode2
Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded ScenesCode2
DAT++: Spatially Dynamic Vision Transformer with Deformable AttentionCode2
Radar-Camera Fusion for Object Detection and Semantic Segmentation in Autonomous Driving: A Comprehensive ReviewCode2
CNOS: A Strong Baseline for CAD-based Novel Object SegmentationCode2
Receding Moving Object Segmentation in 3D LiDAR Data Using Sparse 4D ConvolutionsCode2
Recent Advances in Medical Imaging Segmentation: A SurveyCode2
Deep Snake for Real-Time Instance SegmentationCode2
CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse TransformersCode2
FedFMS: Exploring Federated Foundation Models for Medical Image SegmentationCode2
LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse KernelsCode2
ResT V2: Simpler, Faster and StrongerCode2
Rethinking Boundary Detection in Deep Learning-Based Medical Image SegmentationCode2
SAM-Med3D: Towards General-purpose Segmentation Models for Volumetric Medical ImagesCode2
Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse PromptsCode2
Rethinking Patch Dependence for Masked AutoencodersCode2
Beyond Semantic to Instance Segmentation: Weakly-Supervised Instance Segmentation via Semantic Knowledge Transfer and Self-RefinementCode1
Beyond Self-Supervision: A Simple Yet Effective Network Distillation Alternative to Improve BackbonesCode1
CP2: Copy-Paste Contrastive Pretraining for Semantic SegmentationCode1
Beyond the Prototype: Divide-and-conquer Proxies for Few-shot SegmentationCode1
CPCM: Contextual Point Cloud Modeling for Weakly-supervised Point Cloud Semantic SegmentationCode1
Beyond pixel-wise supervision for segmentation: A few global shape descriptors might be surprisingly good!Code1
Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label ClassifierCode1
CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image SegmentationCode1
Beyond One-to-One: Rethinking the Referring Image SegmentationCode1
Show:102550
← PrevPage 15 of 296Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4InternImage-HValidation mIoU62.9Unverified
5M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified