SOTAVerified

Semantic Segmentation

Papers

Showing 651700 of 14763 papers

TitleStatusHype
DiffAtlas: GenAI-fying Atlas Segmentation via Image-Mask DiffusionCode2
DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion ModelsCode2
Mask2Former for Video Instance SegmentationCode2
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal PromptingCode2
CARLA2Real: a tool for reducing the sim2real gap in CARLA simulatorCode2
Masked Generative DistillationCode2
Dataset QuantizationCode2
MaskTerial: A Foundation Model for Automated 2D Material Flake DetectionCode2
Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic SegmentationCode2
DAMamba: Vision State Space Model with Dynamic Adaptive ScanCode2
MCIBI++: Soft Mining Contextual Information Beyond Image for Semantic SegmentationCode2
Caltech Aerial RGB-Thermal Dataset in the WildCode2
DAT++: Spatially Dynamic Vision Transformer with Deformable AttentionCode2
Cross-Modal Interactive Perception Network with Mamba for Lung Tumor Segmentation in PET-CT ImagesCode2
CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything ModelCode2
CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic SegmentationCode2
2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point CloudsCode2
Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded ScenesCode2
MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series AnalysisCode2
MedUniSeg: 2D and 3D Medical Image Segmentation via a Prompt-driven Universal ModelCode2
StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic SegmentationCode2
Adversarial Supervision Makes Layout-to-Image Diffusion Models ThriveCode2
MeViS: A Large-scale Benchmark for Video Segmentation with Motion ExpressionsCode2
Cross-Image Relational Knowledge Distillation for Semantic SegmentationCode2
MinVIS: A Minimal Video Instance Segmentation Framework without Video-based TrainingCode2
MIS-FM: 3D Medical Image Segmentation using Foundation Models Pretrained on a Large-Scale Unannotated DatasetCode2
CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image FusionCode2
MobileUNETR: A Lightweight End-To-End Hybrid Vision Transformer For Efficient Medical Image SegmentationCode2
4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural NetworksCode2
ARKit LabelMaker: A New Scale for Indoor 3D Scene UnderstandingCode2
Modeling the Label Distributions for Weakly-Supervised Semantic SegmentationCode2
More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using SparsityCode2
Cross Language Image Matching for Weakly Supervised Semantic SegmentationCode2
Moving Object Segmentation in Point Cloud Data using Hidden Markov ModelsCode2
Customized Segment Anything Model for Medical Image SegmentationCode2
MSVM-UNet: Multi-Scale Vision Mamba UNet for Medical Image SegmentationCode2
Multimodal Information Interaction for Medical Image SegmentationCode2
ASAM: Boosting Segment Anything Model with Adversarial TuningCode2
Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial ImageryCode2
Multi-Scale Representations by Varying Window Attention for Semantic SegmentationCode2
Narrowing the semantic gaps in U-Net with learnable skip connections: The case of medical image segmentationCode2
Neighborhood Attention TransformerCode2
SlicerNNInteractive: A 3D Slicer extension for nnInteractiveCode2
nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space ModelCode2
DaViT: Dual Attention Vision TransformersCode2
BEFUnet: A Hybrid CNN-Transformer Architecture for Precise Medical Image SegmentationCode2
Coordinate Attention for Efficient Mobile Network DesignCode2
CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic SegmentationCode2
Occlusion-Aware Instance Segmentation via BiLayer Network ArchitecturesCode2
BEiT: BERT Pre-Training of Image TransformersCode2
Show:102550
← PrevPage 14 of 296Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
5InternImage-HValidation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified