SOTAVerified

Semantic Segmentation

Papers

Showing 3651–3700 of 14763 papers

| Title | Status | Hype |
| --- | --- | --- |
| Dual-scale Enhanced and Cross-generative Consistency Learning for Semi-supervised Medical Image Segmentation | Code | 0 |
| PULASki: Learning inter-rater variability using statistical distances to improve probabilistic segmentation | Code | 0 |
| UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces | Code | 2 |
| PointCT: Point Central Transformer Network for Weakly-supervised Point Cloud Semantic Segmentation | Code | 1 |
| Prototype-Based Approach for One-Shot Segmentation of Brain Tumors using Few-Shot Learning | | 0 |
| WildScenes: A Benchmark for 2D and 3D Semantic Segmentation in Large-scale Natural Environments | Code | 1 |
| Make Me a BNN: A Simple Strategy for Estimating Bayesian Uncertainty from Pre-trained Models | | 0 |
| Narrowing the semantic gaps in U-Net with learnable skip connections: The case of medical image segmentation | Code | 2 |
| SCUNet++: Swin-UNet and CNN Bottleneck Hybrid Architecture with Multi-Fusion Dense Skip Connection for Pulmonary Embolism CT Image Segmentation | Code | 1 |
| Variance-insensitive and Target-preserving Mask Refinement for Interactive Image Segmentation | | 0 |
| Harnessing Diffusion Models for Visual Perception with Meta Prompts | Code | 1 |
| SurgicalPart-SAM: Part-to-Whole Collaborative Prompting for Surgical Instrument Segmentation | Code | 1 |
| Dual Attention U-Net with Feature Infusion: Pushing the Boundaries of Multiclass Defect Segmentation | Code | 1 |
| Few Shot Part Segmentation Reveals Compositional Logic for Industrial Anomaly Detection | Code | 1 |
| Weakly Supervised Semantic Segmentation for Driving Scenes | Code | 1 |
| TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification | Code | 1 |
| BEVSeg2TP: Surround View Camera Bird's-Eye-View Based Joint Vehicle Segmentation and Ego Vehicle Trajectory Prediction | | 0 |
| DVIS++: Improved Decoupled Framework for Universal Video Segmentation | Code | 1 |
| No More Shortcuts: Realizing the Potential of Temporal Self-Supervision | | 0 |
| FedSODA: Federated Cross-assessment and Dynamic Aggregation for Histopathology Segmentation | Code | 0 |
| TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary Multi-Label Classification of CLIP Without Training | Code | 1 |
| Segment Anything Model Meets Image Harmonization | | 0 |
| MetaSegNet: Metadata-collaborative Vision-Language Representation Learning for Semantic Segmentation of Remote Sensing Images | | 0 |
| JoReS-Diff: Joint Retinex and Semantic Priors in Diffusion Model for Low-light Image Enhancement | | 0 |
| Multi-task Learning To Improve Semantic Segmentation Of CBCT Scans Using Image Reconstruction | | 0 |
| Testing the Segment Anything Model on radiology data | | 0 |
| Spectral Prompt Tuning: Unveiling Unseen Classes for Zero-Shot Semantic Segmentation | Code | 1 |
| FedA3I: Annotation Quality-Aware Aggregation for Federated Medical Image Segmentation against Heterogeneous Annotation Noise | Code | 1 |
| Cached Transformers: Improving Transformers with Differentiable Memory Cache | Code | 1 |
| DDOS: The Drone Depth and Obstacle Segmentation Dataset | | 0 |
| Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation | Code | 1 |
| Surf-CDM: Score-Based Surface Cold-Diffusion Model For Medical Image Segmentation | | 0 |
| Rethinking LiDAR Domain Generalization: Single Source as Multiple Density Domains | Code | 0 |
| MDD-UNet: Domain Adaptation for Medical Image Segmentation with Theoretical Guarantees, a Proof of Concept | Code | 1 |
| SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process | Code | 2 |
| All for One, and One for All: UrbanSyn Dataset, the third Musketeer of Synthetic Driving Scenes | | 0 |
| Active contours driven by local and global intensity fitting energy with application to SAR image segmentation and its fast solvers | | 0 |
| The Endoscapes Dataset for Surgical Scene Segmentation, Object Detection, and Critical View of Safety Assessment: Official Splits and Benchmark | Code | 1 |
| Mask Grounding for Referring Image Segmentation | Code | 1 |
| SoftCTM: Cell detection by soft instance segmentation and consideration of cell-tissue interaction | Code | 1 |
| CLIP-DINOiser: Teaching CLIP a few DINO tricks for open-vocabulary semantic segmentation | Code | 2 |
| Spherical Mask: Coarse-to-Fine 3D Point Cloud Instance Segmentation with Spherical Representation | Code | 1 |
| Language-Assisted 3D Scene Understanding | | 0 |
| SeeBel: Seeing is Believing | Code | 0 |
| ConDaFormer: Disassembled Transformer with Local Structure Enhancement for 3D Point Cloud Understanding | Code | 1 |
| Collaborative Learning for Annotation-Efficient Volumetric MR Image Segmentation | | 0 |
| Research on Multilingual Natural Scene Text Detection Algorithm | Code | 0 |
| Appearance-Based Refinement for Object-Centric Motion Segmentation | | 0 |
| PlaNet-S: Automatic Semantic Segmentation of Placenta | | 0 |
| Semantic Segmentation Using Transfer Learning on Fisheye Images | | 0 |
Page 74 of 296

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | InternImage-H (M3I Pre-training) | Params (M) | 1,310 | | Unverified |
| 2 | ViT-P (InternImage-H) | Validation mIoU | 63.6 | | Unverified |
| 3 | ONE-PEACE | Validation mIoU | 63 | | Unverified |
| 4 | InternImage-H | Validation mIoU | 62.9 | | Unverified |
| 5 | M3I Pre-training (InternImage-H) | Validation mIoU | 62.9 | | Unverified |
| 6 | BEiT-3 | Validation mIoU | 62.8 | | Unverified |
| 7 | EVA | Validation mIoU | 62.3 | | Unverified |
| 8 | ViT-P (OneFormer, InternImage-H) | Validation mIoU | 61.6 | | Unverified |
| 9 | ViT-Adapter-L (Mask2Former, BEiTv2 pretrain) | Validation mIoU | 61.5 | | Unverified |
| 10 | FD-SwinV2-G | Validation mIoU | 61.4 | | Unverified |