SOTAVerified

Semantic Segmentation

Papers

Showing 2650 of 14763 papers

TitleStatusHype
I^2R: Inter and Intra-image Refinement in Few Shot Segmentation0
RSRefSeg 2: Decoupling Referring Remote Sensing Image Segmentation with Foundation ModelsCode1
Beyond Appearance: Geometric Cues for Robust Video Instance Segmentation0
Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR RepresentationsCode1
OpenWorldSAM: Extending SAM2 for Universal Image Segmentation with Language Prompts0
NRSeg: Noise-Resilient Learning for BEV Semantic Segmentation via Driving World ModelsCode0
SAMed-2: Selective Memory Enhanced Medical Segment Anything ModelCode1
Leveraging Out-of-Distribution Unlabeled Images: Semi-Supervised Semantic Segmentation with an Open-Vocabulary ModelCode0
Causal-SAM-LLM: Large Language Models as Causal Reasoners for Robust Medical Segmentation0
From Pixels to Damage Severity: Estimating Earthquake Impacts Using Semantic Segmentation of Social Media Images0
No time to train! Training-Free Reference-Based Instance SegmentationCode3
DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback SynergyCode1
Autoadaptive Medical Segment Anything ModelCode0
NOCTIS: Novel Object Cyclic Threshold based Instance SegmentationCode0
Perception-Oriented Latent Coding for High-Performance Compressed Domain Semantic InferenceCode0
Prompt2SegCXR:Prompt to Segment All Organs and Diseases in Chest X-rays0
Process-aware and high-fidelity microstructure generation using stable diffusion0
GroundingDINO-US-SAM: Text-Prompted Multi-Organ Segmentation in Ultrasound with LoRA-Tuned Vision-Language Models0
MedSAM-CA: A CNN-Augmented ViT with Attention-Enhanced Multi-Scale Fusion for Medical Image Segmentation0
DC-TTA: Divide-and-Conquer Framework for Test-Time Adaptation of Interactive Segmentation0
Decoupled Seg Tokens Make Stronger Reasoning Video Segmenter and GrounderCode1
Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval0
VoteSplat: Hough Voting Gaussian Splatting for 3D Scene Understanding0
ProSAM: Enhancing the Robustness of SAM-based Visual Reference Segmentation with Probabilistic Prompts0
TASeg: Text-aware RGB-T Semantic Segmentation based on Fine-tuning Vision Foundation Models0
Show:102550
← PrevPage 2 of 591Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
5InternImage-HValidation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified