SOTAVerified

Semantic Segmentation

Papers

Showing 150 of 14763 papers

TitleStatusHype
SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction0
Unified Medical Image Segmentation with State Space Modeling Snake0
SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance SegmentationCode0
DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion ModelCode0
A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique0
SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation0
Alleviating Textual Reliance in Medical Language-guided Segmentation via Prototype-driven Semantic ApproximationCode2
Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping1
U-RWKV: Lightweight medical image segmentation with direction-adaptive RWKVCode1
Personalized OVSS: Understanding Personal Concept in Open-Vocabulary Semantic Segmentation0
MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second0
Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks0
A New Dataset and Performance Benchmark for Real-time Spacecraft Segmentation in Onboard Flight ComputersCode0
DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic SegmentationCode0
Memory-Augmented SAM2 for Training-Free Surgical Video Segmentation0
Prompt Engineering in Segment Anything Model: Methodologies, Applications, and Emerging Challenges0
Admissibility of Stein Shrinkage for Batch Normalization in the Presence of Adversarial Attacks0
HNOSeg-XS: Extremely Small Hartley Neural Operator for Efficient and Resolution-Robust 3D Image SegmentationCode0
MUVOD: A Novel Multi-view Video Object Segmentation Dataset and A Benchmark for 3D Segmentation0
Rethinking Query-based Transformer for Continual Image SegmentationCode1
LIRA: Inferring Segmentation in Large Multi-modal Models with Local Interleaved Region AssistanceCode0
SPADE: Spatial-Aware Denoising Network for Open-vocabulary Panoptic Scene Graph Generation with Long- and Local-range Context Reasoning0
Empowering Bridge Digital Twins by Bridging the Data Gap with a Unified Synthesis Framework0
DreamGrasp: Zero-Shot 3D Multi-Object Reconstruction from Partial-View Images for Robotic Manipulation0
Just Say Better or Worse: A Human-AI Collaborative Framework for Medical Image Segmentation Without Manual Annotations0
I^2R: Inter and Intra-image Refinement in Few Shot Segmentation0
RSRefSeg 2: Decoupling Referring Remote Sensing Image Segmentation with Foundation ModelsCode1
Beyond Appearance: Geometric Cues for Robust Video Instance Segmentation0
Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR RepresentationsCode1
OpenWorldSAM: Extending SAM2 for Universal Image Segmentation with Language Prompts0
NRSeg: Noise-Resilient Learning for BEV Semantic Segmentation via Driving World ModelsCode0
SAMed-2: Selective Memory Enhanced Medical Segment Anything ModelCode1
Leveraging Out-of-Distribution Unlabeled Images: Semi-Supervised Semantic Segmentation with an Open-Vocabulary ModelCode0
Causal-SAM-LLM: Large Language Models as Causal Reasoners for Robust Medical Segmentation0
From Pixels to Damage Severity: Estimating Earthquake Impacts Using Semantic Segmentation of Social Media Images0
No time to train! Training-Free Reference-Based Instance SegmentationCode3
DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback SynergyCode1
Autoadaptive Medical Segment Anything ModelCode0
NOCTIS: Novel Object Cyclic Threshold based Instance SegmentationCode0
Perception-Oriented Latent Coding for High-Performance Compressed Domain Semantic InferenceCode0
Prompt2SegCXR:Prompt to Segment All Organs and Diseases in Chest X-rays0
Process-aware and high-fidelity microstructure generation using stable diffusion0
GroundingDINO-US-SAM: Text-Prompted Multi-Organ Segmentation in Ultrasound with LoRA-Tuned Vision-Language Models0
MedSAM-CA: A CNN-Augmented ViT with Attention-Enhanced Multi-Scale Fusion for Medical Image Segmentation0
DC-TTA: Divide-and-Conquer Framework for Test-Time Adaptation of Interactive Segmentation0
Decoupled Seg Tokens Make Stronger Reasoning Video Segmenter and GrounderCode1
Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval0
VoteSplat: Hough Voting Gaussian Splatting for 3D Scene Understanding0
ProSAM: Enhancing the Robustness of SAM-based Visual Reference Segmentation with Probabilistic Prompts0
TASeg: Text-aware RGB-T Semantic Segmentation based on Fine-tuning Vision Foundation Models0
Show:102550
← PrevPage 1 of 296Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
5InternImage-HValidation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified