SOTAVerified

Semantic Segmentation

Papers

Showing 301350 of 14763 papers

TitleStatusHype
A Unified Framework for 3D Scene UnderstandingCode2
HiDiff: Hybrid Diffusion Framework for Medical Image SegmentationCode2
Context-Aware Video Instance SegmentationCode2
Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse WeatherCode2
Label Anything: Multi-Class Few-Shot Semantic Segmentation with Visual PromptsCode2
The Surprising Effectiveness of Multimodal Large Language Models for Video Moment RetrievalCode2
Stable Diffusion Segmentation for Biomedical Images with Single-step Reverse ProcessCode2
SegNet4D: Efficient Instance-Aware 4D Semantic Segmentation for LiDAR Point CloudCode2
Are Vision xLSTM Embedded UNet More Reliable in Medical 3D Image Segmentation?Code2
SelfReg-UNet: Self-Regularized UNet for Medical Image SegmentationCode2
Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic SegmentationCode2
Scaling Efficient Masked Image Modeling on Large Remote Sensing DatasetCode2
Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale DatasetCode2
Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and LanguageCode2
Medical Vision Generalist: Unifying Medical Imaging Tasks in ContextCode2
DSNet: A Novel Way to Use Atrous Convolutions in Semantic SegmentationCode2
Parameter-Inverted Image Pyramid NetworksCode2
DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized CutCode2
U-KAN Makes Strong Backbone for Medical Image Segmentation and GenerationCode2
Generative Active Learning for Long-tailed Instance SegmentationCode2
GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision TransformerCode2
Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic SegmentationCode2
TotalVibeSegmentator: Full Body MRI Segmentation for the NAKO and UK BiobankCode2
Open-Set Domain Adaptation for Semantic SegmentationCode2
Adapting Pre-Trained Vision Models for Novel Instance Detection and SegmentationCode2
Memorize What Matters: Emergent Scene Decomposition from MultitraverseCode2
Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion ModelsCode2
Mamba-R: Vision Mamba ALSO Needs RegistersCode2
KPConvX: Modernizing Kernel Point Convolution with Kernel AttentionCode2
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative DataCode2
Context-Guided Spatial Feature Reconstruction for Efficient Semantic SegmentationCode2
Modality-agnostic Domain Generalizable Medical Image Segmentation by Multi-Frequency in Multi-Scale AttentionCode2
GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNsCode2
OpenESS: Event-based Semantic Scene Understanding with Open VocabulariesCode2
PTQ4SAM: Post-Training Quantization for Segment AnythingCode2
MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation LearningCode2
ASAM: Boosting Segment Anything Model with Adversarial TuningCode2
Adaptive Bidirectional Displacement for Semi-Supervised Medical Image SegmentationCode2
LVOS: A Benchmark for Large-scale Long-term Video Object SegmentationCode2
Multi-Scale Representations by Varying Window Attention for Semantic SegmentationCode2
A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic SegmentationCode2
Multimodal Information Interaction for Medical Image SegmentationCode2
Dynamic in Static: Hybrid Visual Correspondence for Self-Supervised Video Object SegmentationCode2
Augmented Object Intelligence with XR-ObjectsCode2
Generalized Few-Shot Meets Remote Sensing: Discovering Novel Classes in Land Cover Mapping via Hybrid Semantic Segmentation FrameworkCode2
MambaMOS: LiDAR-based 3D Moving Object Segmentation with Motion-aware State Space ModelCode2
An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-TrainingCode2
Constructing and Exploring Intermediate Domains in Mixed Domain Semi-supervised Medical Image SegmentationCode2
LaSagnA: Language-based Segmentation Assistant for Complex QueriesCode2
LLM-Seg: Bridging Image Segmentation and Large Language Model ReasoningCode2
Show:102550
← PrevPage 7 of 296Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
5InternImage-HValidation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified