SOTAVerified

Semantic Segmentation

Papers

Showing 126–150 of 14763 papers

Title | Status | Hype
UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface | Code | 3
MA-Net: A Multi-Scale Attention Network for Liver and Tumor Segmentation | Code | 3
MedSegDiff: Medical Image Segmentation with Diffusion Probabilistic Model | Code | 3
No time to train! Training-Free Reference-Based Instance Segmentation | Code | 3
Point Transformer V3: Simpler, Faster, Stronger | Code | 3
LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image Segmentation | Code | 3
Beyond Appearance: a Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks | Code | 3
LangSplat: 3D Language Gaussian Splatting | Code | 3
InstanSeg: an embedding-based instance segmentation algorithm optimized for accurate, efficient and portable cell segmentation | Code | 3
Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline | Code | 3
5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks | Code | 3
How to build the best medical image segmentation algorithm using foundation models: a comprehensive empirical study with Segment Anything Model | Code | 3
A Survey of Camouflaged Object Detection and Beyond | Code | 3
How Well Do Supervised 3D Models Transfer to Medical Imaging Tasks? | Code | 3
A Short Review and Evaluation of SAM2's Performance in 3D CT Image Segmentation | Code | 3
A Simple Framework for Open-Vocabulary Segmentation and Detection | Code | 3
FRACTAL: An Ultra-Large-Scale Aerial Lidar Dataset for 3D Semantic Segmentation of Diverse Landscapes | Code | 3
Generalized Decoding for Pixel, Image, and Language | Code | 3
Generalized Robot 3D Vision-Language Model with Fast Rendering and Pre-Training Vision-Language Alignment | Code | 3
Anything-3D: Towards Single-view Anything Reconstruction in the Wild | Code | 3
FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization | Code | 3
Exploring Regional Clues in CLIP for Zero-Shot Semantic Segmentation | Code | 3
FDA: Fourier Domain Adaptation for Semantic Segmentation | Code | 3
AM-RADIO: Agglomerative Vision Foundation Model -- Reduce All Domains Into One | Code | 3
DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks | Code | 3
Page 6 of 591

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | InternImage-H (M3I Pre-training) | Params (M) | 1,310 |  | Unverified
2 | ViT-P (InternImage-H) | Validation mIoU | 63.6 |  | Unverified
3 | ONE-PEACE | Validation mIoU | 63 |  | Unverified
4 | M3I Pre-training (InternImage-H) | Validation mIoU | 62.9 |  | Unverified
5 | InternImage-H | Validation mIoU | 62.9 |  | Unverified
6 | BEiT-3 | Validation mIoU | 62.8 |  | Unverified
7 | EVA | Validation mIoU | 62.3 |  | Unverified
8 | ViT-P (OneFormer, InternImage-H) | Validation mIoU | 61.6 |  | Unverified
9 | ViT-Adapter-L (Mask2Former, BEiTv2 pretrain) | Validation mIoU | 61.5 |  | Unverified
10 | FD-SwinV2-G | Validation mIoU | 61.4 |  | Unverified