SOTAVerified

Semantic Segmentation

Papers

Showing 101150 of 14763 papers

TitleStatusHype
Personalize Segment Anything Model with One ShotCode3
RS-Mamba for Large Remote Sensing Image Dense PredictionCode3
PointNeXt: Revisiting PointNet++ with Improved Training and Scaling StrategiesCode3
SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image SegmentationCode3
Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation ModelsCode3
SAM Fails to Segment Anything? -- SAM-Adapter: Adapting SAM in Underperformed Scenes: Camouflage, Shadow, Medical Image Segmentation, and MoreCode3
PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360degCode3
SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing ImagesCode3
Point-SAM: Promptable 3D Segmentation Model for Point CloudsCode3
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance SegmentationCode3
ONE-PEACE: Exploring One General Representation Model Toward Unlimited ModalitiesCode3
Nuclei instance segmentation and classification in histopathology images with StarDistCode3
OneFormer: One Transformer to Rule Universal Image SegmentationCode3
PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360^Code3
Point Transformer V3: Simpler, Faster, StrongerCode3
MTP: Advancing Remote Sensing Foundation Model via Multi-Task PretrainingCode3
Breaking reCAPTCHAv2Code3
Sigma: Siamese Mamba Network for Multi-Modal Semantic SegmentationCode3
Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous DrivingCode3
Stronger Fewer & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic SegmentationCode3
Merlin: A Vision Language Foundation Model for 3D Computed TomographyCode3
TCFormer: Visual Recognition via Token Clustering TransformerCode3
Moving Object Segmentation: All You Need Is SAM (and Flow)Code3
No time to train! Training-Free Reference-Based Instance SegmentationCode3
Transformers in Medical Imaging: A SurveyCode3
UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language InterfaceCode3
Medical SAM Adapter: Adapting Segment Anything Model for Medical Image SegmentationCode3
MedSegDiff: Medical Image Segmentation with Diffusion Probabilistic ModelCode3
MedSegDiff-V2: Diffusion based Medical Image Segmentation with TransformerCode3
PSALM: Pixelwise SegmentAtion with Large Multi-Modal ModelCode3
Beyond Appearance: a Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual TasksCode3
Interactive Medical Image Segmentation: A Benchmark Dataset and BaselineCode3
InstanSeg: an embedding-based instance segmentation algorithm optimized for accurate, efficient and portable cell segmentationCode3
LangSplat: 3D Language Gaussian SplattingCode3
How Well Do Supervised 3D Models Transfer to Medical Imaging Tasks?Code3
How to build the best medical image segmentation algorithm using foundation models: a comprehensive empirical study with Segment Anything ModelCode3
LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image SegmentationCode3
A Simple Framework for Open-Vocabulary Segmentation and DetectionCode3
A Short Review and Evaluation of SAM2's Performance in 3D CT Image SegmentationCode3
FastViT: A Fast Hybrid Vision Transformer using Structural ReparameterizationCode3
FDA: Fourier Domain Adaptation for Semantic SegmentationCode3
FRACTAL: An Ultra-Large-Scale Aerial Lidar Dataset for 3D Semantic Segmentation of Diverse LandscapesCode3
Anything-3D: Towards Single-view Anything Reconstruction in the WildCode3
Exploring Regional Clues in CLIP for Zero-Shot Semantic SegmentationCode3
EMCAD: Efficient Multi-scale Convolutional Attention Decoding for Medical Image SegmentationCode3
Generalized Decoding for Pixel, Image, and LanguageCode3
AM-RADIO: Agglomerative Vision Foundation Model -- Reduce All Domains Into OneCode3
DFormerv2: Geometry Self-Attention for RGBD Semantic SegmentationCode3
DICEPTION: A Generalist Diffusion Model for Visual Perceptual TasksCode3
A Survey of Camouflaged Object Detection and BeyondCode3
Show:102550
← PrevPage 3 of 296Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
5InternImage-HValidation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified