SOTAVerified

Semantic Segmentation

Papers

Showing 101150 of 14763 papers

TitleStatusHype
PlainMamba: Improving Non-Hierarchical Mamba in Visual RecognitionCode3
RS-Mamba for Large Remote Sensing Image Dense PredictionCode3
PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360degCode3
PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360^Code3
SAM Fails to Segment Anything? -- SAM-Adapter: Adapting SAM in Underperformed Scenes: Camouflage, Shadow, Medical Image Segmentation, and MoreCode3
SAM-Med2DCode3
SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing ImagesCode3
ConvNeXt V2: Co-designing and Scaling ConvNets with Masked AutoencodersCode3
PointNeXt: Revisiting PointNet++ with Improved Training and Scaling StrategiesCode3
ONE-PEACE: Exploring One General Representation Model Toward Unlimited ModalitiesCode3
OneFormer: One Transformer to Rule Universal Image SegmentationCode3
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance SegmentationCode3
Nuclei instance segmentation and classification in histopathology images with StarDistCode3
Point-SAM: Promptable 3D Segmentation Model for Point CloudsCode3
Moving Object Segmentation: All You Need Is SAM (and Flow)Code3
Breaking reCAPTCHAv2Code3
MTP: Advancing Remote Sensing Foundation Model via Multi-Task PretrainingCode3
Sigma: Siamese Mamba Network for Multi-Modal Semantic SegmentationCode3
MedSegDiff-V2: Diffusion based Medical Image Segmentation with TransformerCode3
Stronger Fewer & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic SegmentationCode3
Merlin: A Vision Language Foundation Model for 3D Computed TomographyCode3
TCFormer: Visual Recognition via Token Clustering TransformerCode3
Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous DrivingCode3
Medical SAM Adapter: Adapting Segment Anything Model for Medical Image SegmentationCode3
Transformers in Medical Imaging: A SurveyCode3
UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language InterfaceCode3
MA-Net: A Multi-Scale Attention Network for Liver and Tumor SegmentationCode3
MedSegDiff: Medical Image Segmentation with Diffusion Probabilistic ModelCode3
No time to train! Training-Free Reference-Based Instance SegmentationCode3
Point Transformer V3: Simpler, Faster, StrongerCode3
LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image SegmentationCode3
Beyond Appearance: a Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual TasksCode3
LangSplat: 3D Language Gaussian SplattingCode3
InstanSeg: an embedding-based instance segmentation algorithm optimized for accurate, efficient and portable cell segmentationCode3
Interactive Medical Image Segmentation: A Benchmark Dataset and BaselineCode3
5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition TasksCode3
How Well Do Supervised 3D Models Transfer to Medical Imaging Tasks?Code3
A Survey of Camouflaged Object Detection and BeyondCode3
How to build the best medical image segmentation algorithm using foundation models: a comprehensive empirical study with Segment Anything ModelCode3
FRACTAL: An Ultra-Large-Scale Aerial Lidar Dataset for 3D Semantic Segmentation of Diverse LandscapesCode3
A Short Review and Evaluation of SAM2's Performance in 3D CT Image SegmentationCode3
Generalized Decoding for Pixel, Image, and LanguageCode3
A Simple Framework for Open-Vocabulary Segmentation and DetectionCode3
Generalized Robot 3D Vision-Language Model with Fast Rendering and Pre-Training Vision-Language AlignmentCode3
Anything-3D: Towards Single-view Anything Reconstruction in the WildCode3
FastViT: A Fast Hybrid Vision Transformer using Structural ReparameterizationCode3
Exploring Regional Clues in CLIP for Zero-Shot Semantic SegmentationCode3
FDA: Fourier Domain Adaptation for Semantic SegmentationCode3
AM-RADIO: Agglomerative Vision Foundation Model -- Reduce All Domains Into OneCode3
DICEPTION: A Generalist Diffusion Model for Visual Perceptual TasksCode3
Show:102550
← PrevPage 3 of 296Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
5InternImage-HValidation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified