SOTAVerified

Semantic Segmentation

Papers

Showing 51100 of 14763 papers

TitleStatusHype
Image Segmentation Keras : Implementation of Segnet, FCN, UNet, PSPNet and other models in KerasCode4
Semantic-SAM: Segment and Recognize Anything at Any GranularityCode4
The Segment Anything Model (SAM) for Remote Sensing Applications: From Zero to One ShotCode4
SSL4EO-L: Datasets and Foundation Models for Landsat ImageryCode4
Segment Anything in Medical ImagesCode4
SegGPT: Segmenting Everything In ContextCode4
InceptionNeXt: When Inception Meets ConvNeXtCode4
RTMDet: An Empirical Study of Designing Real-Time Object DetectorsCode4
Images Speak in Images: A Generalist Painter for In-Context Visual LearningCode4
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable ConvolutionsCode4
SiamMask: A Framework for Fast Online Object Tracking and SegmentationCode4
GLIPv2: Unifying Localization and Vision-Language UnderstandingCode4
Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and SegmentationCode4
EfficientViT: Multi-Scale Linear Attention for High-Resolution Dense PredictionCode4
Architecture-Agnostic Masked Image Modeling -- From ViT back to CNNCode4
Highly Accurate Dichotomous Image SegmentationCode4
Visual Attention NetworkCode4
Detectron2 Object Detection & Manipulating Images using CartoonizationCode4
Panoptic Feature Pyramid NetworksCode4
Deep Residual Learning for Image RecognitionCode4
No time to train! Training-Free Reference-Based Instance SegmentationCode3
DFormerv2: Geometry Self-Attention for RGBD Semantic SegmentationCode3
UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language InterfaceCode3
DICEPTION: A Generalist Diffusion Model for Visual Perceptual TasksCode3
ConceptAttention: Diffusion Transformers Learn Highly Interpretable FeaturesCode3
Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation ModelsCode3
How Well Do Supervised 3D Models Transfer to Medical Imaging Tasks?Code3
SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic SegmentationCode3
SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG GenerationCode3
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video SegmentationCode3
Interactive Medical Image Segmentation: A Benchmark Dataset and BaselineCode3
SMITE: Segment Me In TimECode3
UniMatch V2: Pushing the Limit of Semi-Supervised Semantic SegmentationCode3
Rethinking the Evaluation of Visible and Infrared Image FusionCode3
SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing ImagesCode3
Breaking reCAPTCHAv2Code3
InstanSeg: an embedding-based instance segmentation algorithm optimized for accurate, efficient and portable cell segmentationCode3
A Survey of Camouflaged Object Detection and BeyondCode3
A Short Review and Evaluation of SAM2's Performance in 3D CT Image SegmentationCode3
SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image SegmentationCode3
5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition TasksCode3
TCFormer: Visual Recognition via Token Clustering TransformerCode3
VISA: Reasoning Video Object Segmentation via Large Language ModelsCode3
xLSTM-UNet can be an Effective 2D & 3D Medical Image Segmentation Backbone with Vision-LSTM (ViL) better than its Mamba CounterpartCode3
Segment Anything without SupervisionCode3
Point-SAM: Promptable 3D Segmentation Model for Point CloudsCode3
RobustSAM: Segment Anything Robustly on Degraded ImagesCode3
Merlin: A Vision Language Foundation Model for 3D Computed TomographyCode3
VISTA3D: Versatile Imaging SegmenTation and Annotation model for 3D Computed TomographyCode3
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance SegmentationCode3
Show:102550
← PrevPage 2 of 296Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
5InternImage-HValidation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified