SOTAVerified

Semantic Segmentation

Papers

Showing 201–250 of 14763 papers

Title | Status | Hype
Cross-Modal Interactive Perception Network with Mamba for Lung Tumor Segmentation in PET-CT Images | Code | 2
Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting | Code | 2
Test-Time Domain Generalization via Universe Learning: A Multi-Graph Matching Approach for Medical Image Segmentation | Code | 2
HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model | Code | 2
RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing | Code | 2
DiffAtlas: GenAI-fying Atlas Segmentation via Image-Mask Diffusion | Code | 2
Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation | Code | 2
Golden Cudgel Network for Real-Time Semantic Segmentation | Code | 2
SemiSAM+: Rethinking Semi-Supervised Medical Image Segmentation in the Era of Foundation Models | Code | 2
DAMamba: Vision State Space Model with Dynamic Adaptive Scan | Code | 2
SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement | Code | 2
Segment Anything for Histopathology | Code | 2
iFormer: Integrating ConvNet and Transformer for Mobile Application | Code | 2
LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks | Code | 2
Scaling up self-supervised learning for improved surgical foundation models | Code | 2
Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation | Code | 2
RWKV-UNet: Improving UNet with Long-Range Cooperation for Effective Medical Image Segmentation | Code | 2
RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation Models | Code | 2
Merging Context Clustering with Visual State Space Models for Medical Image Segmentation | Code | 2
nnWNet: Rethinking the Use of Transformers in Biomedical Image Segmentation and Calling for a Unified Evaluation Benchmark | Code | 2
Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation | Code | 2
RelationField: Relate Anything in Radiance Fields | Code | 2
Learnable Prompting SAM-induced Knowledge Distillation for Semi-supervised Medical Image Segmentation | Code | 2
MaskTerial: A Foundation Model for Automated 2D Material Flake Detection | Code | 2
FAMNet: Frequency-aware Matching Network for Cross-domain Few-shot Medical Image Segmentation | Code | 2
ConDSeg: A General Medical Image Segmentation Framework via Contrast-Driven Feature Enhancement | Code | 2
SegFace: Face Segmentation of Long-Tail Classes | Code | 2
DreamColour: Controllable Video Colour Editing without Training | Code | 2
Exact: Exploring Space-Time Perceptive Clues for Weakly Supervised Satellite Image Time Series Semantic Segmentation | Code | 2
Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation | Code | 2
SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning | Code | 2
FLAIR: VLM with Fine-grained Language-informed Image Representations | Code | 2
2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification | Code | 2
TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba | Code | 2
vesselFM: A Foundation Model for Universal 3D Blood Vessel Segmentation | Code | 2
HyperSeg: Towards Universal Visual Segmentation with Large Language Model | Code | 2
An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models | Code | 2
Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training | Code | 2
Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation | Code | 2
ResCLIP: Residual Attention for Training-free Dense Vision-language Inference | Code | 2
IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos | Code | 2
CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation | Code | 2
Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation | Code | 2
CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation | Code | 2
Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation | Code | 2
Domain Adaptation with a Single Vision-Language Embedding | Code | 2
Moving Object Segmentation in Point Cloud Data using Hidden Markov Models | Code | 2
CARLA2Real: a tool for reducing the sim2real gap in CARLA simulator | Code | 2
DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model | Code | 2
TIPS: Text-Image Pretraining with Spatial Awareness | Code | 2

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | InternImage-H (M3I Pre-training) | Params (M) | 1,310 | - | Unverified
2 | ViT-P (InternImage-H) | Validation mIoU | 63.6 | - | Unverified
3 | ONE-PEACE | Validation mIoU | 63 | - | Unverified
4 | InternImage-H | Validation mIoU | 62.9 | - | Unverified
5 | M3I Pre-training (InternImage-H) | Validation mIoU | 62.9 | - | Unverified
6 | BEiT-3 | Validation mIoU | 62.8 | - | Unverified
7 | EVA | Validation mIoU | 62.3 | - | Unverified
8 | ViT-P (OneFormer, InternImage-H) | Validation mIoU | 61.6 | - | Unverified
9 | ViT-Adapter-L (Mask2Former, BEiTv2 pretrain) | Validation mIoU | 61.5 | - | Unverified
10 | FD-SwinV2-G | Validation mIoU | 61.4 | - | Unverified