SOTAVerified

Semantic Segmentation

Papers

Showing 176200 of 14763 papers

TitleStatusHype
TextRegion: Text-Aligned Region Tokens from Frozen Image-Text ModelsCode2
The Missing Point in Vision Transformers for Universal Image SegmentationCode2
Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System CollaborationCode2
MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-LearningCode2
Recent Advances in Medical Imaging Segmentation: A SurveyCode2
Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and SegmentationCode2
DeCLIP: Decoupled Learning for Open-Vocabulary Dense PerceptionCode2
Rethinking Boundary Detection in Deep Learning-Based Medical Image SegmentationCode2
Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and OutlookCode2
Digital Twin Generation from Visual Data: A SurveyCode2
The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single TransformerCode2
P2Object: Single Point Supervised Object Detection and Instance SegmentationCode2
GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video SegmentationCode2
Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency AdaptationCode2
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal PromptingCode2
SlicerNNInteractive: A 3D Slicer extension for nnInteractiveCode2
Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic SegmentationCode2
Delineate Anything: Resolution-Agnostic Field Boundary Delineation on Satellite ImageryCode2
Scene-Centric Unsupervised Panoptic SegmentationCode2
A Unified Image-Dense Annotation Generation Model for Underwater ScenesCode2
Towards Generating Realistic 3D Semantic Training Data for Autonomous DrivingCode2
Exploring CLIP's Dense Knowledge for Weakly Supervised Semantic SegmentationCode2
COB-GS: Clear Object Boundaries in 3DGS Segmentation Based on Boundary-Adaptive Gaussian SplittingCode2
MaSS13K: A Matting-level Semantic Segmentation BenchmarkCode2
DINO in the Room: Leveraging 2D Foundation Models for 3D SegmentationCode2
Show:102550
← PrevPage 8 of 591Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4InternImage-HValidation mIoU62.9Unverified
5M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified