SOTAVerified

Semantic Segmentation

Papers

Showing 151200 of 14763 papers

TitleStatusHype
Generalized Decoding for Pixel, Image, and LanguageCode3
OneFormer: One Transformer to Rule Universal Image SegmentationCode3
MedSegDiff: Medical Image Segmentation with Diffusion Probabilistic ModelCode3
Vision Transformers: From Semantic Segmentation to Dense PredictionCode3
XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory ModelCode3
PointNeXt: Revisiting PointNet++ with Improved Training and Scaling StrategiesCode3
Vision Transformer Adapter for Dense PredictionsCode3
UNetFormer: A Unified Vision Transformer Model and Pre-Training Framework for 3D Medical Image SegmentationCode3
Nuclei instance segmentation and classification in histopathology images with StarDistCode3
Transformers in Medical Imaging: A SurveyCode3
XCiT: Cross-Covariance Image TransformersCode3
Vision Transformers for Dense PredictionCode3
UNETR: Transformers for 3D Medical Image SegmentationCode3
MA-Net: A Multi-Scale Attention Network for Liver and Tumor SegmentationCode3
ResNeSt: Split-Attention NetworksCode3
FDA: Fourier Domain Adaptation for Semantic SegmentationCode3
Recovering Realistic Texture in Image Super-resolution by Deep Spatial Feature TransformCode3
U-Net: Convolutional Networks for Biomedical Image SegmentationCode3
Alleviating Textual Reliance in Medical Language-guided Segmentation via Prototype-driven Semantic ApproximationCode2
Pre-Trained LLM is a Semantic-Aware and Generalizable Segmentation BoosterCode2
Urban1960SatSeg: Unsupervised Semantic Segmentation of Mid-20^th century Urban Landscapes with Satellite ImageriesCode2
Segment This Thing: Foveated Tokenization for Efficient Point-Prompted SegmentationCode2
VideoMolmo: Spatio-Temporal Grounding Meets PointingCode2
Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and VideosCode2
Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter EmbeddingCode2
TextRegion: Text-Aligned Region Tokens from Frozen Image-Text ModelsCode2
The Missing Point in Vision Transformers for Universal Image SegmentationCode2
Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System CollaborationCode2
MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-LearningCode2
Recent Advances in Medical Imaging Segmentation: A SurveyCode2
Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and SegmentationCode2
DeCLIP: Decoupled Learning for Open-Vocabulary Dense PerceptionCode2
Rethinking Boundary Detection in Deep Learning-Based Medical Image SegmentationCode2
Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and OutlookCode2
Digital Twin Generation from Visual Data: A SurveyCode2
The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single TransformerCode2
P2Object: Single Point Supervised Object Detection and Instance SegmentationCode2
GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video SegmentationCode2
Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency AdaptationCode2
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal PromptingCode2
SlicerNNInteractive: A 3D Slicer extension for nnInteractiveCode2
Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic SegmentationCode2
Delineate Anything: Resolution-Agnostic Field Boundary Delineation on Satellite ImageryCode2
Scene-Centric Unsupervised Panoptic SegmentationCode2
A Unified Image-Dense Annotation Generation Model for Underwater ScenesCode2
Towards Generating Realistic 3D Semantic Training Data for Autonomous DrivingCode2
Exploring CLIP's Dense Knowledge for Weakly Supervised Semantic SegmentationCode2
COB-GS: Clear Object Boundaries in 3DGS Segmentation Based on Boundary-Adaptive Gaussian SplittingCode2
MaSS13K: A Matting-level Semantic Segmentation BenchmarkCode2
DINO in the Room: Leveraging 2D Foundation Models for 3D SegmentationCode2
Show:102550
← PrevPage 4 of 296Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
5InternImage-HValidation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified