SOTAVerified

Semantic Segmentation

Papers

Showing 3801-3850 of 14763 papers

| Title | Status | Hype |
|---|---|---|
| ShareCMP: Polarization-Aware RGB-P Semantic Segmentation | Code | 1 |
| AI-SAM: Automatic and Interactive Segment Anything Model | Code | 1 |
| Uni3DL: Unified Model for 3D and Language Understanding | | 0 |
| PartSLIP++: Enhancing Low-Shot 3D Part Segmentation via Multi-View Instance Segmentation and Maximum Likelihood Estimation | Code | 1 |
| DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control | | 0 |
| Towards More Unified In-context Visual Understanding | | 0 |
| Graph Information Bottleneck for Remote Sensing Segmentation | | 0 |
| Towards Granularity-adjusted Pixel-level Semantic Annotation | | 0 |
| Panoptica -- instance-wise evaluation of 3D semantic and instance segmentation maps | Code | 1 |
| SAM-Assisted Remote Sensing Imagery Semantic Segmentation with Object and Boundary Constraints | Code | 2 |
| Breast Cancer Detection Using Deep Learning Technique Based On Ultrasound Image | | 0 |
| Geometrically-driven Aggregation for Zero-shot 3D Point Cloud Understanding | Code | 1 |
| Contrastive Learning-Based Spectral Knowledge Distillation for Multi-Modality and Missing Modality Scenarios in Semantic Segmentation | | 0 |
| SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference | Code | 1 |
| Class-Discriminative Attention Maps for Vision Transformers | | 0 |
| Learning Efficient Unsupervised Satellite Image-based Building Damage Detection | Code | 1 |
| MobileUtr: Revisiting the relationship between light-weight CNN and Transformer for efficient medical image segmentation | Code | 1 |
| SRSNetwork: Siamese Reconstruction-Segmentation Networks based on Dynamic-Parameter Convolution | Code | 0 |
| Strong but simple: A Baseline for Domain Generalized Dense Perception by CLIP-based Transfer Learning | Code | 1 |
| Instance-guided Cartoon Editing with a Large-scale Dataset | | 0 |
| Generalization by Adaptation: Diffusion-Based Domain Extension for Domain-Generalized Semantic Segmentation | Code | 1 |
| Few Clicks Suffice: Active Test-Time Adaptation for Semantic Segmentation | | 0 |
| ResEnsemble-DDPM: Residual Denoising Diffusion Probabilistic Models for Ensemble Learning | | 0 |
| Hulk: A Universal Knowledge Translator for Human-Centric Tasks | Code | 2 |
| SANeRF-HQ: Segment Anything for NeRF in High Quality | | 0 |
| Automatic Report Generation for Histopathology images using pre-trained Vision Transformers and BERT | Code | 0 |
| G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training | Code | 0 |
| A Review and A Robust Framework of Data-Efficient 3D Scene Parsing with Traditional/Learned 3D Descriptors | | 0 |
| A Data-efficient Framework for Robotics Large-scale LiDAR Scene Parsing | | 0 |
| T3D: Advancing 3D Medical Vision-Language Pre-training by Learning Multi-View Visual Consistency | | 0 |
| TranSegPGD: Improving Transferability of Adversarial Examples on Semantic Segmentation | | 0 |
| Semantic segmentation of SEM images of lower bainitic and tempered martensitic steels | | 0 |
| Virtual Category Learning: A Semi-Supervised Learning Method for Dense Prediction with Extremely Limited Labels | Code | 1 |
| EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything | Code | 4 |
| Improve Supervised Representation Learning with Masked Image Modeling | | 0 |
| Towards Generalizable Referring Image Segmentation via Target Prompt and Visual Coherence | | 0 |
| SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers | Code | 1 |
| Grounding Everything: Emerging Localization Properties in Vision-Language Transformers | Code | 1 |
| A knowledge-based data-driven (KBDD) framework for all-day identification of cloud types using satellite remote sensing | | 0 |
| Improving Normalization with the James-Stein Estimator | | 0 |
| A Recent Survey of Vision Transformers for Medical Image Segmentation | | 0 |
| Efficient Multimodal Semantic Segmentation via Dual-Prompt Learning | Code | 1 |
| SCHEME: Scalable Channel Mixer for Vision Transformers | | 0 |
| Generative Parameter-Efficient Fine-Tuning | Code | 1 |
| CellMixer: Annotation-free Semantic Cell Segmentation of Heterogeneous Cell Populations | | 0 |
| Generalized Robot 3D Vision-Language Model with Fast Rendering and Pre-Training Vision-Language Alignment | Code | 3 |
| Mitigating Over-smoothing in Transformers via Regularized Nonlocal Functionals | | 0 |
| Learning Part Segmentation from Synthetic Animals | | 0 |
| InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation | Code | 0 |
| SimulFlow: Simultaneously Extracting Feature and Identifying Target for Unsupervised Video Object Segmentation | | 0 |
Page 77 of 296

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | InternImage-H (M3I Pre-training) | Params (M) | 1,310 | | Unverified |
| 2 | ViT-P (InternImage-H) | Validation mIoU | 63.6 | | Unverified |
| 3 | ONE-PEACE | Validation mIoU | 63 | | Unverified |
| 4 | InternImage-H | Validation mIoU | 62.9 | | Unverified |
| 5 | M3I Pre-training (InternImage-H) | Validation mIoU | 62.9 | | Unverified |
| 6 | BEiT-3 | Validation mIoU | 62.8 | | Unverified |
| 7 | EVA | Validation mIoU | 62.3 | | Unverified |
| 8 | ViT-P (OneFormer, InternImage-H) | Validation mIoU | 61.6 | | Unverified |
| 9 | ViT-Adapter-L (Mask2Former, BEiTv2 pretrain) | Validation mIoU | 61.5 | | Unverified |
| 10 | FD-SwinV2-G | Validation mIoU | 61.4 | | Unverified |