SOTAVerified

Semantic Segmentation

Papers

Showing 29012950 of 14763 papers

TitleStatusHype
Contextual Transformer Networks for Visual RecognitionCode1
Efficient Multi-Task Scene Analysis with RGB-D TransformersCode1
Dynamic Divide-and-Conquer Adversarial Training for Robust Semantic SegmentationCode1
CamoDiffusion: Camouflaged Object Detection via Conditional Diffusion ModelsCode1
Dynamic Focus-aware Positional Queries for Semantic SegmentationCode1
LVIS: A Dataset for Large Vocabulary Instance SegmentationCode1
A framework for large-scale mapping of human settlement extent from Sentinel-2 images via fully convolutional neural networksCode1
Data Efficient 3D Learner via Knowledge Transferred from 2D ModelCode1
Dynamic Convolution for 3D Point Cloud Instance SegmentationCode1
LV-UNet: A Lightweight and Vanilla Model for Medical Image SegmentationCode1
Addressing Failure Detection by Learning Model ConfidenceCode1
Dynamic Dictionary Learning for Remote Sensing Image SegmentationCode1
M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object SegmentationCode1
M^4oE: A Foundation Model for Medical Multimodal Image Segmentation with Mixture of ExpertsCode1
MADAN: Multi-source Adversarial Domain Aggregation Network for Domain AdaptationCode1
MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation SegmentationCode1
Dynamic Fusion Module Evolves Drivable Area and Road Anomaly Detection: A Benchmark and AlgorithmsCode1
MagicNet: Semi-Supervised Multi-Organ Segmentation via Magic-Cube Partition and RecoveryCode1
Cal-SFDA: Source-Free Domain-adaptive Semantic Segmentation with Differentiable Expected Calibration ErrorCode1
Making Vision Transformers Efficient from A Token Sparsification ViewCode1
Continuous Conditional Random Field Convolution for Point Cloud SegmentationCode1
Continuous Copy-Paste for One-Stage Multi-Object Tracking and SegmentationCode1
ASSANet: An Anisotropic Separable Set Abstraction for Efficient Point Cloud Representation LearningCode1
Dynamic 3D Scene Analysis by Point Cloud AccumulationCode1
DyCo3D: Robust Instance Segmentation of 3D Point Clouds through Dynamic ConvolutionCode1
DynaMask: Dynamic Mask Selection for Instance SegmentationCode1
Dynamically Instance-Guided Adaptation: A Backward-Free Approach for Test-Time Domain Adaptive Semantic SegmentationCode1
Dataset Diffusion: Diffusion-based Synthetic Dataset Generation for Pixel-Level Semantic SegmentationCode1
Dynamic Graph CNN for Learning on Point CloudsCode1
CAMS: Convolution and Attention-Free Mamba-based Cardiac Image SegmentationCode1
Map-Guided Curriculum Domain Adaptation and Uncertainty-Aware Evaluation for Semantic Nighttime Image SegmentationCode1
Margin Preserving Self-paced Contrastive Learning Towards Domain Adaptation for Medical Image SegmentationCode1
Contour Proposal Networks for Biomedical Instance SegmentationCode1
MarS3D: A Plug-and-Play Motion-Aware Model for Semantic Segmentation on Multi-Scan 3D Point CloudsCode1
A Comparative Evaluation of Deep Learning Techniques for Photovoltaic Panel Detection from Aerial ImagesCode1
Dataset Enhancement with Instance-Level AugmentationsCode1
DuSSS: Dual Semantic Similarity-Supervised Vision-Language Model for Semi-Supervised Medical Image SegmentationCode1
DuAT: Dual-Aggregation Transformer Network for Medical Image SegmentationCode1
Dual Transfer Learning for Event-based End-task Prediction via Pluggable Event to Image TranslationCode1
A Transductive Approach for Video Object SegmentationCode1
Contrastive Grouping with Transformer for Referring Image SegmentationCode1
Mask-Attention-Free Transformer for 3D Instance SegmentationCode1
BSUV-Net: A Fully-Convolutional Neural Network for Background Subtraction of Unseen VideosCode1
Duo-SegNet: Adversarial Dual-Views for Semi-Supervised Medical Image SegmentationCode1
A Tri-Layer Plugin to Improve Occluded DetectionCode1
Masked Based Unsupervised Content TransferCode1
3DLabelProp: Geometric-Driven Domain Generalization for LiDAR Semantic Segmentation in Autonomous DrivingCode1
Masked Event Modeling: Self-Supervised Pretraining for Event CamerasCode1
DVIS: Decoupled Video Instance Segmentation FrameworkCode1
Dual Progressive Transformations for Weakly Supervised Semantic SegmentationCode1
Show:102550
← PrevPage 59 of 296Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4InternImage-HValidation mIoU62.9Unverified
5M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified