SOTAVerified

Semantic Segmentation

Papers

Showing 3501-3550 of 14763 papers

| Title | Status | Hype |
| --- | --- | --- |
| AziNorm: Exploiting the Radial Symmetry of Point Cloud for Azimuth-Normalized 3D Perception | Code | 1 |
| Landmark-free Statistical Shape Modeling via Neural Flow Deformations | Code | 1 |
| ALPS: An Auto-Labeling and Pre-training Scheme for Remote Sensing Segmentation With Segment Anything Model | Code | 1 |
| Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation | Code | 1 |
| MCPA: Multi-scale Cross Perceptron Attention Network for 2D Medical Image Segmentation | Code | 1 |
| Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation | Code | 1 |
| Adaptive t-vMF Dice Loss for Multi-class Medical Image Segmentation | Code | 1 |
| Backdoor Attacks for Remote Sensing Data with Wavelet Transform | Code | 1 |
| Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation | Code | 1 |
| Language Guided Domain Generalized Medical Image Segmentation | Code | 1 |
| LaPE: Layer-adaptive Position Embedding for Vision Transformers with Independent Layer Normalization | Code | 1 |
| Laplacian2Mesh: Laplacian-Based Mesh Understanding | Code | 1 |
| LaRa: Latents and Rays for Multi-Camera Bird's-Eye-View Semantic Segmentation | Code | 1 |
| Large-batch Optimization for Dense Visual Predictions | Code | 1 |
| Background-Aware Pooling and Noise-Aware Loss for Weakly-Supervised Semantic Segmentation | Code | 1 |
| MaskingDepth: Masked Consistency Regularization for Semi-supervised Monocular Depth Estimation | Code | 1 |
| Alternate Diverse Teaching for Semi-supervised Medical Image Segmentation | Code | 1 |
| Large-scale 6D Object Pose Estimation Dataset for Industrial Bin-Picking | Code | 1 |
| Dense Cross-Query-and-Support Attention Weighted Mask Aggregation for Few-Shot Segmentation | Code | 1 |
| MCP-MedSAM: A Powerful Lightweight Medical Segment Anything Model Trained with a Single GPU in Just One Day | Code | 1 |
| Bridging the Domain Gap: Self-Supervised 3D Scene Understanding with Foundation Models | Code | 1 |
| Large Separable Kernel Attention: Rethinking the Large Kernel Attention Design in CNN | Code | 1 |
| Extract Free Dense Labels from CLIP | Code | 1 |
| Deep Learning based Food Instance Segmentation using Synthetic Data | Code | 1 |
| mc-BEiT: Multi-choice Discretization for Image BERT Pre-training | Code | 1 |
| 1st Place Solution for the 5th LSVOS Challenge: Video Instance Segmentation | Code | 1 |
| Active Negative Loss: A Robust Framework for Learning with Noisy Labels | Code | 1 |
| Latent Discriminant deterministic Uncertainty | Code | 1 |
| Learning Class-Agnostic Pseudo Mask Generation for Box-Supervised Semantic Segmentation | Code | 1 |
| Learning Fast and Robust Target Models for Video Object Segmentation | Code | 1 |
| LayerCAM: Exploring Hierarchical Class Activation Maps for Localization | Code | 1 |
| Semi-Supervised Semantic Segmentation via Gentle Teaching Assistant | Code | 1 |
| Lawin Transformer: Improving Semantic Segmentation Transformer with Multi-Scale Representations via Large Window Attention | Code | 1 |
| LAVT: Language-Aware Vision Transformer for Referring Image Segmentation | Code | 1 |
| Active Pointly-Supervised Instance Segmentation | Code | 1 |
| Leaf Only SAM: A Segment Anything Pipeline for Zero-Shot Automated Leaf Segmentation | Code | 1 |
| Learning Calibrated Medical Image Segmentation via Multi-Rater Agreement Modeling | Code | 1 |
| Semi-supervised Semantic Segmentation with Prototype-based Consistency Regularization | Code | 1 |
| Learnable Earth Parser: Discovering 3D Prototypes in Aerial Scans | Code | 1 |
| Learnable Polyphase Sampling for Shift Invariant and Equivariant Convolutional Networks | Code | 1 |
| Bag of Tricks for Image Classification with Convolutional Neural Networks | Code | 1 |
| DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting | Code | 1 |
| Learn from Foundation Model: Fruit Detection Model without Manual Annotation | Code | 1 |
| SeMLaPS: Real-time Semantic Mapping with Latent Prior Networks and Quasi-Planar Segmentation | Code | 1 |
| Balanced Energy Regularization Loss for Out-of-distribution Detection | Code | 1 |
| Learning Across Domains and Devices: Style-Driven Source-Free Domain Adaptation in Clustered Federated Learning | Code | 1 |
| Learning Dynamic Network Using a Reuse Gate Function in Semi-supervised Video Object Segmentation | Code | 1 |
| Learning at a Glance: Towards Interpretable Data-limited Continual Semantic Segmentation via Semantic-Invariance Modelling | Code | 1 |
| Balanced Meta-Softmax for Long-Tailed Visual Recognition | Code | 1 |
| Dense Dilated Convolutions Merging Network for Land Cover Classification | Code | 1 |
Page 71 of 296

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | InternImage-H (M3I Pre-training) | Params (M) | 1,310 | | Unverified |
| 2 | ViT-P (InternImage-H) | Validation mIoU | 63.6 | | Unverified |
| 3 | ONE-PEACE | Validation mIoU | 63 | | Unverified |
| 4 | M3I Pre-training (InternImage-H) | Validation mIoU | 62.9 | | Unverified |
| 5 | InternImage-H | Validation mIoU | 62.9 | | Unverified |
| 6 | BEiT-3 | Validation mIoU | 62.8 | | Unverified |
| 7 | EVA | Validation mIoU | 62.3 | | Unverified |
| 8 | ViT-P (OneFormer, InternImage-H) | Validation mIoU | 61.6 | | Unverified |
| 9 | ViT-Adapter-L (Mask2Former, BEiTv2 pretrain) | Validation mIoU | 61.5 | | Unverified |
| 10 | FD-SwinV2-G | Validation mIoU | 61.4 | | Unverified |