SOTAVerified

Semantic Segmentation

Papers

Showing 1251-1275 of 14763 papers

| Title | Status | Hype |
|---|---|---|
| DINOv2 based Self Supervised Learning For Few Shot Medical Image Segmentation | Code | 1 |
| Enhancing Weakly Supervised 3D Medical Image Segmentation through Probabilistic-aware Learning | Code | 1 |
| End-to-End Human Instance Matting | Code | 1 |
| AIO2: Online Correction of Object Labels for Deep Learning with Incomplete Annotation in Remote Sensing Image Segmentation | Code | 1 |
| Benchmarking Segmentation Models with Mask-Preserved Attribute Editing | Code | 1 |
| VideoMAC: Video Masked Autoencoders Meet ConvNets | Code | 1 |
| FedLPPA: Learning Personalized Prompt and Aggregation for Federated Weakly-supervised Medical Image Segmentation | Code | 1 |
| Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised Semantic Segmentation with Its Class Label | Code | 1 |
| Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation | Code | 1 |
| BLO-SAM: Bi-level Optimization Based Overfitting-Preventing Finetuning of SAM | Code | 1 |
| Placing Objects in Context via Inpainting for Out-of-distribution Segmentation | Code | 1 |
| LLMBind: A Unified Modality-Task Integration Framework | Code | 1 |
| DeiSAM: Segment Anything with Deictic Prompting | Code | 1 |
| BenchCloudVision: A Benchmark Analysis of Deep Learning Approaches for Cloud Detection and Segmentation in Remote Sensing Imagery | Code | 1 |
| Object-level Geometric Structure Preserving for Natural Image Stitching | Code | 1 |
| LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception Tasks | Code | 1 |
| Perceiving Longer Sequences With Bi-Directional Cross-Attention Transformers | Code | 1 |
| Semi-supervised Medical Image Segmentation Method Based on Cross-pseudo Labeling Leveraging Strong and Weak Data Augmentation Strategies | Code | 1 |
| ReViT: Enhancing Vision Transformers Feature Diversity with Attention Residual Connections | Code | 1 |
| ChatEarthNet: A Global-Scale Image-Text Dataset Empowering Vision-Language Geo-Foundation Models | Code | 1 |
| LSRFormer: Efficient Transformer Supply Convolutional Neural Networks with Global Information for Aerial Image Segmentation | Code | 1 |
| Lester: rotoscope animation through video object segmentation and tracking | Code | 1 |
| MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations | Code | 1 |
| MM-Point: Multi-View Information-Enhanced Multi-Modal Self-Supervised 3D Point Cloud Understanding | Code | 1 |
| TDViT: Temporal Dilated Video Transformer for Dense Video Tasks | Code | 1 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | InternImage-H (M3I Pre-training) | Params (M) | 1,310 | | Unverified |
| 2 | ViT-P (InternImage-H) | Validation mIoU | 63.6 | | Unverified |
| 3 | ONE-PEACE | Validation mIoU | 63 | | Unverified |
| 4 | M3I Pre-training (InternImage-H) | Validation mIoU | 62.9 | | Unverified |
| 5 | InternImage-H | Validation mIoU | 62.9 | | Unverified |
| 6 | BEiT-3 | Validation mIoU | 62.8 | | Unverified |
| 7 | EVA | Validation mIoU | 62.3 | | Unverified |
| 8 | ViT-P (OneFormer, InternImage-H) | Validation mIoU | 61.6 | | Unverified |
| 9 | ViT-Adapter-L (Mask2Former, BEiTv2 pretrain) | Validation mIoU | 61.5 | | Unverified |
| 10 | FD-SwinV2-G | Validation mIoU | 61.4 | | Unverified |