SOTAVerified

Semantic Segmentation

Papers

Showing 28262850 of 14763 papers

TitleStatusHype
Focal Attention for Long-Range Interactions in Vision TransformersCode1
Voint Cloud: Multi-View Point Cloud Representation for 3D UnderstandingCode1
PolyWorld: Polygonal Building Extraction with Graph Neural Networks in Satellite ImagesCode1
CRIS: CLIP-Driven Referring Image SegmentationCode1
The Devil is in the Margin: Margin-based Label Smoothing for Network CalibrationCode1
DAFormer: Improving Network Architectures and Training Strategies for Domain-Adaptive Semantic SegmentationCode1
Searching the Search Space of Vision TransformerCode1
End-to-End Referring Video Object Segmentation with Multimodal TransformersCode1
A Robust Volumetric Transformer for Accurate 3D Tumor SegmentationCode1
Modeling Annotator Preference and Stochastic Annotation Error for Medical Image SegmentationCode1
Contrastive Object-level Pre-training with Spatial Noise Curriculum LearningCode1
Efficient Self-Ensemble for Semantic SegmentationCode1
Mask Transfiner for High-Quality Instance SegmentationCode1
Towards Fewer Annotations: Active Learning via Region Impurity and Prediction Uncertainty for Domain Adaptive Semantic SegmentationCode1
NomMer: Nominate Synergistic Context in Vision Transformer for Visual RecognitionCode1
Scene Representation Transformer: Geometry-Free Novel View Synthesis Through Set-Latent Scene RepresentationsCode1
Semantic-Aware Generation for Self-Supervised Visual Representation LearningCode1
Perturbed and Strict Mean Teachers for Semi-supervised Semantic SegmentationCode1
BoxeR: Box-Attention for 2D and 3D TransformersCode1
Pixel-wise Energy-biased Abstention Learning for Anomaly Segmentation on Complex Urban Driving ScenesCode1
PeCo: Perceptual Codebook for BERT Pre-training of Vision TransformersCode1
Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-LabelingCode1
EvDistill: Asynchronous Events to End-task Learning via Bidirectional Reconstruction-guided Cross-modal Knowledge DistillationCode1
Conditional Object-Centric Learning from VideoCode1
SPCL: A New Framework for Domain Adaptive Semantic Segmentation via Semantic Prototype-based Contrastive LearningCode1
Show:102550
← PrevPage 114 of 591Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4InternImage-HValidation mIoU62.9Unverified
5M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified