SOTAVerified

Semantic Segmentation

Papers

Showing 31013150 of 14763 papers

TitleStatusHype
Visual Parser: Representing Part-whole Hierarchies with TransformersCode1
NucMM Dataset: 3D Neuronal Nuclei Instance Segmentation at Sub-Cubic Millimeter ScaleCode1
Geographical Knowledge-driven Representation Learning for Remote Sensing ImagesCode1
Transfer Learning from Synthetic to Real LiDAR Point Cloud for Semantic SegmentationCode1
Locally Enhanced Self-Attention: Combining Self-Attention and Convolution as Local and Context TermsCode1
End-to-end Trainable Deep Neural Network for Robotic Grasp Detection and Semantic Segmentation from RGBCode1
TransAttUnet: Multi-level Attention-guided U-Net with Transformer for Medical Image SegmentationCode1
A Spatial Guided Self-supervised Clustering Network for Medical Image SegmentationCode1
Anatomy of Domain Shift Impact on U-Net Layers in MRI SegmentationCode1
Form2Seq : A Framework for Higher-Order Form Structure ExtractionCode1
Capturing, Reconstructing, and Simulating: the UrbanScene3D DatasetCode1
Trans4Trans: Efficient Transformer for Transparent Object Segmentation to Help Visually Impaired People Navigate in the Real WorldCode1
Plot2Spectra: an Automatic Spectra Extraction ToolCode1
Contrastive Multimodal Fusion with TupleInfoNCECode1
On Model Calibration for Long-Tailed Object Detection and Instance SegmentationCode1
Similarity-Aware Fusion Network for 3D Semantic SegmentationCode1
Cooperative Training and Latent Space Data Augmentation for Robust Medical Image SegmentationCode1
A Survey on Deep Learning Technique for Video SegmentationCode1
UTNet: A Hybrid Transformer Architecture for Medical Image SegmentationCode1
Polarized Self-Attention: Towards High-quality Pixel-wise RegressionCode1
MASS: Multi-Attentional Semantic Segmentation of LiDAR Data for Dense Top-View UnderstandingCode1
Inter Extreme Points Geodesics for End-to-End Weakly Supervised Image SegmentationCode1
CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped WindowsCode1
DivergentNets: Medical Image Segmentation by Network EnsembleCode1
Focal Self-attention for Local-Global Interactions in Vision TransformersCode1
SimNet: Enabling Robust Unknown Object Manipulation from Pure Synthetic Data via StereoCode1
Looking Outside the Window: Wide-Context Transformer for the Semantic Segmentation of High-Resolution Remote Sensing ImagesCode1
SinGAN-Seg: Synthetic training data generation for medical image segmentationCode1
Tackling Catastrophic Forgetting and Background Shift in Continual Semantic SegmentationCode1
Multi-Compound Transformer for Accurate Biomedical Image SegmentationCode1
K-Net: Towards Unified Image SegmentationCode1
Striking the Right Balance: Recall Loss for Semantic SegmentationCode1
Semi-supervised Semantic Segmentation with Directional Context-aware ConsistencyCode1
Indoor Panorama Planar 3D Reconstruction via Divide and ConquerCode1
BiX-NAS: Searching Efficient Bi-directional Architecture for Medical Image SegmentationCode1
Semantics-aware Multi-modal Domain Translation:From LiDAR Point Clouds to Panoramic Color ImagesCode1
Semi-supervised Meta-learning with Disentanglement for Domain-generalised Medical Image SegmentationCode1
VOLO: Vision Outlooker for Visual RecognitionCode1
FusionPainting: Multimodal Fusion with Adaptive Attention for 3D Object DetectionCode1
Probabilistic Attention for Interactive SegmentationCode1
Real-time Instance Segmentation with Discriminative Orientation MapsCode1
LayerCAM: Exploring Hierarchical Class Activation Maps for LocalizationCode1
SSUL: Semantic Segmentation with Unknown Label for Exemplar-based Class-Incremental LearningCode1
P2T: Pyramid Pooling Transformer for Scene UnderstandingCode1
Tracking Instances as QueriesCode1
EPMF: Efficient Perception-aware Multi-sensor Fusion for 3D Semantic SegmentationCode1
Quality-Aware Memory Network for Interactive Volumetric Image SegmentationCode1
Delving Deep Into Many-to-Many Attention for Few-Shot Video Object SegmentationCode1
Point 4D Transformer Networks for Spatio-Temporal Modeling in Point Cloud VideosCode1
Reciprocal Transformations for Unsupervised Video Object SegmentationCode1
Show:102550
← PrevPage 63 of 296Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4InternImage-HValidation mIoU62.9Unverified
5M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified