SOTAVerified

Semantic Segmentation

Papers

Showing 8351-8400 of 14763 papers

Title | Status | Hype
M2GAN: A Multi-Stage Self-Attention Network for Image Rain Removal on Autonomous Vehicles | | 0
M^2UNet: MetaFormer Multi-scale Upsampling Network for Polyp Segmentation | | 0
M^33D: Learning 3D priors using Multi-Modal Masked Autoencoders for 2D image and video understanding | | 0
M^3Face: A Unified Multi-Modal Multilingual Framework for Human Face Generation and Editing | | 0
TAM-VT: Transformation-Aware Multi-scale Video Transformer for Segmentation and Tracking | | 0
MaAST: Map Attention with Semantic Transformers for Efficient Visual Navigation | | 0
MACD R-CNN: An Abnormal Cell Nucleus Detection Method | | 0
MacFormer: Semantic Segmentation with Fine Object Boundaries | | 0
Machine-learned 3D Building Vectorization from Satellite Imagery | | 0
Machine Learning-based Automatic Graphene Detection with Color Correction for Optical Microscope Images | | 0
Machine Learning Techniques for Biomedical Image Segmentation: An Overview of Technical Aspects and Introduction to State-of-Art Applications | | 0
MAELi: Masked Autoencoder for Large-Scale LiDAR Point Clouds | | 0
MAFormer: A Transformer Network with Multi-scale Attention Fusion for Visual Recognition | | 0
MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation via Hierarchical Modality Selection | | 0
MagicFace: Training-free Universal-Style Human Image Customized Synthesis | | 0
MaIL: A Unified Mask-Image-Language Trimodal Network for Referring Image Segmentation | | 0
MAIRA-Seg: Enhancing Radiology Report Generation with Segmentation-Aware Multimodal Large Language Models | | 0
Make Both Ends Meet: A Synergistic Optimization Infrared Small Target Detection with Streamlined Computational Overhead | | 0
Make Me a BNN: A Simple Strategy for Estimating Bayesian Uncertainty from Pre-trained Models | | 0
Making Vision Transformers Truly Shift-Equivariant | | 0
MAL: Cluster-Masked and Multi-Task Pretraining for Enhanced xLSTM Vision Performance | | 0
Pamba: Enhancing Global Interaction in Point Clouds via State Space Model | | 0
MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models | | 0
MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection | | 0
Mamba-Reg: Vision Mamba Also Needs Registers | | 0
MAMBO-NET: Multi-Causal Aware Modeling Backdoor-Intervention Optimization for Medical Image Segmentation Network | | 0
Mammogram Edge Detection Using Hybrid Soft Computing Methods | | 0
MANet: Fine-Tuning Segment Anything Model for Multimodal Remote Sensing Semantic Segmentation | | 0
Mango Tree Net -- A fully convolutional network for semantic segmentation and individual crown detection of mango trees | | 0
Manifold-driven Attention Maps for Weakly Supervised Segmentation | | 0
Attentive Semantic Exploring for Manipulated Face Detection | | 0
Many Perception Tasks are Highly Redundant Functions of their Input Data | | 0
Map-aided annotation for pole base detection | | 0
Mapping Auto-context Decision Forests to Deep ConvNets for Semantic Segmentation | | 0
Mapping horizontal and vertical urban densification in Denmark with Landsat time-series from 1985 to 2018: a semantic segmentation solution | | 0
Mapping Temporary Slums from Satellite Imagery using a Semi-Supervised Approach | | 0
Mapping the ocular surface from monocular videos with an application to dry eye disease grading | | 0
Mapping Vulnerable Populations with AI | | 0
MAPUNetR: A Hybrid Vision Transformer and U-Net Architecture for Efficient and Interpretable Medical Image Segmentation | | 0
Marginal Thresholding in Noisy Image Segmentation | | 0
Marginal Weighted Maximum Log-likelihood for Efficient Learning of Perturb-and-Map models | | 0
Marine Debris Detection in Satellite Surveillance using Attention Mechanisms | | 0
RISAM: Referring Image Segmentation via Mutual-Aware Attention Features | | 0
MARNet: Multi-Abstraction Refinement Network for 3D Point Cloud Analysis | | 0
MarsSeg: Mars Surface Semantic Segmentation with Multi-level Extractor and Connector | | 0
Mask2Anomaly: Mask Transformer for Universal Open-set Segmentation | | 0
Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors | | 0
MaskAdapt: Unsupervised Geometry-Aware Domain Adaptation Using Multimodal Contextual Learning and RGB-Depth Masking | | 0
MaskAttn-UNet: A Mask Attention-Driven Framework for Universal Low-Resolution Image Segmentation | | 0
Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval | | 0
Page 168 of 296

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | InternImage-H (M3I Pre-training) | Params (M) | 1,310 | | Unverified
2 | ViT-P (InternImage-H) | Validation mIoU | 63.6 | | Unverified
3 | ONE-PEACE | Validation mIoU | 63 | | Unverified
4 | InternImage-H | Validation mIoU | 62.9 | | Unverified
5 | M3I Pre-training (InternImage-H) | Validation mIoU | 62.9 | | Unverified
6 | BEiT-3 | Validation mIoU | 62.8 | | Unverified
7 | EVA | Validation mIoU | 62.3 | | Unverified
8 | ViT-P (OneFormer, InternImage-H) | Validation mIoU | 61.6 | | Unverified
9 | ViT-Adapter-L (Mask2Former, BEiTv2 pretrain) | Validation mIoU | 61.5 | | Unverified
10 | FD-SwinV2-G | Validation mIoU | 61.4 | | Unverified