SOTAVerified

Semantic Segmentation

Papers

Showing 851900 of 14763 papers

TitleStatusHype
SegAnyPET: Universal Promptable Segmentation from Positron Emission Tomography ImagesCode1
WeedsGalore: A Multispectral and Multitemporal UAV-based Dataset for Crop and Weed Segmentation in Agricultural Maize FieldsCode1
Leveraging Labelled Data Knowledge: A Cooperative Rectification Learning Network for Semi-supervised 3D Medical Image SegmentationCode1
QMaxViT-Unet+: A Query-Based MaxViT-Unet with Edge Enhancement for Scribble-Supervised Segmentation of Medical ImagesCode1
MITO: Enabling Non-Line-of-Sight Perception using Millimeter-waves through Real-World Datasets and Simulation ToolsCode1
SQ-GAN: Semantic Image Communications Using Masked Vector QuantizationCode1
Hi-End-MAE: Hierarchical encoder-driven masked autoencoders are stronger vision learners for medical image segmentationCode1
HistoSmith: Single-Stage Histology Image-Label Generation via Conditional Latent Diffusion for Enhanced Cell Segmentation and ClassificationCode1
Conditional diffusion model with spatial attention and latent embedding for medical image segmentationCode1
Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning DynamicsCode1
UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic SegmentationCode1
UD-Mamba: A pixel-level uncertainty-driven Mamba model for medical image segmentationCode1
Mind the Gap: Evaluating Patch Embeddings from General-Purpose and Histopathology Foundation Models for Cell Segmentation and ClassificationCode1
FSPGD: Rethinking Black-box Attacks on Semantic SegmentationCode1
Complex Wavelet Mutual Information Loss: A Multi-Scale Loss Function for Semantic SegmentationCode1
ContourFormer:Real-Time Contour-Based End-to-End Instance Segmentation TransformerCode1
Efficient Redundancy Reduction for Open-Vocabulary Semantic SegmentationCode1
SeqSeg: Learning Local Segments for Automatic Vascular Model ConstructionCode1
3DLabelProp: Geometric-Driven Domain Generalization for LiDAR Semantic Segmentation in Autonomous DrivingCode1
MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object SegmentationCode1
MedicoSAM: Towards foundation models for medical image segmentationCode1
Automatic Labelling & Semantic Segmentation with 4D Radar TensorsCode1
Semi-supervised Semantic Segmentation for Remote Sensing Images via Multi-scale Uncertainty Consistency and Cross-Teacher-Student AttentionCode1
Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural NetworksCode1
HSPFormer: Hierarchical Spatial Perception Transformer for Semantic SegmentationCode1
Learning Motion and Temporal Cues for Unsupervised Video Object SegmentationCode1
Advancing Semantic Future Prediction through Multimodal Visual Sequence TransformersCode1
TimberVision: A Multi-Task Dataset and Framework for Log-Component Segmentation and Tracking in Autonomous Forestry OperationsCode1
Skip Mamba Diffusion for Monocular 3D Semantic Scene CompletionCode1
Toward Realistic Camouflaged Object Detection: Benchmarks and MethodCode1
Multi-task Visual Grounding with Coarse-to-Fine Consistency ConstraintsCode1
D3RM: A Discrete Denoising Diffusion Refinement Model for Piano TranscriptionCode1
LM-Net: A Light-weight and Multi-scale Network for Medical Image SegmentationCode1
AIF-SFDA: Autonomous Information Filter-driven Source-Free Domain Adaptation for Medical Image SegmentationCode1
KM-UNet KAN Mamba UNet for medical image segmentationCode1
Efficient Connectivity-Preserving Instance Segmentation with Supervoxel-Based Loss FunctionCode1
EffiDec3D: An Optimized Decoder for High-Performance and Efficient 3D Medical Image SegmentationCode1
POT: Prototypical Optimal Transport for Weakly Supervised Semantic SegmentationCode1
CSC-PA: Cross-image Semantic Correlation via Prototype Attentions for Single-network Semi-supervised Breast Tumor SegmentationCode1
Relation3D : Enhancing Relation Modeling for Point Cloud Instance SegmentationCode1
FGAseg: Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic SegmentationCode1
Mamba4D: Efficient 4D Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space ModelsCode1
Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot GeneralizationCode1
VisionGRU: A Linear-Complexity RNN Model for Efficient Image AnalysisCode1
QTSeg: A Query Token-Based Architecture for Efficient 2D Medical Image SegmentationCode1
AFANet: Adaptive Frequency-Aware Network for Weakly-Supervised Few-Shot Semantic SegmentationCode1
Uncertainty-Participation Context Consistency Learning for Semi-supervised Semantic SegmentationCode1
Spike2Former: Efficient Spiking Transformer for High-performance Image SegmentationCode1
PC-BEV: An Efficient Polar-Cartesian BEV Fusion Framework for LiDAR Semantic SegmentationCode1
M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object SegmentationCode1
Show:102550
← PrevPage 18 of 296Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4InternImage-HValidation mIoU62.9Unverified
5M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified