SOTAVerified

Semantic Segmentation

Papers

Showing 876900 of 14763 papers

TitleStatusHype
Learning Motion and Temporal Cues for Unsupervised Video Object SegmentationCode1
Advancing Semantic Future Prediction through Multimodal Visual Sequence TransformersCode1
TimberVision: A Multi-Task Dataset and Framework for Log-Component Segmentation and Tracking in Autonomous Forestry OperationsCode1
Skip Mamba Diffusion for Monocular 3D Semantic Scene CompletionCode1
Toward Realistic Camouflaged Object Detection: Benchmarks and MethodCode1
Multi-task Visual Grounding with Coarse-to-Fine Consistency ConstraintsCode1
D3RM: A Discrete Denoising Diffusion Refinement Model for Piano TranscriptionCode1
LM-Net: A Light-weight and Multi-scale Network for Medical Image SegmentationCode1
AIF-SFDA: Autonomous Information Filter-driven Source-Free Domain Adaptation for Medical Image SegmentationCode1
KM-UNet KAN Mamba UNet for medical image segmentationCode1
Efficient Connectivity-Preserving Instance Segmentation with Supervoxel-Based Loss FunctionCode1
EffiDec3D: An Optimized Decoder for High-Performance and Efficient 3D Medical Image SegmentationCode1
POT: Prototypical Optimal Transport for Weakly Supervised Semantic SegmentationCode1
CSC-PA: Cross-image Semantic Correlation via Prototype Attentions for Single-network Semi-supervised Breast Tumor SegmentationCode1
Relation3D : Enhancing Relation Modeling for Point Cloud Instance SegmentationCode1
FGAseg: Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic SegmentationCode1
Mamba4D: Efficient 4D Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space ModelsCode1
Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot GeneralizationCode1
VisionGRU: A Linear-Complexity RNN Model for Efficient Image AnalysisCode1
QTSeg: A Query Token-Based Architecture for Efficient 2D Medical Image SegmentationCode1
AFANet: Adaptive Frequency-Aware Network for Weakly-Supervised Few-Shot Semantic SegmentationCode1
Uncertainty-Participation Context Consistency Learning for Semi-supervised Semantic SegmentationCode1
Spike2Former: Efficient Spiking Transformer for High-performance Image SegmentationCode1
PC-BEV: An Efficient Polar-Cartesian BEV Fusion Framework for LiDAR Semantic SegmentationCode1
M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object SegmentationCode1
Show:102550
← PrevPage 36 of 591Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
5InternImage-HValidation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified