SOTAVerified

Semantic Segmentation

Papers

Showing 426450 of 14763 papers

TitleStatusHype
OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding0
CAP-Net: A Unified Network for 6D Pose and Size Estimation of Categorical Articulated Parts from a Single RGB-D Image0
Efficient and Robust Remote Sensing Image Denoising Using Randomized Approximation of Geodesics' Gramian on the Manifold Underlying the Patch Space0
Efficient Medical Image Restoration via Reliability Guided Learning in Frequency Domain0
IGL-DT: Iterative Global-Local Feature Learning with Dual-Teacher Semantic Segmentation Framework under Limited Annotation Scheme0
HDC: Hierarchical Distillation for Multi-level Noisy Consistency in Semi-Supervised Fetal Ultrasound Segmentation0
M2S-RoAD: Multi-Modal Semantic Segmentation for Road Damage Using Camera and LiDAR DataCode0
The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single TransformerCode2
FLOSS: Free Lunch in Open-vocabulary Semantic SegmentationCode1
MASSeg : 2nd Technical Report for 4th PVUW MOSE TrackCode0
Real-time Seafloor Segmentation and Mapping0
Advancing RFI-Detection in Radio Astronomy with Liquid State Machines0
Zero-shot Autonomous Microscopy for Scalable and Intelligent Characterization of 2D Materials0
FVOS for MOSE Track of 4th PVUW Challenge: 3rd Place Solution0
Mixture-of-Shape-Experts (MoSE): End-to-End Shape Dictionary Framework to Prompt SAM for Generalizable Medical Segmentation0
AerOSeg: Harnessing SAM for Open-Vocabulary Segmentation in Remote Sensing Images0
PathSeqSAM: Sequential Modeling for Pathology Image Segmentation with SAM2Code0
Multi-Modal Brain Tumor Segmentation via 3D Multi-Scale Self-attention and Cross-attention0
A Unified Loss for Handling Inter-Class and Intra-Class Imbalance in Medical Image SegmentationCode0
Do Segmentation Models Understand Vascular Structure? A Blob-Based XAI Framework0
Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization0
DSM: Building A Diverse Semantic Map for 3D Visual Grounding0
Offline Reinforcement Learning using Human-Aligned Reward Labeling for Autonomous Emergency Braking in Occluded Pedestrian Crossing0
Multi-person Physics-based Pose Estimation for Combat Sports0
Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data GenerationCode0
Show:102550
← PrevPage 18 of 591Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
5InternImage-HValidation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified