SOTAVerified

Semantic Segmentation

Papers

Showing 401450 of 14763 papers

TitleStatusHype
DADU: Dual Attention-based Deep Supervised UNet for Automated Semantic Segmentation of Cardiac Images0
HDBFormer: Efficient RGB-D Semantic Segmentation with A Heterogeneous Dual-Branch FrameworkCode1
Digital Twin Generation from Visual Data: A SurveyCode2
SAR Object Detection with Self-Supervised Pretraining and Curriculum-Aware Sampling0
Contour Field based Elliptical Shape Prior for the Segment Anything Model0
Putting the Segment Anything Model to the Test with 3D Knee MRI - A Comparison with State-of-the-Art PerformanceCode0
High-Fidelity Image Inpainting with Multimodal Guided GAN Inversion0
Hybrid Dense-UNet201 Optimization for Pap Smear Image Segmentation Using Spider Monkey Optimization0
Parsimonious Dataset Construction for Laparoscopic Cholecystectomy Structure Segmentation0
Stronger, Steadier & Superior: Geometric Consistency in Depth VFM Forges Domain Generalized Semantic SegmentationCode1
Privacy-Preserving Operating Room Workflow Analysis using Digital Twins0
Remote sensing colour image semantic segmentation of trails created by large herbivorous Mammals0
Cross-Frequency Collaborative Training Network and Dataset for Semi-supervised First Molar Root Canal Segmentation0
3D-PointZshotS: Geometry-Aware 3D Point Cloud Zero-Shot Semantic Segmentation Narrowing the Visual-Semantic GapCode0
CAGS: Open-Vocabulary 3D Scene Understanding with Context-Aware Gaussian Splatting0
pix2pockets: Shot Suggestions in 8-Ball Pool from a Single Image in the Wild0
DC-SAM: In-Context Segment Anything in Images and Videos via Dual ConsistencyCode1
Single-shot Star-convex Polygon-based Instance Segmentation for Spatially-correlated Biomedical Objects0
GrabS: Generative Embodied Agent for 3D Object Segmentation without Scene SupervisionCode1
TextDiffSeg: Text-guided Latent Diffusion Model for 3d Medical Images Segmentation0
LightFormer: A lightweight and efficient decoder for remote sensing image segmentation0
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual PerceptionCode1
PraNet-V2: Dual-Supervised Reverse Attention for Medical Image SegmentationCode1
PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild0
From Gaze to Insight: Bridging Human Visual Attention and Vision Language Model Explanation for Weakly-Supervised Medical Image SegmentationCode0
OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding0
CAP-Net: A Unified Network for 6D Pose and Size Estimation of Categorical Articulated Parts from a Single RGB-D Image0
Efficient and Robust Remote Sensing Image Denoising Using Randomized Approximation of Geodesics' Gramian on the Manifold Underlying the Patch Space0
Efficient Medical Image Restoration via Reliability Guided Learning in Frequency Domain0
IGL-DT: Iterative Global-Local Feature Learning with Dual-Teacher Semantic Segmentation Framework under Limited Annotation Scheme0
HDC: Hierarchical Distillation for Multi-level Noisy Consistency in Semi-Supervised Fetal Ultrasound Segmentation0
M2S-RoAD: Multi-Modal Semantic Segmentation for Road Damage Using Camera and LiDAR DataCode0
The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single TransformerCode2
FLOSS: Free Lunch in Open-vocabulary Semantic SegmentationCode1
MASSeg : 2nd Technical Report for 4th PVUW MOSE TrackCode0
Real-time Seafloor Segmentation and Mapping0
Advancing RFI-Detection in Radio Astronomy with Liquid State Machines0
Zero-shot Autonomous Microscopy for Scalable and Intelligent Characterization of 2D Materials0
FVOS for MOSE Track of 4th PVUW Challenge: 3rd Place Solution0
Mixture-of-Shape-Experts (MoSE): End-to-End Shape Dictionary Framework to Prompt SAM for Generalizable Medical Segmentation0
AerOSeg: Harnessing SAM for Open-Vocabulary Segmentation in Remote Sensing Images0
PathSeqSAM: Sequential Modeling for Pathology Image Segmentation with SAM2Code0
Multi-Modal Brain Tumor Segmentation via 3D Multi-Scale Self-attention and Cross-attention0
A Unified Loss for Handling Inter-Class and Intra-Class Imbalance in Medical Image SegmentationCode0
Do Segmentation Models Understand Vascular Structure? A Blob-Based XAI Framework0
Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization0
DSM: Building A Diverse Semantic Map for 3D Visual Grounding0
Offline Reinforcement Learning using Human-Aligned Reward Labeling for Autonomous Emergency Braking in Occluded Pedestrian Crossing0
Multi-person Physics-based Pose Estimation for Combat Sports0
Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data GenerationCode0
Show:102550
← PrevPage 9 of 296Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-H (M3I Pre-training)Params (M)1,310Unverified
2ViT-P (InternImage-H)Validation mIoU63.6Unverified
3ONE-PEACEValidation mIoU63Unverified
4InternImage-HValidation mIoU62.9Unverified
5M3I Pre-training (InternImage-H)Validation mIoU62.9Unverified
6BEiT-3Validation mIoU62.8Unverified
7EVAValidation mIoU62.3Unverified
8ViT-P (OneFormer, InternImage-H)Validation mIoU61.6Unverified
9ViT-Adapter-L (Mask2Former, BEiTv2 pretrain)Validation mIoU61.5Unverified
10FD-SwinV2-GValidation mIoU61.4Unverified