| HDBFormer: Efficient RGB-D Semantic Segmentation with A Heterogeneous Dual-Branch Framework | Apr 18, 2025 | RGBD Semantic SegmentationSemantic Segmentation | CodeCode Available | 1 |
| Cardiac MRI Semantic Segmentation for Ventricles and Myocardium using Deep Learning | Apr 18, 2025 | Deep LearningSegmentation | —Unverified | 0 |
| Putting the Segment Anything Model to the Test with 3D Knee MRI - A Comparison with State-of-the-Art Performance | Apr 17, 2025 | Image SegmentationMedical Image Segmentation | CodeCode Available | 0 |
| Stronger, Steadier & Superior: Geometric Consistency in Depth VFM Forges Domain Generalized Semantic Segmentation | Apr 17, 2025 | Semantic Segmentation | CodeCode Available | 1 |
| Hybrid Dense-UNet201 Optimization for Pap Smear Image Segmentation Using Spider Monkey Optimization | Apr 17, 2025 | Cell SegmentationImage Segmentation | —Unverified | 0 |
| SAR Object Detection with Self-Supervised Pretraining and Curriculum-Aware Sampling | Apr 17, 2025 | Disaster ResponseObject | —Unverified | 0 |
| Parsimonious Dataset Construction for Laparoscopic Cholecystectomy Structure Segmentation | Apr 17, 2025 | Active LearningInformativeness | —Unverified | 0 |
| Contour Field based Elliptical Shape Prior for the Segment Anything Model | Apr 17, 2025 | Image SegmentationSegmentation | —Unverified | 0 |
| Privacy-Preserving Operating Room Workflow Analysis using Digital Twins | Apr 17, 2025 | Depth EstimationEvent Detection | —Unverified | 0 |
| Digital Twin Generation from Visual Data: A Survey | Apr 17, 2025 | Semantic SegmentationSurvey | CodeCode Available | 2 |
| High-Fidelity Image Inpainting with Multimodal Guided GAN Inversion | Apr 17, 2025 | Generative Adversarial NetworkImage Inpainting | —Unverified | 0 |
| Cross-Frequency Collaborative Training Network and Dataset for Semi-supervised First Molar Root Canal Segmentation | Apr 16, 2025 | DiagnosticImage Segmentation | —Unverified | 0 |
| Remote sensing colour image semantic segmentation of trails created by large herbivorous Mammals | Apr 16, 2025 | Image SegmentationManagement | —Unverified | 0 |
| Single-shot Star-convex Polygon-based Instance Segmentation for Spatially-correlated Biomedical Objects | Apr 16, 2025 | Instance SegmentationSemantic Segmentation | —Unverified | 0 |
| 3D-PointZshotS: Geometry-Aware 3D Point Cloud Zero-Shot Semantic Segmentation Narrowing the Visual-Semantic Gap | Apr 16, 2025 | Point Cloud SegmentationSemantic Segmentation | CodeCode Available | 0 |
| pix2pockets: Shot Suggestions in 8-Ball Pool from a Single Image in the Wild | Apr 16, 2025 | Benchmarkingobject-detection | —Unverified | 0 |
| DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency | Apr 16, 2025 | Few-Shot LearningInteractive Segmentation | CodeCode Available | 1 |
| CAGS: Open-Vocabulary 3D Scene Understanding with Context-Aware Gaussian Splatting | Apr 16, 2025 | 3DGS3D Instance Segmentation | —Unverified | 0 |
| TextDiffSeg: Text-guided Latent Diffusion Model for 3d Medical Images Segmentation | Apr 16, 2025 | Image SegmentationLatent Diffusion Model for 3D | —Unverified | 0 |
| GrabS: Generative Embodied Agent for 3D Object Segmentation without Scene Supervision | Apr 16, 2025 | ObjectSemantic Segmentation | CodeCode Available | 1 |
| From Gaze to Insight: Bridging Human Visual Attention and Vision Language Model Explanation for Weakly-Supervised Medical Image Segmentation | Apr 15, 2025 | DiagnosticImage Segmentation | CodeCode Available | 0 |
| PraNet-V2: Dual-Supervised Reverse Attention for Medical Image Segmentation | Apr 15, 2025 | Foreground SegmentationImage Segmentation | CodeCode Available | 1 |
| Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception | Apr 15, 2025 | Data AugmentationDenoising | CodeCode Available | 1 |
| LightFormer: A lightweight and efficient decoder for remote sensing image segmentation | Apr 15, 2025 | Change DetectionDecoder | —Unverified | 0 |
| OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding | Apr 15, 2025 | Semantic SegmentationVideo Generation | —Unverified | 0 |