| DADU: Dual Attention-based Deep Supervised UNet for Automated Semantic Segmentation of Cardiac Images | Apr 18, 2025 | Edge DetectionImage Segmentation | —Unverified | 0 |
| HDBFormer: Efficient RGB-D Semantic Segmentation with A Heterogeneous Dual-Branch Framework | Apr 18, 2025 | RGBD Semantic SegmentationSemantic Segmentation | CodeCode Available | 1 |
| Digital Twin Generation from Visual Data: A Survey | Apr 17, 2025 | Semantic SegmentationSurvey | CodeCode Available | 2 |
| SAR Object Detection with Self-Supervised Pretraining and Curriculum-Aware Sampling | Apr 17, 2025 | Disaster ResponseObject | —Unverified | 0 |
| Contour Field based Elliptical Shape Prior for the Segment Anything Model | Apr 17, 2025 | Image SegmentationSegmentation | —Unverified | 0 |
| Putting the Segment Anything Model to the Test with 3D Knee MRI - A Comparison with State-of-the-Art Performance | Apr 17, 2025 | Image SegmentationMedical Image Segmentation | CodeCode Available | 0 |
| High-Fidelity Image Inpainting with Multimodal Guided GAN Inversion | Apr 17, 2025 | Generative Adversarial NetworkImage Inpainting | —Unverified | 0 |
| Hybrid Dense-UNet201 Optimization for Pap Smear Image Segmentation Using Spider Monkey Optimization | Apr 17, 2025 | Cell SegmentationImage Segmentation | —Unverified | 0 |
| Parsimonious Dataset Construction for Laparoscopic Cholecystectomy Structure Segmentation | Apr 17, 2025 | Active LearningInformativeness | —Unverified | 0 |
| Stronger, Steadier & Superior: Geometric Consistency in Depth VFM Forges Domain Generalized Semantic Segmentation | Apr 17, 2025 | Semantic Segmentation | CodeCode Available | 1 |
| Privacy-Preserving Operating Room Workflow Analysis using Digital Twins | Apr 17, 2025 | Depth EstimationEvent Detection | —Unverified | 0 |
| Remote sensing colour image semantic segmentation of trails created by large herbivorous Mammals | Apr 16, 2025 | Image SegmentationManagement | —Unverified | 0 |
| Cross-Frequency Collaborative Training Network and Dataset for Semi-supervised First Molar Root Canal Segmentation | Apr 16, 2025 | DiagnosticImage Segmentation | —Unverified | 0 |
| 3D-PointZshotS: Geometry-Aware 3D Point Cloud Zero-Shot Semantic Segmentation Narrowing the Visual-Semantic Gap | Apr 16, 2025 | Point Cloud SegmentationSemantic Segmentation | CodeCode Available | 0 |
| CAGS: Open-Vocabulary 3D Scene Understanding with Context-Aware Gaussian Splatting | Apr 16, 2025 | 3DGS3D Instance Segmentation | —Unverified | 0 |
| pix2pockets: Shot Suggestions in 8-Ball Pool from a Single Image in the Wild | Apr 16, 2025 | Benchmarkingobject-detection | —Unverified | 0 |
| DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency | Apr 16, 2025 | Few-Shot LearningInteractive Segmentation | CodeCode Available | 1 |
| Single-shot Star-convex Polygon-based Instance Segmentation for Spatially-correlated Biomedical Objects | Apr 16, 2025 | Instance SegmentationSemantic Segmentation | —Unverified | 0 |
| GrabS: Generative Embodied Agent for 3D Object Segmentation without Scene Supervision | Apr 16, 2025 | ObjectSemantic Segmentation | CodeCode Available | 1 |
| TextDiffSeg: Text-guided Latent Diffusion Model for 3d Medical Images Segmentation | Apr 16, 2025 | Image SegmentationLatent Diffusion Model for 3D | —Unverified | 0 |
| LightFormer: A lightweight and efficient decoder for remote sensing image segmentation | Apr 15, 2025 | Change DetectionDecoder | —Unverified | 0 |
| Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception | Apr 15, 2025 | Data AugmentationDenoising | CodeCode Available | 1 |
| PraNet-V2: Dual-Supervised Reverse Attention for Medical Image Segmentation | Apr 15, 2025 | Foreground SegmentationImage Segmentation | CodeCode Available | 1 |
| PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild | Apr 15, 2025 | SegmentationSemantic Segmentation | —Unverified | 0 |
| From Gaze to Insight: Bridging Human Visual Attention and Vision Language Model Explanation for Weakly-Supervised Medical Image Segmentation | Apr 15, 2025 | DiagnosticImage Segmentation | CodeCode Available | 0 |
| OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding | Apr 15, 2025 | Semantic SegmentationVideo Generation | —Unverified | 0 |
| CAP-Net: A Unified Network for 6D Pose and Size Estimation of Categorical Articulated Parts from a Single RGB-D Image | Apr 15, 2025 | Instance SegmentationPose Estimation | —Unverified | 0 |
| Efficient and Robust Remote Sensing Image Denoising Using Randomized Approximation of Geodesics' Gramian on the Manifold Underlying the Patch Space | Apr 15, 2025 | DenoisingImage Denoising | —Unverified | 0 |
| Efficient Medical Image Restoration via Reliability Guided Learning in Frequency Domain | Apr 15, 2025 | DenoisingImage Denoising | —Unverified | 0 |
| IGL-DT: Iterative Global-Local Feature Learning with Dual-Teacher Semantic Segmentation Framework under Limited Annotation Scheme | Apr 14, 2025 | SegmentationSemantic Segmentation | —Unverified | 0 |
| HDC: Hierarchical Distillation for Multi-level Noisy Consistency in Semi-Supervised Fetal Ultrasound Segmentation | Apr 14, 2025 | AnatomyImage Segmentation | —Unverified | 0 |
| M2S-RoAD: Multi-Modal Semantic Segmentation for Road Damage Using Camera and LiDAR Data | Apr 14, 2025 | Autonomous VehiclesSemantic Segmentation | CodeCode Available | 0 |
| The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer | Apr 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation | Apr 14, 2025 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| MASSeg : 2nd Technical Report for 4th PVUW MOSE Track | Apr 14, 2025 | Data AugmentationObject | CodeCode Available | 0 |
| Real-time Seafloor Segmentation and Mapping | Apr 14, 2025 | Image SegmentationSegmentation | —Unverified | 0 |
| Advancing RFI-Detection in Radio Astronomy with Liquid State Machines | Apr 14, 2025 | AstronomySemantic Segmentation | —Unverified | 0 |
| Zero-shot Autonomous Microscopy for Scalable and Intelligent Characterization of 2D Materials | Apr 14, 2025 | Image SegmentationPrompt Engineering | —Unverified | 0 |
| FVOS for MOSE Track of 4th PVUW Challenge: 3rd Place Solution | Apr 13, 2025 | SegmentationSemantic Segmentation | —Unverified | 0 |
| Mixture-of-Shape-Experts (MoSE): End-to-End Shape Dictionary Framework to Prompt SAM for Generalizable Medical Segmentation | Apr 13, 2025 | Dictionary LearningDomain Generalization | —Unverified | 0 |
| AerOSeg: Harnessing SAM for Open-Vocabulary Segmentation in Remote Sensing Images | Apr 12, 2025 | Image SegmentationSegmentation | —Unverified | 0 |
| PathSeqSAM: Sequential Modeling for Pathology Image Segmentation with SAM2 | Apr 12, 2025 | Domain AdaptationImage Segmentation | CodeCode Available | 0 |
| Multi-Modal Brain Tumor Segmentation via 3D Multi-Scale Self-attention and Cross-attention | Apr 12, 2025 | Brain Tumor SegmentationDecoder | —Unverified | 0 |
| A Unified Loss for Handling Inter-Class and Intra-Class Imbalance in Medical Image Segmentation | Apr 11, 2025 | Image SegmentationMedical Image Analysis | CodeCode Available | 0 |
| Do Segmentation Models Understand Vascular Structure? A Blob-Based XAI Framework | Apr 11, 2025 | Image SegmentationMedical Image Segmentation | —Unverified | 0 |
| Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization | Apr 11, 2025 | DenoisingObject | —Unverified | 0 |
| DSM: Building A Diverse Semantic Map for 3D Visual Grounding | Apr 11, 2025 | 3D visual groundingScene Understanding | —Unverified | 0 |
| Offline Reinforcement Learning using Human-Aligned Reward Labeling for Autonomous Emergency Braking in Occluded Pedestrian Crossing | Apr 11, 2025 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| Multi-person Physics-based Pose Estimation for Combat Sports | Apr 11, 2025 | 3D Human Pose Estimation3D Multi-Person Pose Estimation | —Unverified | 0 |
| Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data Generation | Apr 11, 2025 | Depth EstimationInstance Segmentation | CodeCode Available | 0 |