| Offline Reinforcement Learning using Human-Aligned Reward Labeling for Autonomous Emergency Braking in Occluded Pedestrian Crossing | Apr 11, 2025 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| SynthFM: Training Modality-agnostic Foundation Models for Medical Image Segmentation without Real Medical Data | Apr 11, 2025 | DecoderImage Segmentation | —Unverified | 0 |
| Multi-person Physics-based Pose Estimation for Combat Sports | Apr 11, 2025 | 3D Human Pose Estimation3D Multi-Person Pose Estimation | —Unverified | 0 |
| ZS-VCOS: Zero-Shot Outperforms Supervised Video Camouflaged Object Segmentation | Apr 10, 2025 | Camouflaged Object SegmentationDefect Detection | CodeCode Available | 0 |
| Conditional Conformal Risk Adaptation | Apr 10, 2025 | Conformal PredictionImage Segmentation | —Unverified | 0 |
| Distilling Knowledge from Heterogeneous Architectures for Semantic Segmentation | Apr 10, 2025 | Knowledge DistillationSemantic Segmentation | —Unverified | 0 |
| Nonlocal Retinex-Based Variational Model and its Deep Unfolding Twin for Low-Light Image Enhancement | Apr 10, 2025 | Image EnhancementImage Segmentation | —Unverified | 0 |
| GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation | Apr 10, 2025 | Contrastive LearningLanguage Modeling | CodeCode Available | 2 |
| SydneyScapes: Image Segmentation for Australian Environments | Apr 10, 2025 | Autonomous VehiclesBenchmarking | —Unverified | 0 |
| RadZero: Similarity-Based Cross-Attention for Explainable Vision-Language Alignment in Radiology with Zero-Shot Multi-Task Capability | Apr 10, 2025 | Contrastive LearningOpen Vocabulary Semantic Segmentation | —Unverified | 0 |
| PRAD: Periapical Radiograph Analysis Dataset and Benchmark Model Development | Apr 10, 2025 | Image SegmentationMedical Image Segmentation | —Unverified | 0 |
| P2Object: Single Point Supervised Object Detection and Instance Segmentation | Apr 10, 2025 | Instance SegmentationMultiple Instance Learning | CodeCode Available | 2 |
| MovSAM: A Single-image Moving Object Segmentation Framework Based on Deep Thinking | Apr 9, 2025 | Autonomous DrivingLanguage Modeling | CodeCode Available | 0 |
| Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting | Apr 9, 2025 | 3DGS3D Instance Segmentation | CodeCode Available | 1 |
| Zeus: Zero-shot LLM Instruction for Union Segmentation in Multimodal Medical Imaging | Apr 9, 2025 | DiagnosticImage Segmentation | —Unverified | 0 |
| RayFronts: Open-Set Semantic Ray Frontiers for Online Scene Understanding and Exploration | Apr 9, 2025 | 3D Semantic SegmentationBenchmarking | —Unverified | 0 |
| Domain Generalization through Attenuation of Domain-Specific Information | Apr 9, 2025 | Domain GeneralizationSemantic Segmentation | CodeCode Available | 0 |
| WoundAmbit: Bridging State-of-the-Art Semantic Segmentation and Real-World Wound Care | Apr 8, 2025 | Computational EfficiencyCPU | —Unverified | 0 |
| Saliency-Motion Guided Trunk-Collateral Network for Unsupervised Video Object Segmentation | Apr 8, 2025 | Optical Flow EstimationSalient Object Detection | —Unverified | 0 |
| Towards Varroa destructor mite detection using a narrow spectra illumination | Apr 8, 2025 | Semantic Segmentation | —Unverified | 0 |
| Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation | Apr 8, 2025 | Domain AdaptationDomain Generalization | CodeCode Available | 2 |
| econSG: Efficient and Multi-view Consistent Open-Vocabulary 3D Semantic Gaussians | Apr 8, 2025 | 3DGSComputational Efficiency | —Unverified | 0 |
| CTI-Unet: Cascaded Threshold Integration for Improved U-Net Segmentation of Pathology Images | Apr 8, 2025 | Image SegmentationSegmentation | —Unverified | 0 |
| HRMedSeg: Unlocking High-resolution Medical Image segmentation via Memory-efficient Attention Modeling | Apr 8, 2025 | DecoderGPU | CodeCode Available | 1 |
| Rethinking the Nested U-Net Approach: Enhancing Biomarker Segmentation with Attention Mechanisms and Multiscale Feature Fusion | Apr 8, 2025 | 2D Semantic SegmentationImage Segmentation | CodeCode Available | 0 |
| MSA-UNet3+: Multi-Scale Attention UNet3+ with New Supervised Prototypical Contrastive Loss for Coronary DSA Image Segmentation | Apr 7, 2025 | Contrastive LearningDiagnostic | CodeCode Available | 0 |
| SlicerNNInteractive: A 3D Slicer extension for nnInteractive | Apr 7, 2025 | Image SegmentationSemantic Segmentation | CodeCode Available | 2 |
| S^4M: Boosting Semi-Supervised Instance Segmentation with SAM | Apr 7, 2025 | Data AugmentationInstance Segmentation | —Unverified | 0 |
| The 1st Solution for 4th PVUW MeViS Challenge: Unleashing the Potential of Large Multimodal Models for Referring Video Segmentation | Apr 7, 2025 | Inference OptimizationReferring Video Object Segmentation | CodeCode Available | 5 |
| Explaining Uncertainty in Multiple Sclerosis Lesion Segmentation Beyond Prediction Errors | Apr 7, 2025 | Image SegmentationInformativeness | —Unverified | 0 |
| Balancing Robustness and Efficiency in Embedded DNNs Through Activation Function Selection | Apr 7, 2025 | Autonomous DrivingDecoder | —Unverified | 0 |
| BoxSeg: Quality-Aware and Peer-Assisted Learning for Box-supervised Instance Segmentation | Apr 7, 2025 | Box-supervised Instance SegmentationInstance Segmentation | CodeCode Available | 0 |
| Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting | Apr 7, 2025 | Boundary DetectionObject | CodeCode Available | 2 |
| DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation | Apr 7, 2025 | 3D geometryRGBD Semantic Segmentation | CodeCode Available | 3 |
| Here Comes the Explanation: A Shapley Perspective on Multi-contrast Medical Image Segmentation | Apr 6, 2025 | Brain Tumor SegmentationImage Segmentation | —Unverified | 0 |
| UCS: A Universal Model for Curvilinear Structure Segmentation | Apr 5, 2025 | Feature CompressionSegmentation | —Unverified | 0 |
| Performance Analysis of Deep Learning Models for Femur Segmentation in MRI Scan | Apr 5, 2025 | Image SegmentationMedical Image Segmentation | —Unverified | 0 |
| View2CAD: Reconstructing View-Centric CAD Models from Single RGB-D Scans | Apr 5, 2025 | Image SegmentationSemantic Segmentation | —Unverified | 0 |
| Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation | Apr 4, 2025 | Domain GeneralizationMamba | CodeCode Available | 2 |
| Multi-encoder nnU-Net outperforms Transformer models with self-supervised pretraining | Apr 4, 2025 | Image SegmentationMedical Image Segmentation | —Unverified | 0 |
| Multi-Granularity Vision Fastformer with Fusion Mechanism for Skin Lesion Segmentation | Apr 4, 2025 | Image SegmentationLesion Segmentation | —Unverified | 0 |
| GraphSeg: Segmented 3D Representations via Graph Edge Addition and Contraction | Apr 4, 2025 | Image SegmentationSegmentation | CodeCode Available | 0 |
| Evaluating and Enhancing Segmentation Model Robustness with Metamorphic Testing | Apr 3, 2025 | Image SegmentationSegmentation | —Unverified | 0 |
| Adaptive Frequency Enhancement Network for Remote Sensing Image Semantic Segmentation | Apr 3, 2025 | Semantic Segmentation | CodeCode Available | 1 |
| Taylor Series-Inspired Local Structure Fitting Network for Few-shot Point Cloud Semantic Segmentation | Apr 3, 2025 | Semantic Segmentation | CodeCode Available | 0 |
| APSeg: Auto-Prompt Model with Acquired and Injected Knowledge for Nuclear Instance Segmentation and Classification | Apr 3, 2025 | ClassificationInstance Segmentation | —Unverified | 0 |
| Semantic segmentation of forest stands using deep learning | Apr 3, 2025 | Deep LearningFinancial Analysis | —Unverified | 0 |
| Delineate Anything: Resolution-Agnostic Field Boundary Delineation on Satellite Imagery | Apr 3, 2025 | Field Boundary DelineationInstance Segmentation | CodeCode Available | 2 |
| SelfMedHPM: Self Pre-training With Hard Patches Mining Masked Autoencoders For Medical Image Segmentation | Apr 3, 2025 | Image SegmentationMedical Image Segmentation | —Unverified | 0 |
| Agglomerating Large Vision Encoders via Distillation for VFSS Segmentation | Apr 3, 2025 | Image SegmentationKnowledge Distillation | —Unverified | 0 |