| Feature-Based Lie Group Transformer for Real-World Applications | Jun 5, 2025 | Object, Object Recognition | Unverified | 0 |
| OpenMaskDINO3D: Reasoning 3D Segmentation via Large Language Model | Jun 5, 2025 | Instance Segmentation, Language Modeling | Code Available | 1 |
| Gen-n-Val: Agentic Image Data Generation and Validation | Jun 5, 2025 | Image Harmonization, Instance Segmentation | Unverified | 0 |
| A Comprehensive Study on Medical Image Segmentation using Deep Neural Networks | Jun 4, 2025 | Ethics, Explainable artificial intelligence | Unverified | 0 |
| You Only Train Once | Jun 4, 2025 | Semantic Segmentation | Unverified | 0 |
| Sounding that Object: Interactive Object-Aware Image to Audio Generation | Jun 4, 2025 | Audio Generation, Image Segmentation | Unverified | 0 |
| Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter Embedding | Jun 3, 2025 | 3D Object Detection, Attribute | Code Available | 2 |
| Talk2SAM: Text-Guided Semantic Enhancement for Complex-Shaped Object Segmentation | Jun 3, 2025 | Segmentation, Semantic Segmentation | Code Available | 0 |
| Hierarchical Self-Prompting SAM: A Prompt-Free Medical Image Segmentation Framework | Jun 3, 2025 | Image Segmentation, Lesion Segmentation | Unverified | 0 |
| Zero-Shot Tree Detection and Segmentation from Aerial Forest Imagery | Jun 3, 2025 | Image Segmentation, Segmentation | Code Available | 1 |
| Towards Explicit Geometry-Reflectance Collaboration for Generalized LiDAR Segmentation in Adverse Weather | Jun 3, 2025 | LIDAR Semantic Segmentation, Semantic Segmentation | Unverified | 0 |
| GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal | Jun 3, 2025 | object-detection, Object Detection | Code Available | 1 |
| InterRVOS: Interaction-aware Referring Video Object Segmentation | Jun 3, 2025 | 8k, Object | Unverified | 0 |
| Beyond Pixel Agreement: Large Language Models as Clinical Guardrails for Reliable Medical Image Segmentation | Jun 2, 2025 | Diagnostic, Image Segmentation | Unverified | 0 |
| SAM-I2V: Upgrading SAM to Support Promptable Video Segmentation with Less than 0.2% Training Cost | Jun 2, 2025 | Image Segmentation, Semantic Segmentation | Code Available | 1 |
| Overcoming Data Scarcity in Scanning Tunnelling Microscopy Image Segmentation | Jun 2, 2025 | Few-Shot Learning, Image Segmentation | Unverified | 0 |
| SEMNAV: A Semantic Segmentation-Driven Approach to Visual Semantic Navigation | Jun 2, 2025 | Domain Adaptation, Navigate | Code Available | 1 |
| unMORE: Unsupervised Multi-Object Segmentation via Center-Boundary Reasoning | Jun 2, 2025 | Image Reconstruction, Object | Code Available | 0 |
| SoundSculpt: Direction and Semantics Driven Ambisonic Target Sound Extraction | May 30, 2025 | Image Segmentation, Semantic Segmentation | Unverified | 0 |
| Weakly-Supervised Affordance Grounding Guided by Part-Level Semantic Priors | May 30, 2025 | Human-Object Interaction Detection, Semantic Segmentation | Code Available | 1 |
| Unleashing the Power of Intermediate Domains for Mixed Domain Semi-Supervised Medical Image Segmentation | May 30, 2025 | Domain Adaptation, Image Segmentation | Code Available | 0 |
| NUC-Net: Non-uniform Cylindrical Partition Network for Efficient LiDAR Semantic Segmentation | May 30, 2025 | Autonomous Driving, GPU | Code Available | 0 |
| Decoupled Competitive Framework for Semi-supervised Medical Image Segmentation | May 30, 2025 | Image Segmentation, Medical Image Segmentation | Code Available | 0 |
| ACM-UNet: Adaptive Integration of CNNs and Mamba for Efficient Medical Image Segmentation | May 30, 2025 | Decoder, Image Segmentation | Code Available | 0 |
| SPPSFormer: High-quality Superpoint-based Transformer for Roof Plane Instance Segmentation from Point Clouds | May 30, 2025 | Data Augmentation, Instance Segmentation | Unverified | 0 |
| Revisiting Cross-Modal Knowledge Distillation: A Disentanglement Approach for RGBD Semantic Segmentation | May 30, 2025 | Autonomous Driving, Contrastive Learning | Code Available | 0 |
| Bi-Manual Joint Camera Calibration and Scene Representation | May 30, 2025 | Camera Calibration, Robot Manipulation | Unverified | 0 |
| Semantics-Guided Generative Image Compression | May 29, 2025 | Decoder, Image Compression | Code Available | 0 |
| Point-MoE: Towards Cross-Domain Generalization in 3D Semantic Segmentation via Mixture-of-Experts | May 29, 2025 | 3D Semantic Segmentation, Domain Generalization | Unverified | 0 |
| MaskAdapt: Unsupervised Geometry-Aware Domain Adaptation Using Multimodal Contextual Learning and RGB-Depth Masking | May 29, 2025 | Domain Adaptation, Semantic Segmentation | Unverified | 0 |
| Adaptive Spatial Augmentation for Semi-supervised Semantic Segmentation | May 29, 2025 | Data Augmentation, Diversity | Unverified | 0 |
| VITON-DRR: Details Retention Virtual Try-on via Non-rigid Registration | May 29, 2025 | Image Generation, Semantic Segmentation | Code Available | 0 |
| PCA for Enhanced Cross-Dataset Generalizability in Breast Ultrasound Tumor Segmentation | May 29, 2025 | Domain Adaptation, Image Segmentation | Unverified | 0 |
| Bridging Classical and Modern Computer Vision: PerceptiveNet for Tree Crown Semantic Segmentation | May 29, 2025 | Segmentation, Semantic Segmentation | Unverified | 0 |
| LeMoRe: Learn More Details for Lightweight Semantic Segmentation | May 29, 2025 | Computational Efficiency, Representation Learning | Code Available | 0 |
| Federated Unsupervised Semantic Segmentation | May 29, 2025 | Federated Learning, Image Segmentation | Unverified | 0 |
| TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models | May 29, 2025 | Referring Expression, Referring Expression Comprehension | Code Available | 2 |
| SAM-R1: Leveraging SAM for Reward Feedback in Multimodal Segmentation via Reinforcement Learning | May 28, 2025 | Image Segmentation, Multimodal Reasoning | Unverified | 0 |
| Adapting Segment Anything Model for Power Transmission Corridor Hazard Segmentation | May 28, 2025 | Semantic Segmentation | Code Available | 0 |
| PathFL: Multi-Alignment Federated Learning for Pathology Image Segmentation | May 28, 2025 | Federated Learning, Image Segmentation | Code Available | 0 |
| Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation | May 28, 2025 | image-classification, Image Classification | Code Available | 1 |
| CAST: Contrastive Adaptation and Distillation for Semi-Supervised Instance Segmentation | May 28, 2025 | Domain Adaptation, Instance Segmentation | Unverified | 0 |
| A Survey on Training-free Open-Vocabulary Semantic Segmentation | May 28, 2025 | Multi-modal Classification, Open Vocabulary Semantic Segmentation | Unverified | 0 |
| YH-MINER: Multimodal Intelligent System for Natural Ecological Reef Metric Extraction | May 28, 2025 | object-detection, Object Detection | Unverified | 0 |
| MAMBO-NET: Multi-Causal Aware Modeling Backdoor-Intervention Optimization for Medical Image Segmentation Network | May 28, 2025 | Causal Inference, Image Segmentation | Unverified | 0 |
| ConfLUNet: Multiple sclerosis lesion instance segmentation in presence of confluent lesions | May 28, 2025 | Instance Segmentation, Lesion Detection | Unverified | 0 |
| LiDAR Based Semantic Perception for Forklifts in Outdoor Environments | May 28, 2025 | Scene Understanding, Segmentation | Unverified | 0 |
| Geometric Feature Prompting of Image Segmentation Models | May 27, 2025 | Image Segmentation, Segmentation | Unverified | 0 |
| Object-Centric Action-Enhanced Representations for Robot Visuo-Motor Policy Learning | May 27, 2025 | Imitation Learning, Semantic Segmentation | Unverified | 0 |
| Vision-Based Risk Aware Emergency Landing for UAVs in Complex Urban Environments | May 26, 2025 | Semantic Segmentation | Unverified | 0 |