| OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding | Apr 15, 2025 | Semantic Segmentation, Video Generation | Unverified | 0 |
| CAP-Net: A Unified Network for 6D Pose and Size Estimation of Categorical Articulated Parts from a Single RGB-D Image | Apr 15, 2025 | Instance Segmentation, Pose Estimation | Unverified | 0 |
| Efficient and Robust Remote Sensing Image Denoising Using Randomized Approximation of Geodesics' Gramian on the Manifold Underlying the Patch Space | Apr 15, 2025 | Denoising, Image Denoising | Unverified | 0 |
| Efficient Medical Image Restoration via Reliability Guided Learning in Frequency Domain | Apr 15, 2025 | Denoising, Image Denoising | Unverified | 0 |
| IGL-DT: Iterative Global-Local Feature Learning with Dual-Teacher Semantic Segmentation Framework under Limited Annotation Scheme | Apr 14, 2025 | Segmentation, Semantic Segmentation | Unverified | 0 |
| HDC: Hierarchical Distillation for Multi-level Noisy Consistency in Semi-Supervised Fetal Ultrasound Segmentation | Apr 14, 2025 | Anatomy, Image Segmentation | Unverified | 0 |
| M2S-RoAD: Multi-Modal Semantic Segmentation for Road Damage Using Camera and LiDAR Data | Apr 14, 2025 | Autonomous Vehicles, Semantic Segmentation | Code Available | 0 |
| The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer | Apr 14, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation | Apr 14, 2025 | Open Vocabulary Semantic Segmentation, Open-Vocabulary Semantic Segmentation | Code Available | 1 |
| MASSeg: 2nd Technical Report for 4th PVUW MOSE Track | Apr 14, 2025 | Data Augmentation, Object | Code Available | 0 |
| Real-time Seafloor Segmentation and Mapping | Apr 14, 2025 | Image Segmentation, Segmentation | Unverified | 0 |
| Advancing RFI-Detection in Radio Astronomy with Liquid State Machines | Apr 14, 2025 | Astronomy, Semantic Segmentation | Unverified | 0 |
| Zero-shot Autonomous Microscopy for Scalable and Intelligent Characterization of 2D Materials | Apr 14, 2025 | Image Segmentation, Prompt Engineering | Unverified | 0 |
| FVOS for MOSE Track of 4th PVUW Challenge: 3rd Place Solution | Apr 13, 2025 | Segmentation, Semantic Segmentation | Unverified | 0 |
| Mixture-of-Shape-Experts (MoSE): End-to-End Shape Dictionary Framework to Prompt SAM for Generalizable Medical Segmentation | Apr 13, 2025 | Dictionary Learning, Domain Generalization | Unverified | 0 |
| AerOSeg: Harnessing SAM for Open-Vocabulary Segmentation in Remote Sensing Images | Apr 12, 2025 | Image Segmentation, Segmentation | Unverified | 0 |
| PathSeqSAM: Sequential Modeling for Pathology Image Segmentation with SAM2 | Apr 12, 2025 | Domain Adaptation, Image Segmentation | Code Available | 0 |
| Multi-Modal Brain Tumor Segmentation via 3D Multi-Scale Self-attention and Cross-attention | Apr 12, 2025 | Brain Tumor Segmentation, Decoder | Unverified | 0 |
| A Unified Loss for Handling Inter-Class and Intra-Class Imbalance in Medical Image Segmentation | Apr 11, 2025 | Image Segmentation, Medical Image Analysis | Code Available | 0 |
| Do Segmentation Models Understand Vascular Structure? A Blob-Based XAI Framework | Apr 11, 2025 | Image Segmentation, Medical Image Segmentation | Unverified | 0 |
| Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization | Apr 11, 2025 | Denoising, Object | Unverified | 0 |
| DSM: Building A Diverse Semantic Map for 3D Visual Grounding | Apr 11, 2025 | 3D visual grounding, Scene Understanding | Unverified | 0 |
| Offline Reinforcement Learning using Human-Aligned Reward Labeling for Autonomous Emergency Braking in Occluded Pedestrian Crossing | Apr 11, 2025 | Autonomous Driving, Autonomous Vehicles | Unverified | 0 |
| Multi-person Physics-based Pose Estimation for Combat Sports | Apr 11, 2025 | 3D Human Pose Estimation, 3D Multi-Person Pose Estimation | Unverified | 0 |
| Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data Generation | Apr 11, 2025 | Depth Estimation, Instance Segmentation | Code Available | 0 |