| Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration | Jun 12, 2025 | cross-modal alignmentImage to text | —Unverified | 0 |
| MedSeg-R: Reasoning Segmentation in Medical Images with Multimodal Large Language Models | Jun 12, 2025 | Image SegmentationMedical Diagnosis | —Unverified | 0 |
| ALBERT: Advanced Localization and Bidirectional Encoder Representations from Transformers for Automotive Damage Evaluation | Jun 12, 2025 | Instance SegmentationSegmentation | —Unverified | 0 |
| Symmetrical Flow Matching: Unified Image Generation, Segmentation, and Classification with Score-Based Generative Models | Jun 12, 2025 | Image GenerationSegmentation | —Unverified | 0 |
| Semantic Localization Guiding Segment Anything Model For Reference Remote Sensing Image Segmentation | Jun 12, 2025 | Image SegmentationSegmentation | —Unverified | 0 |
| FARCLUSS: Fuzzy Adaptive Rebalancing and Contrastive Uncertainty Learning for Semi-Supervised Semantic Segmentation | Jun 11, 2025 | Semantic SegmentationSemi-Supervised Semantic Segmentation | CodeCode Available | 0 |
| Deep Semantic Segmentation for Multi-Source Localization Using Angle of Arrival Measurements | Jun 11, 2025 | Semantic Segmentation | —Unverified | 0 |
| The Four Color Theorem for Cell Instance Segmentation | Jun 11, 2025 | Computational EfficiencyInstance Segmentation | CodeCode Available | 1 |
| SRPL-SFDA: SAM-Guided Reliable Pseudo-Labels for Source-Free Domain Adaptation in Medical Image Segmentation | Jun 11, 2025 | Domain AdaptationImage Segmentation | CodeCode Available | 0 |
| Enhancing Human-Robot Collaboration: A Sim2Real Domain Adaptation Algorithm for Point Cloud Segmentation in Industrial Environments | Jun 11, 2025 | Domain AdaptationPoint Cloud Segmentation | —Unverified | 0 |
| Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic Segmentation | Jun 11, 2025 | Autonomous DrivingDomain Generalization | CodeCode Available | 1 |
| Accurate and efficient zero-shot 6D pose estimation with frozen foundation models | Jun 11, 2025 | 6D Pose EstimationInstance Segmentation | —Unverified | 0 |
| ELBO-T2IAlign: A Generic ELBO-Based Method for Calibrating Pixel-level Text-Image Alignment in Diffusion Models | Jun 11, 2025 | Image GenerationImage Segmentation | —Unverified | 0 |
| Urban1960SatSeg: Unsupervised Semantic Segmentation of Mid-20^th century Urban Landscapes with Satellite Imageries | Jun 11, 2025 | SegmentationSelf-Supervised Learning | CodeCode Available | 2 |
| ContextLoss: Context Information for Topology-Preserving Segmentation | Jun 10, 2025 | Image SegmentationSemantic Segmentation | —Unverified | 0 |
| Segment This Thing: Foveated Tokenization for Efficient Point-Prompted Segmentation | Jun 10, 2025 | FoveationImage Segmentation | CodeCode Available | 2 |
| ECMNet:Lightweight Semantic Segmentation with Efficient CNN-Mamba Network | Jun 10, 2025 | GPUMamba | —Unverified | 0 |
| DCD: A Semantic Segmentation Model for Fetal Ultrasound Four-Chamber View | Jun 10, 2025 | SegmentationSemantic Segmentation | —Unverified | 0 |
| Segment Concealed Objects with Incomplete Supervision | Jun 10, 2025 | Pseudo LabelSegmentation | —Unverified | 0 |
| ATAS: Any-to-Any Self-Distillation for Enhanced Open-Vocabulary Dense Prediction | Jun 10, 2025 | object-detectionObject Detection | —Unverified | 0 |
| SSS: Semi-Supervised SAM-2 with Efficient Prompting for Medical Imaging Segmentation | Jun 10, 2025 | Data AugmentationImage Segmentation | CodeCode Available | 0 |
| RS-MTDF: Multi-Teacher Distillation and Fusion for Remote Sensing Semi-Supervised Semantic Segmentation | Jun 10, 2025 | Semantic SegmentationSemi-Supervised Semantic Segmentation | CodeCode Available | 1 |
| Segment Any Architectural Facades (SAAF):An automatic segmentation model for building facades, walls and windows based on multimodal semantics guidance | Jun 9, 2025 | Image SegmentationSegmentation | —Unverified | 0 |
| C3S3: Complementary Competition and Contrastive Selection for Semi-Supervised Medical Image Segmentation | Jun 9, 2025 | Contrastive LearningDiagnostic | CodeCode Available | 1 |
| F2Net: A Frequency-Fused Network for Ultra-High Resolution Remote Sensing Segmentation | Jun 9, 2025 | Semantic Segmentation | —Unverified | 0 |
| SAM2Auto: Auto Annotation Using FLASH | Jun 9, 2025 | Instance SegmentationObject | —Unverified | 0 |
| Trend-Aware Fashion Recommendation with Visual Segmentation and Semantic Similarity | Jun 9, 2025 | Semantic SegmentationSemantic Similarity | CodeCode Available | 0 |
| LogoSP: Local-global Grouping of Superpoints for Unsupervised Semantic Segmentation of 3D Point Clouds | Jun 9, 2025 | 3D Semantic SegmentationSegmentation | CodeCode Available | 1 |
| Adapter Naturally Serves as Decoupler for Cross-Domain Few-Shot Semantic Segmentation | Jun 9, 2025 | Cross-Domain Few-ShotFew-Shot Semantic Segmentation | —Unverified | 0 |
| Text-guided multi-stage cross-perception network for medical image segmentation | Jun 9, 2025 | Image SegmentationMedical Image Segmentation | —Unverified | 0 |
| OpenSplat3D: Open-Vocabulary 3D Instance Segmentation using Gaussian Splatting | Jun 9, 2025 | 3DGS3D Instance Segmentation | —Unverified | 0 |
| PIG: Physically-based Multi-Material Interaction with 3D Gaussians | Jun 9, 2025 | Scene GenerationSegmentation | —Unverified | 0 |
| IGraSS: Learning to Identify Infrastructure Networks from Satellite Imagery by Iterative Graph-constrained Semantic Segmentation | Jun 9, 2025 | SegmentationSemantic Segmentation | —Unverified | 0 |
| A System for Accurate Tracking and Video Recordings of Rodent Eye Movements using Convolutional Neural Networks for Biomedical Image Segmentation | Jun 9, 2025 | Image SegmentationSemantic Segmentation | —Unverified | 0 |
| Multiple Object Stitching for Unsupervised Representation Learning | Jun 9, 2025 | Contrastive LearningObject | CodeCode Available | 1 |
| Active Contour Models Driven by Hyperbolic Mean Curvature Flow for Image Segmentation | Jun 7, 2025 | Image SegmentationSemantic Segmentation | —Unverified | 0 |
| Stepwise Decomposition and Dual-stream Focus: A Novel Approach for Training-free Camouflaged Object Segmentation | Jun 7, 2025 | Camouflaged Object SegmentationFeature Correlation | CodeCode Available | 0 |
| THU-Warwick Submission for EPIC-KITCHEN Challenge 2025: Semi-Supervised Video Object Segmentation | Jun 7, 2025 | SegmentationSemantic Segmentation | —Unverified | 0 |
| You Only Estimate Once: Unified, One-stage, Real-Time Category-level Articulated Object 6D Pose Estimation for Robotic Grasping | Jun 6, 2025 | 6D Pose EstimationInstance Segmentation | —Unverified | 0 |
| NeurNCD: Novel Class Discovery via Implicit Neural Representation | Jun 6, 2025 | NeRFNovel Class Discovery | —Unverified | 0 |
| GS4: Generalizable Sparse Splatting Semantic SLAM | Jun 6, 2025 | 3D Semantic SegmentationSemantic Segmentation | —Unverified | 0 |
| U-NetMN and SegNetMN: Modified U-Net and SegNet models for bimodal SAR image segmentation | Jun 5, 2025 | Body DetectionComputational Efficiency | —Unverified | 0 |
| DM-SegNet: Dual-Mamba Architecture for 3D Medical Image Segmentation with Global Context Modeling | Jun 5, 2025 | AnatomyBrain Tumor Segmentation | —Unverified | 0 |
| Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos | Jun 5, 2025 | GPUSemantic Segmentation | CodeCode Available | 2 |
| Refer to Anything with Vision-Language Prompts | Jun 5, 2025 | BenchmarkingGeneralized Referring Expression Segmentation | —Unverified | 0 |
| Bringing SAM to new heights: Leveraging elevation data for tree crown segmentation from drone imagery | Jun 5, 2025 | Instance SegmentationSemantic Segmentation | —Unverified | 0 |
| SAM-aware Test-time Adaptation for Universal Medical Image Segmentation | Jun 5, 2025 | Image SegmentationMedical Image Segmentation | CodeCode Available | 0 |
| Point Cloud Segmentation of Agricultural Vehicles using 3D Gaussian Splatting | Jun 5, 2025 | 3DGSPoint Cloud Segmentation | —Unverified | 0 |
| VideoMolmo: Spatio-Temporal Grounding Meets Pointing | Jun 5, 2025 | Autonomous DrivingAutonomous Navigation | CodeCode Available | 2 |
| OpenMaskDINO3D : Reasoning 3D Segmentation via Large Language Model | Jun 5, 2025 | Instance SegmentationLanguage Modeling | CodeCode Available | 1 |