| Identity documents recognition and detection using semantic segmentation with convolutional neural network | Mar 3, 2025 | Object Recognition, Semantic Segmentation | Unverified | 0 |
| Hyperspectral image segmentation with a machine learning model trained using quantum annealer | Mar 3, 2025 | Hyperspectral Image Segmentation, Image Segmentation | Unverified | 0 |
| UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface | Mar 3, 2025 | Instance Segmentation, Reasoning Segmentation | Code Available | 3 |
| AirRoom: Objects Matter in Room Reidentification | Mar 3, 2025 | Object, Semantic Segmentation | Unverified | 0 |
| OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging | Mar 3, 2025 | 3D Instance Segmentation, Instance Segmentation | Unverified | 0 |
| SparseMamba-PCL: Scribble-Supervised Medical Image Segmentation via SAM-Guided Progressive Collaborative Learning | Mar 3, 2025 | Decoder, Image Segmentation | Code Available | 0 |
| Training-Free Dataset Pruning for Instance Segmentation | Mar 2, 2025 | Instance Segmentation, Segmentation | Code Available | 0 |
| IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word Emphasis | Mar 2, 2025 | Image Segmentation, Image-text matching | Code Available | 1 |
| Unifying Light Field Perception with Field of Parallax | Mar 2, 2025 | Multi-Task Learning, object-detection | Code Available | 0 |
| HiMo: High-Speed Objects Motion Compensation in Point Clouds | Mar 2, 2025 | Autonomous Vehicles, Motion Compensation | Unverified | 0 |
| Conformal Lyapunov Optimization: Optimal Resource Allocation under Deterministic Reliability Constraints | Mar 1, 2025 | Image Segmentation, Semantic Segmentation | Unverified | 0 |
| Detection of Customer Interested Garments in Surveillance Video using Computer Vision | Mar 1, 2025 | Image Segmentation, Semantic Segmentation | Unverified | 0 |
| Explainable LiDAR 3D Point Cloud Segmentation and Clustering for Detecting Airplane-Generated Wind Turbulence | Mar 1, 2025 | Clustering, Decision Making | Unverified | 0 |
| Ranking pre-trained segmentation models for zero-shot transferability | Mar 1, 2025 | Instance Segmentation, Model Selection | Unverified | 0 |
| SemiSAM+: Rethinking Semi-Supervised Medical Image Segmentation in the Era of Foundation Models | Feb 28, 2025 | Image Segmentation, Medical Image Segmentation | Code Available | 2 |
| Autoregressive Medical Image Segmentation via Next-Scale Mask Prediction | Feb 28, 2025 | Image Segmentation, Medical Image Segmentation | Unverified | 0 |
| Style Content Decomposition-based Data Augmentation for Domain Generalizable Medical Image Segmentation | Feb 28, 2025 | Data Augmentation, Image Segmentation | Code Available | 0 |
| OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels | Feb 27, 2025 | Image Classification, Instance Segmentation | Code Available | 4 |
| Multi-Scale Neighborhood Occupancy Masked Autoencoder for Self-Supervised Learning in LiDAR Point Clouds | Feb 27, 2025 | 3D Object Detection, Decoder | Unverified | 0 |
| Weakly Supervised Segmentation Framework for Thyroid Nodule Based on High-confidence Labels and High-rationality Losses | Feb 27, 2025 | Image Segmentation, Segmentation | Code Available | 0 |
| Test-Time Modality Generalization for Medical Image Segmentation | Feb 27, 2025 | Domain Generalization, Image Segmentation | Unverified | 0 |
| You Only Click Once: Single Point Weakly Supervised 3D Instance Segmentation for Autonomous Driving | Feb 27, 2025 | 3D Instance Segmentation, Autonomous Driving | Unverified | 0 |
| Learning Mask Invariant Mutual Information for Masked Image Modeling | Feb 27, 2025 | Contrastive Learning, image-classification | Unverified | 0 |
| 3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds | Feb 27, 2025 | Affordance Detection, Human-Object Interaction Detection | Unverified | 0 |
| SegLocNet: Multimodal Localization Network for Autonomous Driving via Bird's-Eye-View Segmentation | Feb 27, 2025 | Autonomous Driving, BEV Segmentation | Code Available | 1 |
| Enhanced Neuromorphic Semantic Segmentation Latency through Stream Event | Feb 26, 2025 | Segmentation, Self-Driving Cars | Unverified | 0 |
| Dictionary-based Framework for Interpretable and Consistent Object Parsing | Feb 26, 2025 | Contrastive Learning, Object | Unverified | 0 |
| An Analysis of Data Transformation Effects on Segment Anything 2 | Feb 25, 2025 | Semantic Segmentation, Video Object Segmentation | Unverified | 0 |
| NusaAksara: A Multimodal and Multilingual Benchmark for Preserving Indonesian Indigenous Scripts | Feb 25, 2025 | Image Segmentation, Language Identification | Unverified | 0 |
| CAML: Collaborative Auxiliary Modality Learning for Multi-Agent Systems | Feb 25, 2025 | Autonomous Driving, Autonomous Vehicles | Unverified | 0 |
| OpenFly: A Comprehensive Platform for Aerial Vision-Language Navigation | Feb 25, 2025 | Benchmarking, Semantic Segmentation | Unverified | 0 |
| VesselSAM: Leveraging SAM for Aortic Vessel Segmentation with AtrousLoRA | Feb 25, 2025 | Computational Efficiency, Image Segmentation | Code Available | 1 |
| CLIMB-3D: Continual Learning for Imbalanced 3D Instance Segmentation | Feb 24, 2025 | 3D Instance Segmentation, Continual Learning | Code Available | 0 |
| M3DA: Benchmark for Unsupervised Domain Adaptation in 3D Medical Image Segmentation | Feb 24, 2025 | Computed Tomography (CT), Domain Adaptation | Code Available | 0 |
| An Enhanced Large Language Model For Cross Modal Query Understanding System Using DL-KeyBERT Based CAZSSCL-MPGPT | Feb 24, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| A Priori Generalizability Estimate for a CNN | Feb 24, 2025 | Diagnostic, image-classification | Unverified | 0 |
| MDN: Mamba-Driven Dualstream Network For Medical Hyperspectral Image Segmentation | Feb 24, 2025 | Hyperspectral Image Segmentation, Image Segmentation | Code Available | 0 |
| SpecDM: Hyperspectral Dataset Synthesis with Pixel-level Semantic Annotations | Feb 24, 2025 | Change Detection, Dataset Generation | Unverified | 0 |
| DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks | Feb 24, 2025 | Conditional Image Generation, Image Generation | Code Available | 3 |
| A Comparative Tutorial of the Histogram-based Image Segmentation Methods | Feb 23, 2025 | Image Segmentation, Segmentation | Unverified | 0 |
| Rewards-based image analysis in microscopy | Feb 23, 2025 | Decision Making, Denoising | Unverified | 0 |
| Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration | Feb 23, 2025 | 3DGS, 3D Semantic Segmentation | Unverified | 0 |
| VPNeXt -- Rethinking Dense Decoding for Plain Vision Transformer | Feb 23, 2025 | Decoder, Semantic Segmentation | Unverified | 0 |
| Deep learning approaches to surgical video segmentation and object detection: A Scoping Review | Feb 23, 2025 | object-detection, Object Detection | Unverified | 0 |
| AeroReformer: Aerial Referring Transformer for UAV-based Referring Image Segmentation | Feb 23, 2025 | Image Segmentation, Segmentation | Code Available | 1 |
| Cross-Model Transferability of Adversarial Patches in Real-time Segmentation for Autonomous Driving | Feb 22, 2025 | Autonomous Driving, Autonomous Vehicles | Code Available | 0 |
| FeatSharp: Your Vision Model Features, Sharper | Feb 22, 2025 | model, object-detection | Unverified | 0 |
| UrbanSAM: Learning Invariance-Inspired Adapters for Segment Anything Models in Urban Construction | Feb 21, 2025 | Segmentation, Semantic Segmentation | Unverified | 0 |
| Image Translation-Based Unsupervised Cross-Modality Domain Adaptation for Medical Image Segmentation | Feb 21, 2025 | Deep Learning, Domain Adaptation | Unverified | 0 |
| Confidence-Weighted Boundary-Aware Learning for Semi-Supervised Semantic Segmentation | Feb 21, 2025 | Pseudo Label Filtering, Segmentation | Code Available | 0 |