| A Semi-Supervised Approach with Error Reflection for Echocardiography Segmentation | Dec 1, 2024 | Data Augmentation, Image Segmentation | Unverified | 0 |
| DPE-Net: Dual-Parallel Encoder Based Network for Semantic Segmentation of Polyps | Dec 1, 2024 | Diversity, Image Segmentation | Unverified | 0 |
| SyncVIS: Synchronized Video Instance Segmentation | Dec 1, 2024 | Instance Segmentation, Segmentation | Code Available | 1 |
| 2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification | Dec 1, 2024 | Computational Efficiency, image-classification | Code Available | 2 |
| SPF-Net: Solar panel fault detection using U-Net based deep learning image classification | Dec 1, 2024 | Deep Learning, Fault Detection | Unverified | 0 |
| TAROT: Targeted Data Selection via Optimal Transport | Nov 30, 2024 | motion prediction, Semantic Segmentation | Code Available | 1 |
| LMSeg: Unleashing the Power of Large-Scale Models for Open-Vocabulary Semantic Segmentation | Nov 30, 2024 | Open Vocabulary Semantic Segmentation, Open-Vocabulary Semantic Segmentation | Unverified | 0 |
| LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention | Nov 29, 2024 | Feature Upsampling, Instance Segmentation | Code Available | 0 |
| Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding | Nov 29, 2024 | 3D geometry, 3DGS | Code Available | 1 |
| Retrieval-guided Cross-view Image Synthesis | Nov 29, 2024 | Contrastive Learning, Diversity | Unverified | 0 |
| Beyond Logit Lens: Contextual Embeddings for Robust Hallucination Detection & Grounding in VLMs | Nov 28, 2024 | Attribute, Hallucination | Unverified | 0 |
| Efficient Track Anything | Nov 28, 2024 | Object, Segmentation | Code Available | 7 |
| Textured As-Is BIM via GIS-informed Point Cloud Segmentation | Nov 28, 2024 | Object Recognition, Point Cloud Segmentation | Unverified | 0 |
| InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception | Nov 28, 2024 | 3DGS, Autonomous Driving | Unverified | 0 |
| Track Anything Behind Everything: Zero-Shot Amodal Video Object Segmentation | Nov 28, 2024 | 3D Reconstruction, Segmentation | Unverified | 0 |
| FAN-Unet: Enhancing Unet with vision Fourier Analysis Block for Biomedical Image Segmentation | Nov 28, 2024 | Image Segmentation, Medical Image Segmentation | Unverified | 0 |
| MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers | Nov 28, 2024 | image-classification, Image Classification | Unverified | 0 |
| GMS-VINS:Multi-category Dynamic Objects Semantic Segmentation for Enhanced Visual-Inertial Odometry Using a Promptable Foundation Model | Nov 28, 2024 | Autonomous Vehicles, Pose Estimation | Unverified | 0 |
| On-chip Hyperspectral Image Segmentation with Fully Convolutional Networks for Scene Understanding in Autonomous Driving | Nov 28, 2024 | Autonomous Driving, Hyperspectral Image Segmentation | Unverified | 0 |
| On Moving Object Segmentation from Monocular Video with Transformers | Nov 28, 2024 | 3D geometry, Motion Segmentation | Unverified | 0 |
| The Last Mile to Supervised Performance: Semi-Supervised Domain Adaptation for Semantic Segmentation | Nov 27, 2024 | Contrastive Learning, Domain Adaptation | Unverified | 0 |
| HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior | Nov 27, 2024 | Image Super-Resolution, Segmentation | Code Available | 0 |
| Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation | Nov 26, 2024 | All, Autonomous Navigation | Unverified | 0 |
| SAM-MPA: Applying SAM to Few-shot Medical Image Segmentation using Mask Propagation and Auto-prompting | Nov 26, 2024 | Few-Shot Learning, Image Segmentation | Unverified | 0 |
| Modality-Incremental Learning with Disjoint Relevance Mapping Networks for Image-based Semantic Segmentation | Nov 26, 2024 | Autonomous Driving, Continual Learning | Unverified | 0 |