| Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis | Dec 4, 2024 | Image GenerationImage Segmentation | —Unverified | 0 |
| Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution | Dec 4, 2024 | Image RestorationImage Super-Resolution | —Unverified | 0 |
| FLAIR: VLM with Fine-grained Language-informed Image Representations | Dec 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Benchmarking Pretrained Attention-based Models for Real-Time Recognition in Robot-Assisted Esophagectomy | Dec 4, 2024 | AnatomyBenchmarking | —Unverified | 0 |
| Multi-robot autonomous 3D reconstruction using Gaussian splatting with Semantic guidance | Dec 3, 2024 | 3DGS3D Reconstruction | —Unverified | 0 |
| Topology-Preserving Image Segmentation with Spatial-Aware Persistent Feature Matching | Dec 3, 2024 | Image SegmentationSegmentation | —Unverified | 0 |
| Multi-scale and Multi-path Cascaded Convolutional Network for Semantic Segmentation of Colorectal Polyps | Dec 3, 2024 | SegmentationSemantic Segmentation | —Unverified | 0 |
| Active Negative Loss: A Robust Framework for Learning with Noisy Labels | Dec 3, 2024 | Image SegmentationLearning with noisy labels | CodeCode Available | 1 |
| SJTU:Spatial judgments in multimodal models towards unified segmentation through coordinate detection | Dec 3, 2024 | GPUImage Segmentation | CodeCode Available | 0 |
| Vision Transformers for Weakly-Supervised Microorganism Enumeration | Dec 3, 2024 | Density EstimationInstance Segmentation | CodeCode Available | 0 |
| Low-Contrast-Enhanced Contrastive Learning for Semi-Supervised Endoscopic Image Segmentation | Dec 3, 2024 | Contrastive LearningImage Segmentation | CodeCode Available | 0 |
| AH-OCDA: Amplitude-based Curriculum Learning and Hopfield Segmentation Model for Open Compound Domain Adaptation | Dec 3, 2024 | Domain AdaptationSegmentation | —Unverified | 0 |
| U-Net in Medical Image Segmentation: A Review of Its Applications Across Modalities | Dec 3, 2024 | AnatomyComputed Tomography (CT) | —Unverified | 0 |
| INSIGHT: Explainable Weakly-Supervised Medical Image Analysis | Dec 2, 2024 | Inductive BiasMedical Image Analysis | —Unverified | 0 |
| Epipolar Attention Field Transformers for Bird's Eye View Semantic Segmentation | Dec 2, 2024 | Bird's-Eye View Semantic SegmentationSemantic Segmentation | —Unverified | 0 |
| Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior | Dec 2, 2024 | Face RecognitionImage Compression | —Unverified | 0 |
| 3DSceneEditor: Controllable 3D Scene Editing with Gaussian Splatting | Dec 2, 2024 | 3D scene EditingImage to 3D | —Unverified | 0 |
| Holistic Understanding of 3D Scenes as Universal Scene Description | Dec 2, 2024 | Instance SegmentationMixed Reality | —Unverified | 0 |
| Multi-Granularity Video Object Segmentation | Dec 2, 2024 | ObjectSegmentation | CodeCode Available | 1 |
| Global Average Feature Augmentation for Robust Semantic Segmentation with Transformers | Dec 2, 2024 | Semantic Segmentation | —Unverified | 0 |
| Referring Video Object Segmentation via Language-aligned Track Selection | Dec 2, 2024 | ObjectObject Tracking | CodeCode Available | 1 |
| COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training | Dec 2, 2024 | Self-Supervised LearningSemantic Segmentation | CodeCode Available | 1 |
| A2VIS: Amodal-Aware Approach to Video Instance Segmentation | Dec 2, 2024 | Instance SegmentationMultiple Object Tracking | —Unverified | 0 |
| TSUBF-Net: Trans-Spatial UNet-like Network with Bi-direction Fusion for Segmentation of Adenoid Hypertrophy in CT | Dec 1, 2024 | Computed Tomography (CT)Image Segmentation | —Unverified | 0 |
| Token Cropr: Faster ViTs for Quite a Few Tasks | Dec 1, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| A Semi-Supervised Approach with Error Reflection for Echocardiography Segmentation | Dec 1, 2024 | Data AugmentationImage Segmentation | —Unverified | 0 |
| DPE-Net: Dual-Parallel Encoder Based Network for Semantic Segmentation of Polyps | Dec 1, 2024 | DiversityImage Segmentation | —Unverified | 0 |
| SyncVIS: Synchronized Video Instance Segmentation | Dec 1, 2024 | Instance SegmentationSegmentation | CodeCode Available | 1 |
| 2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification | Dec 1, 2024 | Computational Efficiencyimage-classification | CodeCode Available | 2 |
| SPF-Net: Solar panel fault detection using U-Net based deep learning image classification | Dec 1, 2024 | Deep LearningFault Detection | —Unverified | 0 |
| TAROT: Targeted Data Selection via Optimal Transport | Nov 30, 2024 | motion predictionSemantic Segmentation | CodeCode Available | 1 |
| LMSeg: Unleashing the Power of Large-Scale Models for Open-Vocabulary Semantic Segmentation | Nov 30, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | —Unverified | 0 |
| LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention | Nov 29, 2024 | Feature UpsamplingInstance Segmentation | CodeCode Available | 0 |
| Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding | Nov 29, 2024 | 3D geometry3DGS | CodeCode Available | 1 |
| Retrieval-guided Cross-view Image Synthesis | Nov 29, 2024 | Contrastive LearningDiversity | —Unverified | 0 |
| Beyond Logit Lens: Contextual Embeddings for Robust Hallucination Detection & Grounding in VLMs | Nov 28, 2024 | AttributeHallucination | —Unverified | 0 |
| Efficient Track Anything | Nov 28, 2024 | ObjectSegmentation | CodeCode Available | 7 |
| Textured As-Is BIM via GIS-informed Point Cloud Segmentation | Nov 28, 2024 | Object RecognitionPoint Cloud Segmentation | —Unverified | 0 |
| InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception | Nov 28, 2024 | 3DGSAutonomous Driving | —Unverified | 0 |
| Track Anything Behind Everything: Zero-Shot Amodal Video Object Segmentation | Nov 28, 2024 | 3D ReconstructionSegmentation | —Unverified | 0 |
| FAN-Unet: Enhancing Unet with vision Fourier Analysis Block for Biomedical Image Segmentation | Nov 28, 2024 | Image SegmentationMedical Image Segmentation | —Unverified | 0 |
| MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers | Nov 28, 2024 | image-classificationImage Classification | —Unverified | 0 |
| GMS-VINS:Multi-category Dynamic Objects Semantic Segmentation for Enhanced Visual-Inertial Odometry Using a Promptable Foundation Model | Nov 28, 2024 | Autonomous VehiclesPose Estimation | —Unverified | 0 |
| On-chip Hyperspectral Image Segmentation with Fully Convolutional Networks for Scene Understanding in Autonomous Driving | Nov 28, 2024 | Autonomous DrivingHyperspectral Image Segmentation | —Unverified | 0 |
| On Moving Object Segmentation from Monocular Video with Transformers | Nov 28, 2024 | 3D geometryMotion Segmentation | —Unverified | 0 |
| The Last Mile to Supervised Performance: Semi-Supervised Domain Adaptation for Semantic Segmentation | Nov 27, 2024 | Contrastive LearningDomain Adaptation | —Unverified | 0 |
| HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior | Nov 27, 2024 | Image Super-ResolutionSegmentation | CodeCode Available | 0 |
| Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation | Nov 26, 2024 | AllAutonomous Navigation | —Unverified | 0 |
| SAM-MPA: Applying SAM to Few-shot Medical Image Segmentation using Mask Propagation and Auto-prompting | Nov 26, 2024 | Few-Shot LearningImage Segmentation | —Unverified | 0 |
| Modality-Incremental Learning with Disjoint Relevance Mapping Networks for Image-based Semantic Segmentation | Nov 26, 2024 | Autonomous DrivingContinual Learning | —Unverified | 0 |