| A Unified Framework for 3D Scene Understanding | Jul 3, 2024 | Contrastive Learning, Knowledge Distillation | Code Available | 2 |
| HiDiff: Hybrid Diffusion Framework for Medical Image Segmentation | Jul 3, 2024 | Image Segmentation, Medical Image Segmentation | Code Available | 2 |
| Context-Aware Video Instance Segmentation | Jul 3, 2024 | Instance Segmentation, Panoptic Segmentation | Code Available | 2 |
| Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather | Jul 2, 2024 | Data Augmentation, LIDAR Semantic Segmentation | Code Available | 2 |
| Label Anything: Multi-Class Few-Shot Semantic Segmentation with Visual Prompts | Jul 2, 2024 | Few-Shot Semantic Segmentation, Semantic Segmentation | Code Available | 2 |
| Stable Diffusion Segmentation for Biomedical Images with Single-step Reverse Process | Jun 26, 2024 | Image Segmentation, Medical Image Segmentation | Code Available | 2 |
| The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval | Jun 26, 2024 | Action Localization, Moment Retrieval | Code Available | 2 |
| Are Vision xLSTM Embedded UNet More Reliable in Medical 3D Image Segmentation? | Jun 24, 2024 | Image Segmentation, Medical Image Segmentation | Code Available | 2 |
| SegNet4D: Efficient Instance-Aware 4D Semantic Segmentation for LiDAR Point Cloud | Jun 24, 2024 | Autonomous Driving, Autonomous Navigation | Code Available | 2 |
| SelfReg-UNet: Self-Regularized UNet for Medical Image Segmentation | Jun 21, 2024 | Decoder, Image Segmentation | Code Available | 2 |
| Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation | Jun 17, 2024 | Decoder, Segmentation | Code Available | 2 |
| Scaling Efficient Masked Image Modeling on Large Remote Sensing Dataset | Jun 17, 2024 | Aerial Scene Classification, Diversity | Code Available | 2 |
| Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset | Jun 10, 2024 | Instance Segmentation, Salient Object Detection | Code Available | 2 |
| Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language | Jun 9, 2024 | Contrastive Learning, Cross-Modal Retrieval | Code Available | 2 |
| Medical Vision Generalist: Unifying Medical Imaging Tasks in Context | Jun 8, 2024 | Conditional Image Generation, Denoising | Code Available | 2 |
| DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation | Jun 6, 2024 | Real-Time Semantic Segmentation, Semantic Segmentation | Code Available | 2 |
| Parameter-Inverted Image Pyramid Networks | Jun 6, 2024 | Computational Efficiency, image-classification | Code Available | 2 |
| U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation | Jun 5, 2024 | Image Segmentation, Kolmogorov-Arnold Networks | Code Available | 2 |
| DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut | Jun 5, 2024 | Image Segmentation, Segmentation | Code Available | 2 |
| Generative Active Learning for Long-tailed Instance Segmentation | Jun 4, 2024 | Active Learning, Instance Segmentation | Code Available | 2 |
| GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer | Jun 3, 2024 | 3D Object Detection, Image-to-Image Translation | Code Available | 2 |
| Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation | Jun 2, 2024 | Segmentation, Semantic Segmentation | Code Available | 2 |
| TotalVibeSegmentator: Full Body MRI Segmentation for the NAKO and UK Biobank | May 31, 2024 | Epidemiology, Holdout Set | Code Available | 2 |
| Open-Set Domain Adaptation for Semantic Segmentation | May 30, 2024 | Domain Adaptation, Semantic Segmentation | Code Available | 2 |
| Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation | May 28, 2024 | Instance Segmentation, Object Proposal Generation | Code Available | 2 |