| A Unified Framework for 3D Scene Understanding | Jul 3, 2024 | Contrastive LearningKnowledge Distillation | CodeCode Available | 2 |
| HiDiff: Hybrid Diffusion Framework for Medical Image Segmentation | Jul 3, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| Context-Aware Video Instance Segmentation | Jul 3, 2024 | Instance SegmentationPanoptic Segmentation | CodeCode Available | 2 |
| Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather | Jul 2, 2024 | Data AugmentationLIDAR Semantic Segmentation | CodeCode Available | 2 |
| Label Anything: Multi-Class Few-Shot Semantic Segmentation with Visual Prompts | Jul 2, 2024 | Few-Shot Semantic SegmentationSemantic Segmentation | CodeCode Available | 2 |
| The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval | Jun 26, 2024 | Action LocalizationMoment Retrieval | CodeCode Available | 2 |
| Stable Diffusion Segmentation for Biomedical Images with Single-step Reverse Process | Jun 26, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| SegNet4D: Efficient Instance-Aware 4D Semantic Segmentation for LiDAR Point Cloud | Jun 24, 2024 | Autonomous DrivingAutonomous Navigation | CodeCode Available | 2 |
| Are Vision xLSTM Embedded UNet More Reliable in Medical 3D Image Segmentation? | Jun 24, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| SelfReg-UNet: Self-Regularized UNet for Medical Image Segmentation | Jun 21, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation | Jun 17, 2024 | DecoderSegmentation | CodeCode Available | 2 |
| Scaling Efficient Masked Image Modeling on Large Remote Sensing Dataset | Jun 17, 2024 | Aerial Scene ClassificationDiversity | CodeCode Available | 2 |
| Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset | Jun 10, 2024 | Instance SegmentationSalient Object Detection | CodeCode Available | 2 |
| Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language | Jun 9, 2024 | Contrastive LearningCross-Modal Retrieval | CodeCode Available | 2 |
| Medical Vision Generalist: Unifying Medical Imaging Tasks in Context | Jun 8, 2024 | Conditional Image GenerationDenoising | CodeCode Available | 2 |
| DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation | Jun 6, 2024 | Real-Time Semantic SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Parameter-Inverted Image Pyramid Networks | Jun 6, 2024 | Computational Efficiencyimage-classification | CodeCode Available | 2 |
| DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut | Jun 5, 2024 | Image SegmentationSegmentation | CodeCode Available | 2 |
| U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation | Jun 5, 2024 | Image SegmentationKolmogorov-Arnold Networks | CodeCode Available | 2 |
| Generative Active Learning for Long-tailed Instance Segmentation | Jun 4, 2024 | Active LearningInstance Segmentation | CodeCode Available | 2 |
| GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer | Jun 3, 2024 | 3D Object DetectionImage-to-Image Translation | CodeCode Available | 2 |
| Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation | Jun 2, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| TotalVibeSegmentator: Full Body MRI Segmentation for the NAKO and UK Biobank | May 31, 2024 | EpidemiologyHoldout Set | CodeCode Available | 2 |
| Open-Set Domain Adaptation for Semantic Segmentation | May 30, 2024 | Domain AdaptationSemantic Segmentation | CodeCode Available | 2 |
| Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation | May 28, 2024 | Instance SegmentationObject Proposal Generation | CodeCode Available | 2 |
| Memorize What Matters: Emergent Scene Decomposition from Multitraverse | May 27, 2024 | 3D ReconstructionNeural Rendering | CodeCode Available | 2 |
| Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models | May 27, 2024 | SegmentationSemantic correspondence | CodeCode Available | 2 |
| Mamba-R: Vision Mamba ALSO Needs Registers | May 23, 2024 | MambaSemantic Segmentation | CodeCode Available | 2 |
| KPConvX: Modernizing Kernel Point Convolution with Kernel Attention | May 21, 2024 | 3D Point Cloud ClassificationSemantic Segmentation | CodeCode Available | 2 |
| DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data | May 16, 2024 | Data AugmentationDiversity | CodeCode Available | 2 |
| Context-Guided Spatial Feature Reconstruction for Efficient Semantic Segmentation | May 10, 2024 | Semantic Segmentation | CodeCode Available | 2 |
| Modality-agnostic Domain Generalizable Medical Image Segmentation by Multi-Frequency in Multi-Scale Attention | May 10, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs | May 10, 2024 | graph constructionimage-classification | CodeCode Available | 2 |
| OpenESS: Event-based Semantic Scene Understanding with Open Vocabularies | May 8, 2024 | Domain AdaptationScene Understanding | CodeCode Available | 2 |
| PTQ4SAM: Post-Training Quantization for Segment Anything | May 6, 2024 | Instance Segmentationobject-detection | CodeCode Available | 2 |
| MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning | May 4, 2024 | Earth Observationimage-classification | CodeCode Available | 2 |
| ASAM: Boosting Segment Anything Model with Adversarial Tuning | May 1, 2024 | Image Segmentationmodel | CodeCode Available | 2 |
| Adaptive Bidirectional Displacement for Semi-Supervised Medical Image Segmentation | May 1, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| LVOS: A Benchmark for Large-scale Long-term Video Object Segmentation | Apr 30, 2024 | AttributeSemantic Segmentation | CodeCode Available | 2 |
| Multi-Scale Representations by Varying Window Attention for Semantic Segmentation | Apr 25, 2024 | DecoderSemantic Segmentation | CodeCode Available | 2 |
| A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic Segmentation | Apr 25, 2024 | Autonomous DrivingEvolutionary Algorithms | CodeCode Available | 2 |
| Multimodal Information Interaction for Medical Image Segmentation | Apr 25, 2024 | Heart SegmentationImage Segmentation | CodeCode Available | 2 |
| Dynamic in Static: Hybrid Visual Correspondence for Self-Supervised Video Object Segmentation | Apr 21, 2024 | Semantic SegmentationVideo Object Segmentation | CodeCode Available | 2 |
| Augmented Object Intelligence with XR-Objects | Apr 20, 2024 | ObjectSemantic Segmentation | CodeCode Available | 2 |
| Generalized Few-Shot Meets Remote Sensing: Discovering Novel Classes in Land Cover Mapping via Hybrid Semantic Segmentation Framework | Apr 19, 2024 | Earth ObservationSegmentation | CodeCode Available | 2 |
| MambaMOS: LiDAR-based 3D Moving Object Segmentation with Motion-aware State Space Model | Apr 19, 2024 | ObjectSemantic Segmentation | CodeCode Available | 2 |
| An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training | Apr 18, 2024 | Contrastive LearningCPU | CodeCode Available | 2 |
| Constructing and Exploring Intermediate Domains in Mixed Domain Semi-supervised Medical Image Segmentation | Apr 13, 2024 | Domain AdaptationImage Segmentation | CodeCode Available | 2 |
| LaSagnA: Language-based Segmentation Assistant for Complex Queries | Apr 12, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning | Apr 12, 2024 | Image SegmentationLanguage Modeling | CodeCode Available | 2 |