| Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation | Apr 12, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning | Apr 12, 2024 | Image SegmentationLanguage Modeling | CodeCode Available | 2 |
| ViM-UNet: Vision Mamba for Biomedical Segmentation | Apr 11, 2024 | Instance SegmentationMamba | CodeCode Available | 2 |
| Deep learning-driven pulmonary artery and vein segmentation reveals demography-associated vasculature anatomical differences | Apr 11, 2024 | AnatomySegmentation | CodeCode Available | 2 |
| Test-Time Adaptation with SaLIP: A Cascade of SAM and CLIP for Zero shot Medical Image Segmentation | Apr 9, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| LHU-Net: A Light Hybrid U-Net for Cost-Efficient, High-Performance Volumetric Medical Image Segmentation | Apr 7, 2024 | Computational EfficiencyImage Segmentation | CodeCode Available | 2 |
| Skeleton Recall Loss for Connectivity Conserving and Resource Efficient Segmentation of Thin Tubular Structures | Apr 3, 2024 | CPUGPU | CodeCode Available | 2 |
| Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model | Apr 2, 2024 | DecoderMamba | CodeCode Available | 2 |
| Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation | Apr 1, 2024 | Action SegmentationSegmentation | CodeCode Available | 2 |
| Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts | Mar 31, 2024 | Image SegmentationInteractive Segmentation | CodeCode Available | 2 |
| AgileFormer: Spatially Agile Transformer UNet for Medical Image Segmentation | Mar 29, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning | Mar 29, 2024 | Continual LearningContinual Panoptic Segmentation | CodeCode Available | 2 |
| MedCLIP-SAM: Bridging Text and Image Towards Universal Medical Image Segmentation | Mar 29, 2024 | Image SegmentationMedical Image Analysis | CodeCode Available | 2 |
| SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects | Mar 29, 2024 | 3D Object Detection3D Object Detection From Monocular Images | CodeCode Available | 2 |
| Generative Medical Segmentation | Mar 27, 2024 | DecoderDomain Generalization | CodeCode Available | 2 |
| Unleashing the Potential of SAM for Medical Adaptation via Hierarchical Decoding | Mar 27, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| Efficient Video Object Segmentation via Modulated Cross-Attention Memory | Mar 26, 2024 | GPUObject | CodeCode Available | 2 |
| TwinLiteNetPlus: A Stronger Model for Real-time Drivable Area and Lane Segmentation | Mar 25, 2024 | Autonomous DrivingDrivable Area Detection | CodeCode Available | 2 |
| Diversified and Personalized Multi-rater Medical Image Segmentation | Mar 20, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| H-vmunet: High-order Vision Mamba UNet for Medical Image Segmentation | Mar 20, 2024 | Image SegmentationLesion Segmentation | CodeCode Available | 2 |
| Better Call SAL: Towards Learning to Segment Anything in Lidar | Mar 19, 2024 | Panoptic SegmentationSegmentation | CodeCode Available | 2 |
| BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation | Mar 18, 2024 | Decision MakingScene Segmentation | CodeCode Available | 2 |
| Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery | Mar 18, 2024 | Instance SegmentationNeRF | CodeCode Available | 2 |
| PosSAM: Panoptic Open-vocabulary Segment Anything | Mar 14, 2024 | DecoderOpen Vocabulary Panoptic Segmentation | CodeCode Available | 2 |
| Caltech Aerial RGB-Thermal Dataset in the Wild | Mar 13, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Open-World Semantic Segmentation Including Class Similarity | Mar 12, 2024 | Anomaly SegmentationAutonomous Vehicles | CodeCode Available | 2 |
| FedFMS: Exploring Federated Foundation Models for Medical Image Segmentation | Mar 8, 2024 | Federated LearningImage Segmentation | CodeCode Available | 2 |
| EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation | Mar 3, 2024 | ObjectRepresentation Learning | CodeCode Available | 2 |
| Rethinking Few-shot 3D Point Cloud Semantic Segmentation | Mar 1, 2024 | Few-shot 3D Point Cloud Semantic SegmentationSegmentation | CodeCode Available | 2 |
| FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anything | Feb 29, 2024 | 3D Object ReconstructionInstance Segmentation | CodeCode Available | 2 |
| PEM: Prototype-based Efficient MaskFormer for Image Segmentation | Feb 29, 2024 | Image SegmentationPanoptic Segmentation | CodeCode Available | 2 |
| VRP-SAM: SAM with Visual Reference Prompt | Feb 27, 2024 | Meta-LearningSegmentation | CodeCode Available | 2 |
| SPINEPS -- Automatic Whole Spine Segmentation of T2-weighted MR images using a Two-Phase Approach to Multi-class Semantic and Instance Segmentation | Feb 26, 2024 | Instance SegmentationSegmentation | CodeCode Available | 2 |
| UN-SAM: Universal Prompt-Free Segmentation for Generalized Nuclei Images | Feb 26, 2024 | DecoderSegmentation | CodeCode Available | 2 |
| Subobject-level Image Tokenization | Feb 22, 2024 | AttributeLanguage Modeling | CodeCode Available | 2 |
| WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition | Feb 22, 2024 | Image-level Supervised Instance Segmentationobject-detection | CodeCode Available | 2 |
| Open-Vocabulary Segmentation with Unpaired Mask-Text Supervision | Feb 14, 2024 | Language ModellingSegmentation | CodeCode Available | 2 |
| BEFUnet: A Hybrid CNN-Transformer Architecture for Precise Medical Image Segmentation | Feb 13, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| FM-Fusion: Instance-aware Semantic Mapping Boosted by Vision-Language Foundation Models | Feb 7, 2024 | Instance SegmentationObject | CodeCode Available | 2 |
| ScribFormer: Transformer Makes CNN Work Better for Scribble-based Medical Image Segmentation | Feb 3, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| SAGD: Boundary-Enhanced Segment Anything in 3D Gaussian via Gaussian Decomposition | Jan 31, 2024 | Novel View SynthesisSegmentation | CodeCode Available | 2 |
| Vivim: a Video Vision Mamba for Medical Video Segmentation | Jan 25, 2024 | Lesion SegmentationMamba | CodeCode Available | 2 |
| Tyche: Stochastic In-Context Learning for Medical Image Segmentation | Jan 24, 2024 | Image SegmentationIn-Context Learning | CodeCode Available | 2 |
| SegmentAnyBone: A Universal Model that Segments Any Bone at Any Location on MRI | Jan 23, 2024 | MRI segmentationSegmentation | CodeCode Available | 2 |
| PA-SAM: Prompt Adapter SAM for High-Quality Image Segmentation | Jan 23, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| ClipSAM: CLIP and SAM Collaboration for Zero-Shot Anomaly Segmentation | Jan 23, 2024 | Anomaly LocalizationAnomaly Segmentation | CodeCode Available | 2 |
| CloSe: A 3D Clothing Segmentation Dataset and Model | Jan 22, 2024 | Continual Learningmodel | CodeCode Available | 2 |
| Pixel-Wise Recognition for Holistic Surgical Scene Understanding | Jan 20, 2024 | Scene UnderstandingSegmentation | CodeCode Available | 2 |
| A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting | Jan 18, 2024 | Instance SegmentationInteractive Segmentation | CodeCode Available | 2 |
| OBSeg: Accurate and Fast Instance Segmentation Framework Using Segmentation Foundation Models with Oriented Bounding Box Prompts | Jan 16, 2024 | Amodal Instance SegmentationInstance Segmentation | CodeCode Available | 2 |