| LaSagnA: Language-based Segmentation Assistant for Complex Queries | Apr 12, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| ViM-UNet: Vision Mamba for Biomedical Segmentation | Apr 11, 2024 | Instance SegmentationMamba | CodeCode Available | 2 |
| Multi-view Aggregation Network for Dichotomous Image Segmentation | Apr 11, 2024 | DecoderDichotomous Image Segmentation | CodeCode Available | 2 |
| Test-Time Adaptation with SaLIP: A Cascade of SAM and CLIP for Zero shot Medical Image Segmentation | Apr 9, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping | Apr 9, 2024 | Image RetrievalObject | CodeCode Available | 2 |
| LHU-Net: A Light Hybrid U-Net for Cost-Efficient, High-Performance Volumetric Medical Image Segmentation | Apr 7, 2024 | Computational EfficiencyImage Segmentation | CodeCode Available | 2 |
| HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras | Apr 3, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss | Apr 2, 2024 | image-classificationImage Classification | CodeCode Available | 2 |
| Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model | Apr 2, 2024 | DecoderMamba | CodeCode Available | 2 |
| T-Mamba: Frequency-Enhanced Gated Long-Range Dependency for Tooth 3D CBCT Segmentation | Apr 1, 2024 | Image SegmentationMamba | CodeCode Available | 2 |
| Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts | Mar 31, 2024 | Image SegmentationInteractive Segmentation | CodeCode Available | 2 |
| MedCLIP-SAM: Bridging Text and Image Towards Universal Medical Image Segmentation | Mar 29, 2024 | Image SegmentationMedical Image Analysis | CodeCode Available | 2 |
| ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning | Mar 29, 2024 | Continual LearningContinual Panoptic Segmentation | CodeCode Available | 2 |
| AgileFormer: Spatially Agile Transformer UNet for Medical Image Segmentation | Mar 29, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs | Mar 28, 2024 | Fine-Grained Image ClassificationImage Classification | CodeCode Available | 2 |
| Generative Medical Segmentation | Mar 27, 2024 | DecoderDomain Generalization | CodeCode Available | 2 |
| Unleashing the Potential of SAM for Medical Adaptation via Hierarchical Decoding | Mar 27, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| Efficient Video Object Segmentation via Modulated Cross-Attention Memory | Mar 26, 2024 | GPUObject | CodeCode Available | 2 |
| DreamLIP: Language-Image Pre-training with Long Captions | Mar 25, 2024 | Contrastive LearningImage-text Retrieval | CodeCode Available | 2 |
| TwinLiteNetPlus: A Stronger Model for Real-time Drivable Area and Lane Segmentation | Mar 25, 2024 | Autonomous DrivingDrivable Area Detection | CodeCode Available | 2 |
| Is Your LiDAR Placement Optimized for 3D Scene Understanding? | Mar 25, 2024 | 3D Object DetectionLIDAR Semantic Segmentation | CodeCode Available | 2 |
| LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse Kernels | Mar 22, 2024 | 3D Semantic SegmentationLIDAR Semantic Segmentation | CodeCode Available | 2 |
| Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models | Mar 21, 2024 | Image GenerationSemantic Segmentation | CodeCode Available | 2 |
| H-vmunet: High-order Vision Mamba UNet for Medical Image Segmentation | Mar 20, 2024 | Image SegmentationLesion Segmentation | CodeCode Available | 2 |
| Diversified and Personalized Multi-rater Medical Image Segmentation | Mar 20, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| Modeling the Label Distributions for Weakly-Supervised Semantic Segmentation | Mar 20, 2024 | Semantic SegmentationWeakly supervised Semantic Segmentation | CodeCode Available | 2 |
| Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery | Mar 18, 2024 | Instance SegmentationNeRF | CodeCode Available | 2 |
| BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation | Mar 18, 2024 | Decision MakingScene Segmentation | CodeCode Available | 2 |
| Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation | Mar 18, 2024 | Mixture-of-Expertsparameter-efficient fine-tuning | CodeCode Available | 2 |
| DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic Segmentation | Mar 17, 2024 | Semantic SegmentationWeakly supervised Semantic Segmentation | CodeCode Available | 2 |
| VM-UNET-V2 Rethinking Vision Mamba UNet for Medical Image Segmentation | Mar 14, 2024 | Image SegmentationMamba | CodeCode Available | 2 |
| Caltech Aerial RGB-Thermal Dataset in the Wild | Mar 13, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| LKM-UNet: Large Kernel Vision Mamba UNet for Medical Image Segmentation | Mar 12, 2024 | Image SegmentationLong-range modeling | CodeCode Available | 2 |
| Open-World Semantic Segmentation Including Class Similarity | Mar 12, 2024 | Anomaly SegmentationAutonomous Vehicles | CodeCode Available | 2 |
| SemGauss-SLAM: Dense Semantic Gaussian Splatting SLAM | Mar 12, 2024 | Semantic SegmentationSemantic SLAM | CodeCode Available | 2 |
| Frequency-Adaptive Dilated Convolution for Semantic Segmentation | Mar 8, 2024 | object-detectionObject Detection | CodeCode Available | 2 |
| FedFMS: Exploring Federated Foundation Models for Medical Image Segmentation | Mar 8, 2024 | Federated LearningImage Segmentation | CodeCode Available | 2 |
| Learning without Exact Guidance: Updating Large-scale High-resolution Land Cover Maps from Low-resolution Historical Labels | Mar 5, 2024 | Pseudo LabelSemantic Segmentation | CodeCode Available | 2 |
| AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation | Mar 4, 2024 | Semantic SegmentationSemi-Supervised Semantic Segmentation | CodeCode Available | 2 |
| EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation | Mar 3, 2024 | ObjectRepresentation Learning | CodeCode Available | 2 |
| Rethinking Few-shot 3D Point Cloud Semantic Segmentation | Mar 1, 2024 | Few-shot 3D Point Cloud Semantic SegmentationSegmentation | CodeCode Available | 2 |
| PEM: Prototype-based Efficient MaskFormer for Image Segmentation | Feb 29, 2024 | Image SegmentationPanoptic Segmentation | CodeCode Available | 2 |
| FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anything | Feb 29, 2024 | 3D Object ReconstructionInstance Segmentation | CodeCode Available | 2 |
| Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentation | Feb 28, 2024 | Semantic SegmentationTAG | CodeCode Available | 2 |
| UN-SAM: Universal Prompt-Free Segmentation for Generalized Nuclei Images | Feb 26, 2024 | DecoderSegmentation | CodeCode Available | 2 |
| SPINEPS -- Automatic Whole Spine Segmentation of T2-weighted MR images using a Two-Phase Approach to Multi-class Semantic and Instance Segmentation | Feb 26, 2024 | Instance SegmentationSegmentation | CodeCode Available | 2 |
| BEFUnet: A Hybrid CNN-Transformer Architecture for Precise Medical Image Segmentation | Feb 13, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| FM-Fusion: Instance-aware Semantic Mapping Boosted by Vision-Language Foundation Models | Feb 7, 2024 | Instance SegmentationObject | CodeCode Available | 2 |
| CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model | Feb 6, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space Model | Feb 5, 2024 | 3D Medical Imaging SegmentationImage Segmentation | CodeCode Available | 2 |