| Image Segmentation in Foundation Model Era: A Survey | Aug 23, 2024 | Image SegmentationInstance Segmentation | CodeCode Available | 2 |
| HMT-UNet: A hybird Mamba-Transformer Vision UNet for Medical Image Segmentation | Aug 21, 2024 | Image SegmentationMamba | CodeCode Available | 2 |
| UNetMamba: An Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images | Aug 21, 2024 | MambaSegmentation | CodeCode Available | 2 |
| MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series Analysis | Aug 14, 2024 | Anomaly DetectionBoundary Detection | CodeCode Available | 2 |
| Robust Semi-supervised Multimodal Medical Image Segmentation via Cross Modality Collaboration | Aug 14, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation | Aug 13, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation | Aug 9, 2024 | Image to textObject | CodeCode Available | 2 |
| ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation | Aug 9, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications | Aug 7, 2024 | image-classificationImage Classification | CodeCode Available | 2 |
| DaCapo: a modular deep learning framework for scalable 3D image segmentation | Aug 5, 2024 | Image SegmentationManagement | CodeCode Available | 2 |
| StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation | Aug 2, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach | Aug 2, 2024 | cross-modal alignmentMultiple Object Tracking | CodeCode Available | 2 |
| Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation | Aug 1, 2024 | Open Vocabulary Panoptic SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| MSA^2Net: Multi-scale Adaptive Attention-guided Network for Medical Image Segmentation | Jul 31, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| RefMask3D: Language-Guided Transformer for 3D Referring Segmentation | Jul 25, 2024 | 3D visual groundingImage Segmentation | CodeCode Available | 2 |
| ESP-MedSAM: Efficient Self-Prompting SAM for Universal Domain-Generalized Medical Image Segmentation | Jul 19, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| GroupMamba: Efficient Group-Based Visual State Space Model | Jul 18, 2024 | image-classificationImage Classification | CodeCode Available | 2 |
| Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes | Jul 16, 2024 | Human Instance SegmentationInstance Segmentation | CodeCode Available | 2 |
| SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds | Jul 16, 2024 | LIDAR Semantic SegmentationSemantic Segmentation | CodeCode Available | 2 |
| DiffRect: Latent Diffusion Label Rectification for Semi-supervised Medical Image Segmentation | Jul 13, 2024 | DenoisingImage Segmentation | CodeCode Available | 2 |
| IRSAM: Advancing Segment Anything Model for Infrared Small Target Detection | Jul 10, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| Satellite Image Time Series Semantic Change Detection: Novel Architecture and Analysis of Domain Shift | Jul 10, 2024 | Change DetectionDisaster Response | CodeCode Available | 2 |
| Exploiting Scale-Variant Attention for Segmenting Small Medical Objects | Jul 10, 2024 | Cell SegmentationMRI segmentation | CodeCode Available | 2 |
| LuSNAR:A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous Exploration | Jul 9, 2024 | 3D ReconstructionAutonomous Navigation | CodeCode Available | 2 |
| Training-free CryoET Tomogram Segmentation | Jul 8, 2024 | Contrastive LearningCryogenic Electron Tomography | CodeCode Available | 2 |