| Grounding Everything: Emerging Localization Properties in Vision-Language Transformers | Dec 1, 2023 | Image Retrieval, Object Localization | Code Available | 1 |
| MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis | Jul 18, 2024 | 3D Semantic Segmentation, Segmentation | Code Available | 1 |
| Extract Free Dense Labels from CLIP | Dec 2, 2021 | Novel Concepts, Open Vocabulary Panoptic Segmentation | Code Available | 1 |
| SAM-UNet: Enhancing Zero-Shot Segmentation of SAM for Universal Medical Images | Aug 19, 2024 | Image Segmentation, Medical Image Segmentation | Code Available | 1 |
| MatSAM: Efficient Extraction of Microstructures of Materials via Visual Large Model | Jan 11, 2024 | Image Segmentation, Prompt Engineering | Code Available | 1 |
| COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training | Dec 2, 2024 | Self-Supervised Learning, Semantic Segmentation | Code Available | 1 |
| GeoSAM: Fine-tuning SAM with Multi-Modal Prompts for Mobility Infrastructure Segmentation | Nov 19, 2023 | Image Segmentation, Large Language Model | Code Available | 1 |
| Context-aware Feature Generation for Zero-shot Semantic Segmentation | Aug 16, 2020 | Segmentation, Semantic Segmentation | Code Available | 1 |
| Learning to Generate Text-grounded Mask for Open-world Semantic Segmentation from Only Image-Text Pairs | Dec 1, 2022 | Contrastive Learning, Open Vocabulary Semantic Segmentation | Code Available | 1 |
| Generalist Vision Foundation Models for Medical Imaging: A Case Study of Segment Anything Model on Zero-Shot Medical Segmentation | Apr 25, 2023 | Computed Tomography (CT), Image Segmentation | Code Available | 1 |