| DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment | Dec 20, 2024 | Open Vocabulary Semantic Segmentation, Open-Vocabulary Semantic Segmentation | Code Available | 0 |
| Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation | Dec 18, 2024 | Open Vocabulary Semantic Segmentation, Open-Vocabulary Semantic Segmentation | Unverified | 0 |
| MaskCLIP++: A Mask-Based CLIP Fine-tuning Framework for Open-Vocabulary Image Segmentation | Dec 16, 2024 | Image Segmentation, Open Vocabulary Semantic Segmentation | Code Available | 1 |
| VLMs meet UDA: Boosting Transferability of Open Vocabulary Segmentation with Unsupervised Domain Adaptation | Dec 12, 2024 | Domain Adaptation, Open Vocabulary Semantic Segmentation | Unverified | 0 |
| Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation | Dec 5, 2024 | Image Segmentation, Open Vocabulary Semantic Segmentation | Code Available | 2 |
| Multi-robot autonomous 3D reconstruction using Gaussian splatting with Semantic guidance | Dec 3, 2024 | 3DGS, 3D Reconstruction | Unverified | 0 |
| LMSeg: Unleashing the Power of Large-Scale Models for Open-Vocabulary Semantic Segmentation | Nov 30, 2024 | Open Vocabulary Semantic Segmentation, Open-Vocabulary Semantic Segmentation | Unverified | 0 |
| Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation | Nov 26, 2024 | Object, Open Vocabulary Semantic Segmentation | Unverified | 0 |
| HyperSeg: Towards Universal Visual Segmentation with Large Language Model | Nov 26, 2024 | Language Modeling, Large Language Model | Code Available | 2 |
| Effective SAM Combination for Open-Vocabulary Semantic Segmentation | Nov 22, 2024 | Decoder, Language Modeling | Unverified | 0 |
| CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation | Nov 21, 2024 | Open Vocabulary Semantic Segmentation, Open-Vocabulary Semantic Segmentation | Code Available | 1 |
| XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation | Nov 20, 2024 | 3D geometry, 3D Semantic Segmentation | Code Available | 1 |
| ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements | Nov 18, 2024 | Open Vocabulary Semantic Segmentation, Open-Vocabulary Semantic Segmentation | Code Available | 1 |
| CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation | Nov 15, 2024 | Open Vocabulary Semantic Segmentation, Open-Vocabulary Semantic Segmentation | Code Available | 2 |
| InvSeg: Test-Time Prompt Inversion for Semantic Segmentation | Oct 15, 2024 | Image Generation, Open Vocabulary Semantic Segmentation | Unverified | 0 |
| 3D Vision-Language Gaussian Splatting | Oct 10, 2024 | 3D Reconstruction, Autonomous Driving | Unverified | 0 |
| Open-RGBT: Open-vocabulary RGB-T Zero-shot Semantic Segmentation in Open-world Environments | Oct 9, 2024 | Open Vocabulary Semantic Segmentation, Open-Vocabulary Semantic Segmentation | Unverified | 0 |
| SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images | Oct 2, 2024 | Open Vocabulary Semantic Segmentation, Open-Vocabulary Semantic Segmentation | Code Available | 3 |
| Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels | Sep 30, 2024 | Online Clustering, Open Vocabulary Semantic Segmentation | Unverified | 0 |
| CUS3D: CLIP-based Unsupervised 3D Segmentation via Object-level Denoise | Sep 21, 2024 | Open Vocabulary Semantic Segmentation, Open-Vocabulary Semantic Segmentation | Unverified | 0 |
| MROVSeg: Breaking the Resolution Curse of Vision-Language Models in Open-Vocabulary Image Segmentation | Aug 27, 2024 | Image Segmentation, Open Vocabulary Semantic Segmentation | Unverified | 0 |
| OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras | Aug 18, 2024 | Autonomous Driving, Domain Adaptation | Code Available | 0 |
| In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation | Aug 9, 2024 | Image to text, Object | Code Available | 2 |
| ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation | Aug 9, 2024 | Open Vocabulary Semantic Segmentation, Open-Vocabulary Semantic Segmentation | Code Available | 2 |
| Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation | Aug 1, 2024 | Open Vocabulary Panoptic Segmentation, Open Vocabulary Semantic Segmentation | Code Available | 2 |