PointCLIP: Point Cloud Understanding by CLIP Dec 4, 2021 3D Open-Vocabulary Instance Segmentation Few-Shot Learning
Code Code Available 15 ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for Open-Vocabulary Object Detection Dec 12, 2023 object-detection Object Detection
Code Code Available 15 Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers May 11, 2023 Contrastive Learning Image-text Retrieval
Code Code Available 15 RegionCLIP: Region-based Language-Image Pretraining Dec 16, 2021 image-classification Image Classification
Code Code Available 15 Retrieval-Augmented Open-Vocabulary Object Detection Apr 8, 2024 Language Modeling Language Modelling
Code Code Available 15 RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection May 30, 2024 Image Captioning Image Inpainting
Code Code Available 15 CLIM: Contrastive Language-Image Mosaic for Region Representation Dec 18, 2023 Object object-detection
Code Code Available 15 SIA-OVD: Shape-Invariant Adapter for Bridging the Image-Region Gap in Open-Vocabulary Detection Oct 8, 2024 object-detection Object Detection
Code Code Available 15 Simple Image-level Classification Improves Open-vocabulary Object Detection Dec 16, 2023 Knowledge Distillation Object
Code Code Available 15 Superpowering Open-Vocabulary Object Detectors for X-ray Vision Mar 21, 2025 object-detection Object Detection
Code Code Available 15 The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understanding Nov 29, 2023 Object object-detection
Code Code Available 15 The devil is in the object boundary: towards annotation-free instance segmentation using Foundation Models Apr 18, 2024 Instance Segmentation Object
Code Code Available 15 Open Vocabulary Object Detection with Pseudo Bounding-Box Labels Nov 18, 2021 Object object-detection
Code Code Available 15 Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation Apr 12, 2024 Object object-detection
Code Code Available 15 MoCaE: Mixture of Calibrated Experts Significantly Improves Object Detection Sep 26, 2023 Instance Segmentation Mixture-of-Experts
Code Code Available 15 Enhancing Novel Object Detection via Cooperative Foundational Models Nov 19, 2023 Novel Class Discovery Novel Object Detection
Code Code Available 15 CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching Mar 23, 2023 Described Object Detection object-detection
Code Code Available 15 Multi-Modal Classifiers for Open-Vocabulary Object Detection Jun 8, 2023 Language Modelling Large Language Model
Code Code Available 15 Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection Mar 10, 2023 Object Open-vocabulary object detection
Code Code Available 15 Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention Nov 18, 2023 Concept Alignment Graph Generation
Code Code Available 15 Exploiting Unlabeled Data with Vision and Language Models for Object Detection Jul 18, 2022 Object object-detection
Code Code Available 15 Open-vocabulary Attribute Detection Nov 23, 2022 Attribute Language Modeling
Code Code Available 15 Open-Vocabulary Object Detection Using Captions Nov 20, 2020 Object object-detection
Code Code Available 15 DART: An Automated End-to-End Object Detection Pipeline with Data Diversification, Open-Vocabulary Bounding Box Annotation, Pseudo-Label Review, and Model Training Jul 12, 2024 Image Generation Object
Code Code Available 15 A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection Mar 13, 2025 object-detection Object Detection
Code Code Available 15 Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object Detection Dec 23, 2024 object-detection Object Detection
Code Code Available 15 Localized Vision-Language Matching for Open-vocabulary Object Detection May 12, 2022 Language Modeling Language Modelling
Code Code Available 15 LP-OVOD: Open-Vocabulary Object Detection by Linear Probing Oct 26, 2023 Object object-detection
Code Code Available 15 Aligning Bag of Regions for Open-Vocabulary Object Detection Feb 27, 2023 Object object-detection
Code Code Available 15 MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection Jul 31, 2024 Language Modelling Object
Code Code Available 15 Meta-Adapter: An Online Few-shot Learner for Vision-Language Model Nov 7, 2023 Few-Shot Learning image-classification
Code Code Available 15 A Lightweight Modular Framework for Low-Cost Open-Vocabulary Object Detection Training Aug 20, 2024 Autonomous Vehicles Computational Efficiency
Code Code Available 05 F-VLM: Open-Vocabulary Object Detection upon Frozen Vision and Language Models Sep 30, 2022 Knowledge Distillation object-detection
Code Code Available 05 Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection Mar 14, 2025 object-detection Object Detection
Code Code Available 05 LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation Mar 18, 2025 Decoder Object
Code Code Available 05 Scaling Open-Vocabulary Object Detection Jun 16, 2023 image-classification Image Classification
Code Code Available 05 Generating Enhanced Negatives for Training Language-Based Object Detectors Dec 29, 2023 Object object-detection
Code Code Available 05 Region-centric Image-Language Pretraining for Open-Vocabulary Detection Sep 29, 2023 Contrastive Learning Object
Code Code Available 05 MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks Mar 29, 2023 Cross-Modal Retrieval Decoder
Code Code Available 05 Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion Jul 15, 2024 image-classification Image Classification
Code Code Available 05 Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability Oct 20, 2024 Few-Shot Object Detection image-classification
Code Code Available 05 Simple Open-Vocabulary Object Detection with Vision Transformers May 12, 2022 Described Object Detection image-classification
Code Code Available 05 Open-Vocabulary Object Detection via Scene Graph Discovery Jul 7, 2023 Decoder Graph Generation
— Unverified 00 An Application-Agnostic Automatic Target Recognition System Using Vision Language Models Nov 5, 2024 object-detection Object Detection
— Unverified 00 An Iterative Feedback Mechanism for Improving Natural Language Class Descriptions in Open-Vocabulary Object Detection Mar 21, 2025 object-detection Object Detection
— Unverified 00 ATAS: Any-to-Any Self-Distillation for Enhanced Open-Vocabulary Dense Prediction Jun 10, 2025 object-detection Object Detection
— Unverified 00 BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs Jul 3, 2024 Image Captioning Image Generation
— Unverified 00 Boosting Open-Vocabulary Object Detection by Handling Background Samples Oct 11, 2024 object-detection Object Detection
— Unverified 00 Contrastive Feature Masking Open-Vocabulary Vision Transformer Sep 2, 2023 Contrastive Learning Image-text Retrieval
— Unverified 00 DenseVLM: A Retrieval and Decoupled Alignment Framework for Open-Vocabulary Dense Prediction Dec 9, 2024 Image Segmentation object-detection
— Unverified 00