Open Vocabulary Object Detection

Open-vocabulary detection (OVD) aims to generalize beyond the limited set of base classes labeled during training. At inference, the detector must also find novel classes drawn from an unbounded (open) vocabulary.
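The base/novel split can be illustrated with a minimal sketch. It assumes a common design in which a region embedding is scored against class-name embeddings by cosine similarity, so the vocabulary can be swapped at inference. This page does not prescribe that method. The function names and toy vectors below are hypothetical stand-ins for what a vision-language model would produce.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def classify_region(region_embedding, vocabulary):
    """Return (label, score) for the vocabulary entry closest to the region.

    `vocabulary` maps a class name to its embedding. Because it is an
    argument rather than a fixed classifier head, new class names can be
    added at inference without retraining (the assumption of this sketch).
    """
    scores = {name: cosine(region_embedding, emb) for name, emb in vocabulary.items()}
    best = max(scores, key=scores.get)
    return best, scores[best]

# Hypothetical hand-made embeddings; real ones would come from a text encoder.
base_vocab = {"cat": [1.0, 0.0, 0.0], "dog": [0.0, 1.0, 0.0]}   # labeled in training
novel_vocab = {"zebra": [0.0, 0.0, 1.0]}                          # only named at inference

region = [0.1, 0.2, 0.9]  # hypothetical embedding of one detected region

# Closed vocabulary: the region is forced onto the nearest base class.
label_base, _ = classify_region(region, base_vocab)

# Open vocabulary: the novel class name is simply appended at inference.
label_open, _ = classify_region(region, {**base_vocab, **novel_vocab})

print(label_base, label_open)
```

With only the base vocabulary the region is assigned to a base class. Once the novel name is added it is matched to that name instead, which is the behavior the novel-class AP metrics in the benchmark tables are meant to measure.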

Papers


Showing 26–50 of 145 papers

Title	Date	Tasks	Status	Hype	Score
Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection	Mar 10, 2023	Object, Open-vocabulary object detection	Code Available	1	5
Meta-Adapter: An Online Few-shot Learner for Vision-Language Model	Nov 7, 2023	Few-Shot Learning, image-classification	Code Available	1	5
DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection	Oct 2, 2023	Novel Object Detection, Object	Code Available	1	5
CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching	Mar 23, 2023	Described Object Detection, object-detection	Code Available	1	5
Multi-Modal Classifiers for Open-Vocabulary Object Detection	Jun 8, 2023	Language Modelling, Large Language Model	Code Available	1	5
MoCaE: Mixture of Calibrated Experts Significantly Improves Object Detection	Sep 26, 2023	Instance Segmentation, Mixture-of-Experts	Code Available	1	5
Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object Detection	Dec 23, 2024	object-detection, Object Detection	Code Available	1	5
Distilling DETR with Visual-Linguistic Knowledge for Open-Vocabulary Object Detection	Jan 1, 2023	Knowledge Distillation, Language Modeling	Code Available	1	5
A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection	Mar 13, 2025	object-detection, Object Detection	Code Available	1	5
GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection	Dec 22, 2023	Attribute, object-detection	Code Available	1	5
MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection	Jul 31, 2024	Language Modelling, Object	Code Available	1	5
Open-Vocabulary One-Stage Detection with Hierarchical Visual-Language Knowledge Distillation	Mar 20, 2022	Knowledge Distillation, Language Modelling	Code Available	1	5
OvarNet: Towards Open-vocabulary Object Attribute Recognition	Jan 23, 2023	Attribute, Knowledge Distillation	Code Available	1	5
Localized Vision-Language Matching for Open-vocabulary Object Detection	May 12, 2022	Language Modeling, Language Modelling	Code Available	1	5
From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects	Nov 27, 2024	Autonomous Driving, Object	Code Available	1	5
CLIM: Contrastive Language-Image Mosaic for Region Representation	Dec 18, 2023	Object, object-detection	Code Available	1	5
Query3D: LLM-Powered Open-Vocabulary Scene Segmentation with Language Embedded 3D Gaussian	Aug 7, 2024	Autonomous Driving, object-detection	Code Available	1	5
Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model	Mar 28, 2022	image-classification, Image Classification	Code Available	1	5
Described Object Detection: Liberating Object Detection with Flexible Expressions	Jul 24, 2023	Binary Classification, Described Object Detection	Code Available	1	5
LP-OVOD: Open-Vocabulary Object Detection by Linear Probing	Oct 26, 2023	Object, object-detection	Code Available	1	5
Open-vocabulary Attribute Detection	Nov 23, 2022	Attribute, Language Modeling	Code Available	1	5
Open-Vocabulary Object Detection Using Captions	Nov 20, 2020	Object, object-detection	Code Available	1	5
Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning	Nov 20, 2023	Object, object-detection	Code Available	1	5
CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection	Oct 25, 2023	Object, object-detection	Code Available	1	5
DART: An Automated End-to-End Object Detection Pipeline with Data Diversification, Open-Vocabulary Bounding Box Annotation, Pseudo-Label Review, and Model Training	Jul 12, 2024	Image Generation, Object	Code Available	1	5


Datasets: MSCOCO, LVIS v1.0, Objects365, OpenImages-v4

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Cooperative Foundational Models	AP 0.5	50.3	—	Unverified
2	DE-ViT	AP 0.5	50	—	Unverified
3	Yolov8-nano	AP 0.5	47.2	—	Unverified
4	DITO	AP 0.5	46.1	—	Unverified
5	OV-DQUO (RN50x4)	AP 0.5	45.6	—	Unverified
6	LP-OVOD (OWL-ViT Proposals)	AP 0.5	44.9	—	Unverified
7	CLIPSelf	AP 0.5	44.3	—	Unverified
8	CORA+	AP 0.5	43.1	—	Unverified
9	BARON	AP 0.5	42.7	—	Unverified
10	SIA-OVD (RN50x4)	AP 0.5	41.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LaMI-DETR	AP novel-LVIS base training	43.4	—	Unverified
2	DITO	AP novel-LVIS base training	40.4	—	Unverified
3	OV-DQUO (ViT-L/14)	AP novel-LVIS base training	39.3	—	Unverified
4	CoDet (EVA02-L)	AP novel-LVIS base training	37	—	Unverified
5	CLIPSelf	AP novel-LVIS base training	34.9	—	Unverified
6	OVMR	AP novel-LVIS base training	34.4	—	Unverified
7	DE-ViT	AP novel-LVIS base training	34.3	—	Unverified
8	CFM-ViT	AP novel-LVIS base training	33.9	—	Unverified
9	CLIM (RN50x64)	AP novel-LVIS base training	32.3	—	Unverified
10	RO-ViT	AP novel-LVIS base training	32.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Object-Centric-OVD	mask AP50	22.3	—	Unverified
2	ViLD	mask AP50	18.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Object-Centric-OVD	mask AP50	42.9	—	Unverified
2	Detic	mask AP50	42.2	—	Unverified