Open Vocabulary Object Detection

Open-vocabulary detection (OVD) aims to generalize beyond the limited number of base classes labeled during the training phase. The goal is to detect novel classes defined by an unbounded (open) vocabulary at inference.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 76–100 of 145 papers

Title	Date	Tasks	Status	Hype
Open-Vocabulary One-Stage Detection with Hierarchical Visual-Language Knowledge Distillation	Mar 20, 2022	Knowledge DistillationLanguage Modelling	CodeCode Available	1
RegionCLIP: Region-based Language-Image Pretraining	Dec 16, 2021	image-classificationImage Classification	CodeCode Available	1
PointCLIP: Point Cloud Understanding by CLIP	Dec 4, 2021	3D Open-Vocabulary Instance SegmentationFew-Shot Learning	CodeCode Available	1
Open Vocabulary Object Detection with Pseudo Bounding-Box Labels	Nov 18, 2021	Objectobject-detection	CodeCode Available	1
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation	Apr 28, 2021	image-classificationImage Classification	CodeCode Available	1
Open-Vocabulary Object Detection Using Captions	Nov 20, 2020	Objectobject-detection	CodeCode Available	1
ATAS: Any-to-Any Self-Distillation for Enhanced Open-Vocabulary Dense Prediction	Jun 10, 2025	object-detectionObject Detection	—Unverified	0
Gen-n-Val: Agentic Image Data Generation and Validation	Jun 5, 2025	Image HarmonizationInstance Segmentation	—Unverified	0
From Data to Modeling: Fully Open-vocabulary Scene Graph Generation	May 26, 2025	Graph GenerationKnowledge Distillation	—Unverified	0
An Iterative Feedback Mechanism for Improving Natural Language Class Descriptions in Open-Vocabulary Object Detection	Mar 21, 2025	object-detectionObject Detection	—Unverified	0
Fine-Grained Open-Vocabulary Object Detection with Fined-Grained Prompts: Task, Dataset and Benchmark	Mar 19, 2025	Objectobject-detection	—Unverified	0
LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation	Mar 18, 2025	DecoderObject	CodeCode Available	0
Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection	Mar 14, 2025	object-detectionObject Detection	CodeCode Available	0
DitHub: A Modular Framework for Incremental Open-Vocabulary Object Detection	Mar 12, 2025	object-detectionObject Detection	—Unverified	0
OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images	Mar 8, 2025	Objectobject-detection	—Unverified	0
MQADet: A Plug-and-Play Paradigm for Enhancing Open-Vocabulary Object Detection via Multimodal Question Answering	Feb 23, 2025	Objectobject-detection	—Unverified	0
Learning the RoPEs: Better 2D and 3D Position Encodings with STRING	Feb 4, 2025	object-detectionObject Detection	—Unverified	0
Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection	Jan 28, 2025	object-detectionObject Detection	—Unverified	0
Open-World Objectness Modeling Unifies Novel Object Detection	Jan 1, 2025	Novel Object Detectionobject-detection	—Unverified	0
Sampling Bag of Views for Open-Vocabulary Object Detection	Dec 24, 2024	object-detectionObject Detection	—Unverified	0
DenseVLM: A Retrieval and Decoupled Alignment Framework for Open-Vocabulary Dense Prediction	Dec 9, 2024	Image Segmentationobject-detection	—Unverified	0
Fine-Grained Open-Vocabulary Object Recognition via User-Guided Segmentation	Nov 23, 2024	Objectobject-detection	—Unverified	0
An Application-Agnostic Automatic Target Recognition System Using Vision Language Models	Nov 5, 2024	object-detectionObject Detection	—Unverified	0
Open-Vocabulary Object Detection via Language Hierarchy	Oct 27, 2024	Objectobject-detection	—Unverified	0
Few-shot target-driven instance detection based on open-vocabulary object detection models	Oct 21, 2024	Image AugmentationObject	—Unverified	0

Show:10 25 50

← PrevPage 4 of 6Next →

All datasets MSCOCO LVIS v1.0 Objects365 OpenImages-v4

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Cooperative Foundational Models	AP 0.5	50.3	—	Unverified
2	DE-ViT	AP 0.5	50	—	Unverified
3	Yolov8-nano	AP 0.5	47.2	—	Unverified
4	DITO	AP 0.5	46.1	—	Unverified
5	OV-DQUO(RN50x4)	AP 0.5	45.6	—	Unverified
6	LP-OVOD (OWL-ViT Proposals)	AP 0.5	44.9	—	Unverified
7	CLIPSelf	AP 0.5	44.3	—	Unverified
8	CORA+	AP 0.5	43.1	—	Unverified
9	BARON	AP 0.5	42.7	—	Unverified
10	SIA-OVD (RN50x4)	AP 0.5	41.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LaMI-DETR	AP novel-LVIS base training	43.4	—	Unverified
2	DITO	AP novel-LVIS base training	40.4	—	Unverified
3	OV-DQUO(ViT-L/14)	AP novel-LVIS base training	39.3	—	Unverified
4	CoDet (EVA02-L)	AP novel-LVIS base training	37	—	Unverified
5	CLIPSelf	AP novel-LVIS base training	34.9	—	Unverified
6	OVMR	AP novel-LVIS base training	34.4	—	Unverified
7	DE-ViT	AP novel-LVIS base training	34.3	—	Unverified
8	CFM-ViT	AP novel-LVIS base training	33.9	—	Unverified
9	CLIM (RN50x64)	AP novel-LVIS base training	32.3	—	Unverified
10	RO-ViT	AP novel-LVIS base training	32.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Object-Centric-OVD	mask AP50	22.3	—	Unverified
2	ViLD	mask AP50	18.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Object-Centric-OVD	mask AP50	42.9	—	Unverified
2	Detic	mask AP50	42.2	—	Unverified