Open Vocabulary Object Detection

Open-vocabulary detection (OVD) aims to generalize beyond the limited set of base classes labeled during training. The goal is to detect novel classes, specified at inference time from an unbounded (open) vocabulary.
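As a rough illustration of the idea, the sketch below labels a region by comparing its embedding against text embeddings of class names, so the vocabulary can be extended at inference without retraining. Everything in it is an assumption made for illustration: the function names `cosine` and `classify_region` are hypothetical, the 3-dimensional vectors are hand-made toy values, and the threshold is arbitrary. A real system would obtain both kinds of embedding from a vision-language model.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def classify_region(region_embedding, class_embeddings, threshold=0.5):
    """Return (label, score) for the most similar class name.

    The label is None when even the best score falls below the threshold,
    i.e. the region matches nothing in the current vocabulary.
    """
    best_label, best_score = None, -1.0
    for label, embedding in class_embeddings.items():
        score = cosine(region_embedding, embedding)
        if score > best_score:
            best_label, best_score = label, score
    if best_score < threshold:
        return None, best_score
    return best_label, best_score


# Toy text embeddings (assumed values, not produced by any real model).
base_vocab = {"cat": [1.0, 0.0, 0.0], "dog": [0.0, 1.0, 0.0]}
# Adding a novel class at inference only requires a new text embedding.
open_vocab = dict(base_vocab, zebra=[0.0, 0.0, 1.0])

# Toy embedding of one detected region.
region = [0.1, 0.1, 0.9]

base_label, _ = classify_region(region, base_vocab)
open_label, _ = classify_region(region, open_vocab)
print(base_label, open_label)
```

With only the base vocabulary the region is left unlabeled; once the novel class name is added to the vocabulary, the same region embedding matches it, with no change to the detector itself.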

Papers

Showing 76–100 of 145 papers

Title	Date	Tasks	Status	Hype
PointCLIP: Point Cloud Understanding by CLIP	Dec 4, 2021	3D Open-Vocabulary Instance Segmentation, Few-Shot Learning	Code Available	1
ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for Open-Vocabulary Object Detection	Dec 12, 2023	Object Detection	Code Available	1
Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers	May 11, 2023	Contrastive Learning, Image-text Retrieval	Code Available	1
RegionCLIP: Region-based Language-Image Pretraining	Dec 16, 2021	Image Classification	Code Available	1
Retrieval-Augmented Open-Vocabulary Object Detection	Apr 8, 2024	Language Modeling	Code Available	1
RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection	May 30, 2024	Image Captioning, Image Inpainting	Code Available	1
Open-Vocabulary Object Detection via Scene Graph Discovery	Jul 7, 2023	Decoder, Graph Generation	Unverified	0
An Application-Agnostic Automatic Target Recognition System Using Vision Language Models	Nov 5, 2024	Object Detection	Unverified	0
An Iterative Feedback Mechanism for Improving Natural Language Class Descriptions in Open-Vocabulary Object Detection	Mar 21, 2025	Object Detection	Unverified	0
ATAS: Any-to-Any Self-Distillation for Enhanced Open-Vocabulary Dense Prediction	Jun 10, 2025	Object Detection	Unverified	0
BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs	Jul 3, 2024	Image Captioning, Image Generation	Unverified	0
Boosting Open-Vocabulary Object Detection by Handling Background Samples	Oct 11, 2024	Object Detection	Unverified	0
Contrastive Feature Masking Open-Vocabulary Vision Transformer	Sep 2, 2023	Contrastive Learning, Image-text Retrieval	Unverified	0
DenseVLM: A Retrieval and Decoupled Alignment Framework for Open-Vocabulary Dense Prediction	Dec 9, 2024	Image Segmentation, Object Detection	Unverified	0
DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment	Apr 10, 2023	Language Modeling, Object Detection	Unverified	0
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection	Apr 14, 2024	Dense Captioning, Language Modeling	Unverified	0
DitHub: A Modular Framework for Incremental Open-Vocabulary Object Detection	Mar 12, 2025	Object Detection	Unverified	0
EdaDet: Open-Vocabulary Object Detection Using Early Dense Alignment	Sep 3, 2023	Object, Object Detection	Unverified	0
End-to-end Open-vocabulary Video Visual Relationship Detection using Multi-modal Prompting	Sep 19, 2024	Decoder, Object	Unverified	0
Enhanced Object Detection: A Study on Vast Vocabulary Object Detection Track for V3Det Challenge 2024	Jun 13, 2024	Object, Object Detection	Unverified	0
Investigating the Role of Attribute Context in Vision-Language Models for Object Recognition and Detection	Mar 17, 2023	Attribute, Contrastive Learning	Unverified	0
Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection	Aug 30, 2023	Knowledge Distillation, Language Modeling	Unverified	0
Exploring Region-Word Alignment in Built-in Detector for Open-Vocabulary Object Detection	Jan 1, 2024	Decoder, Object Detection	Unverified	0
Few-shot target-driven instance detection based on open-vocabulary object detection models	Oct 21, 2024	Image Augmentation, Object	Unverified	0
Fine-Grained Open-Vocabulary Object Recognition via User-Guided Segmentation	Nov 23, 2024	Object, Object Detection	Unverified	0


Datasets: MSCOCO, LVIS v1.0, Objects365, OpenImages-v4

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Cooperative Foundational Models	AP 0.5	50.3	—	Unverified
2	DE-ViT	AP 0.5	50	—	Unverified
3	Yolov8-nano	AP 0.5	47.2	—	Unverified
4	DITO	AP 0.5	46.1	—	Unverified
5	OV-DQUO(RN50x4)	AP 0.5	45.6	—	Unverified
6	LP-OVOD (OWL-ViT Proposals)	AP 0.5	44.9	—	Unverified
7	CLIPSelf	AP 0.5	44.3	—	Unverified
8	CORA+	AP 0.5	43.1	—	Unverified
9	BARON	AP 0.5	42.7	—	Unverified
10	SIA-OVD (RN50x4)	AP 0.5	41.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LaMI-DETR	AP novel-LVIS base training	43.4	—	Unverified
2	DITO	AP novel-LVIS base training	40.4	—	Unverified
3	OV-DQUO(ViT-L/14)	AP novel-LVIS base training	39.3	—	Unverified
4	CoDet (EVA02-L)	AP novel-LVIS base training	37	—	Unverified
5	CLIPSelf	AP novel-LVIS base training	34.9	—	Unverified
6	OVMR	AP novel-LVIS base training	34.4	—	Unverified
7	DE-ViT	AP novel-LVIS base training	34.3	—	Unverified
8	CFM-ViT	AP novel-LVIS base training	33.9	—	Unverified
9	CLIM (RN50x64)	AP novel-LVIS base training	32.3	—	Unverified
10	RO-ViT	AP novel-LVIS base training	32.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Object-Centric-OVD	mask AP50	22.3	—	Unverified
2	ViLD	mask AP50	18.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Object-Centric-OVD	mask AP50	42.9	—	Unverified
2	Detic	mask AP50	42.2	—	Unverified