Open Vocabulary Object Detection

Open-vocabulary detection (OVD) aims to generalize beyond the limited set of base classes labeled during training. The goal is to detect novel classes, defined by an unbounded (open) vocabulary, at inference time.

Papers

Title	Date	Tasks	Status
Fine-Grained Open-Vocabulary Object Detection with Fined-Grained Prompts: Task, Dataset and Benchmark	Mar 19, 2025	Object, object-detection	Unverified
From Data to Modeling: Fully Open-vocabulary Scene Graph Generation	May 26, 2025	Graph Generation, Knowledge Distillation	Unverified
Gen-n-Val: Agentic Image Data Generation and Validation	Jun 5, 2025	Image Harmonization, Instance Segmentation	Unverified
HA-FGOVD: Highlighting Fine-grained Attributes via Explicit Linear Composition for Open-Vocabulary Object Detection	Sep 24, 2024	Attribute, object-detection	Unverified
Open-Vocabulary Object Detectors: Robustness Challenges under Distribution Shifts	Apr 1, 2024	Object, object-detection	Unverified
LCV2: An Efficient Pretraining-Free Framework for Grounded Visual Question Answering	Jan 29, 2024	Language Modeling, Language Modelling	Unverified
Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection	Jun 1, 2024	Knowledge Distillation, Object	Unverified
Learning Pseudo-Labeler beyond Noun Concepts for Open-Vocabulary Object Detection	Dec 4, 2023	Image to text, object-detection	Unverified
Learning the RoPEs: Better 2D and 3D Position Encodings with STRING	Feb 4, 2025	object-detection, Object Detection	Unverified
Learning to Detect and Segment for Open Vocabulary Object Detection	Dec 23, 2022	Object, object-detection	Unverified
LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors	Feb 7, 2024	image-classification, Image Classification	Unverified
LUDVIG: Learning-free Uplifting of 2D Visual features to Gaussian Splatting scenes	Oct 18, 2024	3D geometry, object-detection	Unverified
Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection	Jan 28, 2025	object-detection, Object Detection	Unverified
MQADet: A Plug-and-Play Paradigm for Enhancing Open-Vocabulary Object Detection via Multimodal Question Answering	Feb 23, 2025	Object, object-detection	Unverified
On the Potential of Open-Vocabulary Models for Object Detection in Unusual Street Scenes	Aug 20, 2024	Object, object-detection	Unverified
OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images	Mar 8, 2025	Object, object-detection	Unverified
Open-Vocabulary Object Detection using Pseudo Caption Labels	Mar 23, 2023	Image Captioning, Knowledge Distillation	Unverified
Open-Vocabulary Object Detection via Neighboring Region Attention Alignment	May 14, 2024	Diversity, object-detection	Unverified
Open-Vocabulary Object Detection via Language Hierarchy	Oct 27, 2024	Object, object-detection	Unverified
Open-Vocabulary Object Detection With an Open Corpus	Jan 1, 2023	Object, object-detection	Unverified
Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive Optimization	Mar 14, 2024	Contrastive Learning, Knowledge Distillation	Unverified
Open-Vocabulary X-ray Prohibited Item Detection via Fine-tuning CLIP	Jun 16, 2024	object-detection, Object Detection	Unverified
Open-World Objectness Modeling Unifies Novel Object Detection	Jan 1, 2025	Novel Object Detection, object-detection	Unverified
Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection	Nov 2, 2022	Object, object-detection	Unverified
Prompt-Guided Transformers for End-to-End Open-Vocabulary Object Detection	Mar 25, 2023	Decoder, object-detection	Unverified

Datasets: MSCOCO, LVIS v1.0, Objects365, OpenImages-v4

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Cooperative Foundational Models	AP 0.5	50.3	—	Unverified
2	DE-ViT	AP 0.5	50	—	Unverified
3	Yolov8-nano	AP 0.5	47.2	—	Unverified
4	DITO	AP 0.5	46.1	—	Unverified
5	OV-DQUO(RN50x4)	AP 0.5	45.6	—	Unverified
6	LP-OVOD (OWL-ViT Proposals)	AP 0.5	44.9	—	Unverified
7	CLIPSelf	AP 0.5	44.3	—	Unverified
8	CORA+	AP 0.5	43.1	—	Unverified
9	BARON	AP 0.5	42.7	—	Unverified
10	SIA-OVD (RN50x4)	AP 0.5	41.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LaMI-DETR	AP novel-LVIS base training	43.4	—	Unverified
2	DITO	AP novel-LVIS base training	40.4	—	Unverified
3	OV-DQUO(ViT-L/14)	AP novel-LVIS base training	39.3	—	Unverified
4	CoDet (EVA02-L)	AP novel-LVIS base training	37	—	Unverified
5	CLIPSelf	AP novel-LVIS base training	34.9	—	Unverified
6	OVMR	AP novel-LVIS base training	34.4	—	Unverified
7	DE-ViT	AP novel-LVIS base training	34.3	—	Unverified
8	CFM-ViT	AP novel-LVIS base training	33.9	—	Unverified
9	CLIM (RN50x64)	AP novel-LVIS base training	32.3	—	Unverified
10	RO-ViT	AP novel-LVIS base training	32.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Object-Centric-OVD	mask AP50	22.3	—	Unverified
2	ViLD	mask AP50	18.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Object-Centric-OVD	mask AP50	42.9	—	Unverified
2	Detic	mask AP50	42.2	—	Unverified