Open Vocabulary Object Detection

Open-vocabulary detection (OVD) aims to generalize beyond the limited number of base classes labeled during the training phase. The goal is to detect novel classes defined by an unbounded (open) vocabulary at inference.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–75 of 145 papers

Title	Date	Tasks	Status	Hype	Score
PointCLIP: Point Cloud Understanding by CLIP	Dec 4, 2021	3D Open-Vocabulary Instance SegmentationFew-Shot Learning	CodeCode Available	1	5
ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for Open-Vocabulary Object Detection	Dec 12, 2023	object-detectionObject Detection	CodeCode Available	1	5
Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers	May 11, 2023	Contrastive LearningImage-text Retrieval	CodeCode Available	1	5
RegionCLIP: Region-based Language-Image Pretraining	Dec 16, 2021	image-classificationImage Classification	CodeCode Available	1	5
Retrieval-Augmented Open-Vocabulary Object Detection	Apr 8, 2024	Language ModelingLanguage Modelling	CodeCode Available	1	5
RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection	May 30, 2024	Image CaptioningImage Inpainting	CodeCode Available	1	5
CLIM: Contrastive Language-Image Mosaic for Region Representation	Dec 18, 2023	Objectobject-detection	CodeCode Available	1	5
SIA-OVD: Shape-Invariant Adapter for Bridging the Image-Region Gap in Open-Vocabulary Detection	Oct 8, 2024	object-detectionObject Detection	CodeCode Available	1	5
Simple Image-level Classification Improves Open-vocabulary Object Detection	Dec 16, 2023	Knowledge DistillationObject	CodeCode Available	1	5
Superpowering Open-Vocabulary Object Detectors for X-ray Vision	Mar 21, 2025	object-detectionObject Detection	CodeCode Available	1	5
The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understanding	Nov 29, 2023	Objectobject-detection	CodeCode Available	1	5
The devil is in the object boundary: towards annotation-free instance segmentation using Foundation Models	Apr 18, 2024	Instance SegmentationObject	CodeCode Available	1	5
Open Vocabulary Object Detection with Pseudo Bounding-Box Labels	Nov 18, 2021	Objectobject-detection	CodeCode Available	1	5
Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation	Apr 12, 2024	Objectobject-detection	CodeCode Available	1	5
MoCaE: Mixture of Calibrated Experts Significantly Improves Object Detection	Sep 26, 2023	Instance SegmentationMixture-of-Experts	CodeCode Available	1	5
Enhancing Novel Object Detection via Cooperative Foundational Models	Nov 19, 2023	Novel Class DiscoveryNovel Object Detection	CodeCode Available	1	5
CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching	Mar 23, 2023	Described Object Detectionobject-detection	CodeCode Available	1	5
Multi-Modal Classifiers for Open-Vocabulary Object Detection	Jun 8, 2023	Language ModellingLarge Language Model	CodeCode Available	1	5
Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection	Mar 10, 2023	ObjectOpen-vocabulary object detection	CodeCode Available	1	5
Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention	Nov 18, 2023	Concept AlignmentGraph Generation	CodeCode Available	1	5
Exploiting Unlabeled Data with Vision and Language Models for Object Detection	Jul 18, 2022	Objectobject-detection	CodeCode Available	1	5
Open-vocabulary Attribute Detection	Nov 23, 2022	AttributeLanguage Modeling	CodeCode Available	1	5
Open-Vocabulary Object Detection Using Captions	Nov 20, 2020	Objectobject-detection	CodeCode Available	1	5
DART: An Automated End-to-End Object Detection Pipeline with Data Diversification, Open-Vocabulary Bounding Box Annotation, Pseudo-Label Review, and Model Training	Jul 12, 2024	Image GenerationObject	CodeCode Available	1	5
A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection	Mar 13, 2025	object-detectionObject Detection	CodeCode Available	1	5

Show:10 25 50

← PrevPage 3 of 6Next →

All datasets MSCOCO LVIS v1.0 Objects365 OpenImages-v4

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Cooperative Foundational Models	AP 0.5	50.3	—	Unverified
2	DE-ViT	AP 0.5	50	—	Unverified
3	Yolov8-nano	AP 0.5	47.2	—	Unverified
4	DITO	AP 0.5	46.1	—	Unverified
5	OV-DQUO(RN50x4)	AP 0.5	45.6	—	Unverified
6	LP-OVOD (OWL-ViT Proposals)	AP 0.5	44.9	—	Unverified
7	CLIPSelf	AP 0.5	44.3	—	Unverified
8	CORA+	AP 0.5	43.1	—	Unverified
9	BARON	AP 0.5	42.7	—	Unverified
10	SIA-OVD (RN50x4)	AP 0.5	41.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LaMI-DETR	AP novel-LVIS base training	43.4	—	Unverified
2	DITO	AP novel-LVIS base training	40.4	—	Unverified
3	OV-DQUO(ViT-L/14)	AP novel-LVIS base training	39.3	—	Unverified
4	CoDet (EVA02-L)	AP novel-LVIS base training	37	—	Unverified
5	CLIPSelf	AP novel-LVIS base training	34.9	—	Unverified
6	OVMR	AP novel-LVIS base training	34.4	—	Unverified
7	DE-ViT	AP novel-LVIS base training	34.3	—	Unverified
8	CFM-ViT	AP novel-LVIS base training	33.9	—	Unverified
9	CLIM (RN50x64)	AP novel-LVIS base training	32.3	—	Unverified
10	RO-ViT	AP novel-LVIS base training	32.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Object-Centric-OVD	mask AP50	22.3	—	Unverified
2	ViLD	mask AP50	18.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Object-Centric-OVD	mask AP50	42.9	—	Unverified
2	Detic	mask AP50	42.2	—	Unverified