Open Vocabulary Object Detection

Open-vocabulary detection (OVD) aims to generalize beyond the limited number of base classes labeled during the training phase. The goal is to detect novel classes defined by an unbounded (open) vocabulary at inference.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 101–125 of 145 papers

Title	Date	Tasks	Status	Hype
OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation	Sep 1, 2023	3D Open-Vocabulary Instance Segmentation3D Open-Vocabulary Object Detection	CodeCode Available	2
Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection	Aug 30, 2023	Knowledge DistillationLanguage Modeling	—Unverified	0
Taming Self-Training for Open-Vocabulary Object Detection	Aug 11, 2023	Objectobject-detection	CodeCode Available	1
Described Object Detection: Liberating Object Detection with Flexible Expressions	Jul 24, 2023	Binary ClassificationDescribed Object Detection	CodeCode Available	1
Open-Vocabulary Object Detection via Scene Graph Discovery	Jul 7, 2023	DecoderGraph Generation	—Unverified	0
Scaling Open-Vocabulary Object Detection	Jun 16, 2023	image-classificationImage Classification	CodeCode Available	0
Multi-Modal Classifiers for Open-Vocabulary Object Detection	Jun 8, 2023	Language ModellingLarge Language Model	CodeCode Available	1
Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers	May 11, 2023	Contrastive LearningImage-text Retrieval	CodeCode Available	1
DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment	Apr 10, 2023	Language Modellingobject-detection	—Unverified	0
V3Det: Vast Vocabulary Visual Detection Dataset	Apr 7, 2023	ChatbotObject	CodeCode Available	1
MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks	Mar 29, 2023	Cross-Modal RetrievalDecoder	CodeCode Available	0
Prompt-Guided Transformers for End-to-End Open-Vocabulary Object Detection	Mar 25, 2023	Decoderobject-detection	—Unverified	0
CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching	Mar 23, 2023	Described Object Detectionobject-detection	CodeCode Available	1
Open-Vocabulary Object Detection using Pseudo Caption Labels	Mar 23, 2023	Image CaptioningKnowledge Distillation	—Unverified	0
Investigating the Role of Attribute Context in Vision-Language Models for Object Recognition and Detection	Mar 17, 2023	AttributeContrastive Learning	—Unverified	0
Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection	Mar 10, 2023	ObjectOpen-vocabulary object detection	CodeCode Available	1
Aligning Bag of Regions for Open-Vocabulary Object Detection	Feb 27, 2023	Objectobject-detection	CodeCode Available	1
OvarNet: Towards Open-vocabulary Object Attribute Recognition	Jan 23, 2023	AttributeKnowledge Distillation	CodeCode Available	1
Open-Vocabulary Object Detection With an Open Corpus	Jan 1, 2023	Objectobject-detection	—Unverified	0
Distilling DETR with Visual-Linguistic Knowledge for Open-Vocabulary Object Detection	Jan 1, 2023	Knowledge DistillationLanguage Modeling	CodeCode Available	1
Learning To Generate Language-Supervised and Open-Vocabulary Scene Graph Using Pre-Trained Visual-Semantic Space	Jan 1, 2023	Graph Generationobject-detection	CodeCode Available	1
Learning to Detect and Segment for Open Vocabulary Object Detection	Dec 23, 2022	Objectobject-detection	—Unverified	0
X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusion	Dec 7, 2022	Data AugmentationInstance Segmentation	CodeCode Available	1
Learning Object-Language Alignments for Open-Vocabulary Object Detection	Nov 27, 2022	Objectobject-detection	CodeCode Available	1
Open-vocabulary Attribute Detection	Nov 23, 2022	AttributeLanguage Modeling	CodeCode Available	1

Show:10 25 50

← PrevPage 5 of 6Next →

All datasets MSCOCO LVIS v1.0 Objects365 OpenImages-v4

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Cooperative Foundational Models	AP 0.5	50.3	—	Unverified
2	DE-ViT	AP 0.5	50	—	Unverified
3	Yolov8-nano	AP 0.5	47.2	—	Unverified
4	DITO	AP 0.5	46.1	—	Unverified
5	OV-DQUO(RN50x4)	AP 0.5	45.6	—	Unverified
6	LP-OVOD (OWL-ViT Proposals)	AP 0.5	44.9	—	Unverified
7	CLIPSelf	AP 0.5	44.3	—	Unverified
8	CORA+	AP 0.5	43.1	—	Unverified
9	BARON	AP 0.5	42.7	—	Unverified
10	SIA-OVD (RN50x4)	AP 0.5	41.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LaMI-DETR	AP novel-LVIS base training	43.4	—	Unverified
2	DITO	AP novel-LVIS base training	40.4	—	Unverified
3	OV-DQUO(ViT-L/14)	AP novel-LVIS base training	39.3	—	Unverified
4	CoDet (EVA02-L)	AP novel-LVIS base training	37	—	Unverified
5	CLIPSelf	AP novel-LVIS base training	34.9	—	Unverified
6	OVMR	AP novel-LVIS base training	34.4	—	Unverified
7	DE-ViT	AP novel-LVIS base training	34.3	—	Unverified
8	CFM-ViT	AP novel-LVIS base training	33.9	—	Unverified
9	CLIM (RN50x64)	AP novel-LVIS base training	32.3	—	Unverified
10	RO-ViT	AP novel-LVIS base training	32.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Object-Centric-OVD	mask AP50	22.3	—	Unverified
2	ViLD	mask AP50	18.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Object-Centric-OVD	mask AP50	42.9	—	Unverified
2	Detic	mask AP50	42.2	—	Unverified