Open Vocabulary Object Detection

Open-vocabulary detection (OVD) aims to generalize beyond the limited number of base classes labeled during the training phase. The goal is to detect novel classes defined by an unbounded (open) vocabulary at inference.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 101–145 of 145 papers

Title	Date	Tasks	Status
Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability	Oct 20, 2024	Few-Shot Object Detectionimage-classification	CodeCode Available
LUDVIG: Learning-free Uplifting of 2D Visual features to Gaussian Splatting scenes	Oct 18, 2024	3D geometryobject-detection	—Unverified
Boosting Open-Vocabulary Object Detection by Handling Background Samples	Oct 11, 2024	object-detectionObject Detection	—Unverified
VOVTrack: Exploring the Potentiality in Videos for Open-Vocabulary Object Tracking	Oct 11, 2024	Multi-Object TrackingObject	—Unverified
Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval	Sep 26, 2024	Image RetrievalObject	—Unverified
HA-FGOVD: Highlighting Fine-grained Attributes via Explicit Linear Composition for Open-Vocabulary Object Detection	Sep 24, 2024	Attributeobject-detection	—Unverified
End-to-end Open-vocabulary Video Visual Relationship Detection using Multi-modal Prompting	Sep 19, 2024	DecoderObject	—Unverified
A Lightweight Modular Framework for Low-Cost Open-Vocabulary Object Detection Training	Aug 20, 2024	Autonomous VehiclesComputational Efficiency	CodeCode Available
On the Potential of Open-Vocabulary Models for Object Detection in Unusual Street Scenes	Aug 20, 2024	Objectobject-detection	—Unverified
Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion	Jul 15, 2024	image-classificationImage Classification	CodeCode Available
BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs	Jul 3, 2024	Image CaptioningImage Generation	—Unverified
V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results	Jun 17, 2024	Objectobject-detection	—Unverified
Open-Vocabulary X-ray Prohibited Item Detection via Fine-tuning CLIP	Jun 16, 2024	object-detectionObject Detection	—Unverified
Enhanced Object Detection: A Study on Vast Vocabulary Object Detection Track for V3Det Challenge 2024	Jun 13, 2024	Objectobject-detection	—Unverified
Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection	Jun 1, 2024	Knowledge DistillationObject	—Unverified
Open-Vocabulary Object Detection via Neighboring Region Attention Alignment	May 14, 2024	Diversityobject-detection	—Unverified
Watch Your Step: Optimal Retrieval for Continual Learning at Scale	Apr 16, 2024	Continual Learningobject-detection	—Unverified
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection	Apr 14, 2024	Dense CaptioningLanguage Modelling	—Unverified
Open-Vocabulary Object Detectors: Robustness Challenges under Distribution Shifts	Apr 1, 2024	Objectobject-detection	—Unverified
Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive Optimization	Mar 14, 2024	Contrastive LearningKnowledge Distillation	—Unverified
LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors	Feb 7, 2024	image-classificationImage Classification	—Unverified
LCV2: An Efficient Pretraining-Free Framework for Grounded Visual Question Answering	Jan 29, 2024	Language ModelingLanguage Modelling	—Unverified
Exploring Region-Word Alignment in Built-in Detector for Open-Vocabulary Object Detection	Jan 1, 2024	Decoderobject-detection	—Unverified
Scene-adaptive and Region-aware Multi-modal Prompt for Open Vocabulary Object Detection	Jan 1, 2024	Knowledge Distillationobject-detection	—Unverified
Generating Enhanced Negatives for Training Language-Based Object Detectors	Dec 29, 2023	Objectobject-detection	CodeCode Available
Weakly Supervised Open-Vocabulary Object Detection	Dec 19, 2023	AttributeNovel Concepts	—Unverified
Learning Pseudo-Labeler beyond Noun Concepts for Open-Vocabulary Object Detection	Dec 4, 2023	Image to textobject-detection	—Unverified
Spuriosity Rankings for Free: A Simple Framework for Last Layer Retraining Based on Object Detection	Oct 31, 2023	Objectobject-detection	—Unverified
YOLOv8-Based Visual Detection of Road Hazards: Potholes, Sewer Covers, and Manholes	Oct 31, 2023	Computational Efficiencyobject-detection	—Unverified
Region-centric Image-Language Pretraining for Open-Vocabulary Detection	Sep 29, 2023	Contrastive LearningObject	CodeCode Available
EdaDet: Open-Vocabulary Object Detection Using Early Dense Alignment	Sep 3, 2023	Objectobject-detection	—Unverified
Contrastive Feature Masking Open-Vocabulary Vision Transformer	Sep 2, 2023	Contrastive LearningImage-text Retrieval	—Unverified
Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection	Aug 30, 2023	Knowledge DistillationLanguage Modeling	—Unverified
Open-Vocabulary Object Detection via Scene Graph Discovery	Jul 7, 2023	DecoderGraph Generation	—Unverified
Scaling Open-Vocabulary Object Detection	Jun 16, 2023	image-classificationImage Classification	CodeCode Available
DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment	Apr 10, 2023	Language Modellingobject-detection	—Unverified
MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks	Mar 29, 2023	Cross-Modal RetrievalDecoder	CodeCode Available
Prompt-Guided Transformers for End-to-End Open-Vocabulary Object Detection	Mar 25, 2023	Decoderobject-detection	—Unverified
Open-Vocabulary Object Detection using Pseudo Caption Labels	Mar 23, 2023	Image CaptioningKnowledge Distillation	—Unverified
Investigating the Role of Attribute Context in Vision-Language Models for Object Recognition and Detection	Mar 17, 2023	AttributeContrastive Learning	—Unverified
Open-Vocabulary Object Detection With an Open Corpus	Jan 1, 2023	Objectobject-detection	—Unverified
Learning to Detect and Segment for Open Vocabulary Object Detection	Dec 23, 2022	Objectobject-detection	—Unverified
Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection	Nov 2, 2022	Objectobject-detection	—Unverified
F-VLM: Open-Vocabulary Object Detection upon Frozen Vision and Language Models	Sep 30, 2022	Knowledge Distillationobject-detection	CodeCode Available
Simple Open-Vocabulary Object Detection with Vision Transformers	May 12, 2022	Described Object Detectionimage-classification	CodeCode Available

Show:10 25 50

← PrevPage 3 of 3Next →

All datasets MSCOCO LVIS v1.0 Objects365 OpenImages-v4

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Cooperative Foundational Models	AP 0.5	50.3	—	Unverified
2	DE-ViT	AP 0.5	50	—	Unverified
3	Yolov8-nano	AP 0.5	47.2	—	Unverified
4	DITO	AP 0.5	46.1	—	Unverified
5	OV-DQUO(RN50x4)	AP 0.5	45.6	—	Unverified
6	LP-OVOD (OWL-ViT Proposals)	AP 0.5	44.9	—	Unverified
7	CLIPSelf	AP 0.5	44.3	—	Unverified
8	CORA+	AP 0.5	43.1	—	Unverified
9	BARON	AP 0.5	42.7	—	Unverified
10	SIA-OVD (RN50x4)	AP 0.5	41.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LaMI-DETR	AP novel-LVIS base training	43.4	—	Unverified
2	DITO	AP novel-LVIS base training	40.4	—	Unverified
3	OV-DQUO(ViT-L/14)	AP novel-LVIS base training	39.3	—	Unverified
4	CoDet (EVA02-L)	AP novel-LVIS base training	37	—	Unverified
5	CLIPSelf	AP novel-LVIS base training	34.9	—	Unverified
6	OVMR	AP novel-LVIS base training	34.4	—	Unverified
7	DE-ViT	AP novel-LVIS base training	34.3	—	Unverified
8	CFM-ViT	AP novel-LVIS base training	33.9	—	Unverified
9	CLIM (RN50x64)	AP novel-LVIS base training	32.3	—	Unverified
10	RO-ViT	AP novel-LVIS base training	32.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Object-Centric-OVD	mask AP50	22.3	—	Unverified
2	ViLD	mask AP50	18.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Object-Centric-OVD	mask AP50	42.9	—	Unverified
2	Detic	mask AP50	42.2	—	Unverified