SOTAVerified

Open Vocabulary Object Detection

Open-vocabulary detection (OVD) aims to generalize beyond the limited number of base classes labeled during the training phase. The goal is to detect novel classes defined by an unbounded (open) vocabulary at inference.

Papers

Showing 110 of 145 papers

TitleStatusHype
ATAS: Any-to-Any Self-Distillation for Enhanced Open-Vocabulary Dense Prediction0
Gen-n-Val: Agentic Image Data Generation and Validation0
From Data to Modeling: Fully Open-vocabulary Scene Graph Generation0
FG-CLIP: Fine-Grained Visual and Textual AlignmentCode4
VLM-R1: A Stable and Generalizable R1-style Large Vision-Language ModelCode9
Superpowering Open-Vocabulary Object Detectors for X-ray VisionCode1
An Iterative Feedback Mechanism for Improving Natural Language Class Descriptions in Open-Vocabulary Object Detection0
Fine-Grained Open-Vocabulary Object Detection with Fined-Grained Prompts: Task, Dataset and Benchmark0
LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data GenerationCode0
Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object DetectionCode0
Show:102550
← PrevPage 1 of 15Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1LaMI-DETRAP novel-LVIS base training43.4Unverified
2DITOAP novel-LVIS base training40.4Unverified
3OV-DQUO(ViT-L/14)AP novel-LVIS base training39.3Unverified
4CoDet (EVA02-L)AP novel-LVIS base training37Unverified
5CLIPSelfAP novel-LVIS base training34.9Unverified
6OVMRAP novel-LVIS base training34.4Unverified
7DE-ViTAP novel-LVIS base training34.3Unverified
8CFM-ViTAP novel-LVIS base training33.9Unverified
9CLIM (RN50x64)AP novel-LVIS base training32.3Unverified
10RO-ViTAP novel-LVIS base training32.1Unverified