SOTAVerified

Open Vocabulary Object Detection

Open-vocabulary detection (OVD) aims to generalize beyond the limited number of base classes labeled during the training phase. The goal is to detect novel classes defined by an unbounded (open) vocabulary at inference.

Papers

Showing 5175 of 145 papers

TitleStatusHype
Open-Vocabulary X-ray Prohibited Item Detection via Fine-tuning CLIP0
Enhanced Object Detection: A Study on Vast Vocabulary Object Detection Track for V3Det Challenge 20240
OVMR: Open-Vocabulary Recognition with Multi-Modal ReferencesCode1
Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection0
RTGen: Generating Region-Text Pairs for Open-Vocabulary Object DetectionCode1
OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects SupervisionCode1
SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object DetectionCode2
Open-Vocabulary Object Detection via Neighboring Region Attention Alignment0
The devil is in the object boundary: towards annotation-free instance segmentation using Foundation ModelsCode1
Watch Your Step: Optimal Retrieval for Continual Learning at Scale0
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection0
Training-free Boost for Open-Vocabulary Object Detection with Confidence AggregationCode1
Retrieval-Augmented Open-Vocabulary Object DetectionCode1
Is CLIP the main roadblock for fine-grained open-world perception?Code2
Open-Vocabulary Object Detectors: Robustness Challenges under Distribution Shifts0
VisionGPT: LLM-Assisted Real-Time Anomaly Detection for Safe Visual NavigationCode1
Generative Region-Language Pretraining for Open-Ended Object DetectionCode2
Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive Optimization0
Real-time Transformer-based Open-Vocabulary Detection with Efficient Fusion HeadCode5
YOLOv8-AM: YOLOv8 Based on Effective Attention Mechanisms for Pediatric Wrist Fracture DetectionCode2
LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors0
Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object DetectorCode2
YOLO-World: Real-Time Open-Vocabulary Object DetectionCode9
LCV2: An Efficient Pretraining-Free Framework for Grounded Visual Question Answering0
Scene-adaptive and Region-aware Multi-modal Prompt for Open Vocabulary Object Detection0
Show:102550
← PrevPage 3 of 6Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Cooperative Foundational ModelsAP 0.550.3Unverified
2DE-ViTAP 0.550Unverified
3Yolov8-nanoAP 0.547.2Unverified
4DITOAP 0.546.1Unverified
5OV-DQUO(RN50x4)AP 0.545.6Unverified
6LP-OVOD (OWL-ViT Proposals)AP 0.544.9Unverified
7CLIPSelfAP 0.544.3Unverified
8CORA+AP 0.543.1Unverified
9BARONAP 0.542.7Unverified
10SIA-OVD (RN50x4)AP 0.541.9Unverified
#ModelMetricClaimedVerifiedStatus
1LaMI-DETRAP novel-LVIS base training43.4Unverified
2DITOAP novel-LVIS base training40.4Unverified
3OV-DQUO(ViT-L/14)AP novel-LVIS base training39.3Unverified
4CoDet (EVA02-L)AP novel-LVIS base training37Unverified
5CLIPSelfAP novel-LVIS base training34.9Unverified
6OVMRAP novel-LVIS base training34.4Unverified
7DE-ViTAP novel-LVIS base training34.3Unverified
8CFM-ViTAP novel-LVIS base training33.9Unverified
9CLIM (RN50x64)AP novel-LVIS base training32.3Unverified
10RO-ViTAP novel-LVIS base training32.1Unverified
#ModelMetricClaimedVerifiedStatus
1Object-Centric-OVDmask AP5022.3Unverified
2ViLDmask AP5018.2Unverified
#ModelMetricClaimedVerifiedStatus
1Object-Centric-OVDmask AP5042.9Unverified
2Deticmask AP5042.2Unverified