SOTAVerified

Open Vocabulary Object Detection

Open-vocabulary detection (OVD) aims to generalize beyond the limited number of base classes labeled during the training phase. The goal is to detect novel classes defined by an unbounded (open) vocabulary at inference.

Papers

Showing 101145 of 145 papers

TitleStatusHype
Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text DescribabilityCode0
LUDVIG: Learning-free Uplifting of 2D Visual features to Gaussian Splatting scenes0
Boosting Open-Vocabulary Object Detection by Handling Background Samples0
VOVTrack: Exploring the Potentiality in Videos for Open-Vocabulary Object Tracking0
Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval0
HA-FGOVD: Highlighting Fine-grained Attributes via Explicit Linear Composition for Open-Vocabulary Object Detection0
End-to-end Open-vocabulary Video Visual Relationship Detection using Multi-modal Prompting0
A Lightweight Modular Framework for Low-Cost Open-Vocabulary Object Detection TrainingCode0
On the Potential of Open-Vocabulary Models for Object Detection in Unusual Street Scenes0
Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP InversionCode0
BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs0
V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results0
Open-Vocabulary X-ray Prohibited Item Detection via Fine-tuning CLIP0
Enhanced Object Detection: A Study on Vast Vocabulary Object Detection Track for V3Det Challenge 20240
Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection0
Open-Vocabulary Object Detection via Neighboring Region Attention Alignment0
Watch Your Step: Optimal Retrieval for Continual Learning at Scale0
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection0
Open-Vocabulary Object Detectors: Robustness Challenges under Distribution Shifts0
Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive Optimization0
LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors0
LCV2: An Efficient Pretraining-Free Framework for Grounded Visual Question Answering0
Exploring Region-Word Alignment in Built-in Detector for Open-Vocabulary Object Detection0
Scene-adaptive and Region-aware Multi-modal Prompt for Open Vocabulary Object Detection0
Generating Enhanced Negatives for Training Language-Based Object DetectorsCode0
Weakly Supervised Open-Vocabulary Object Detection0
Learning Pseudo-Labeler beyond Noun Concepts for Open-Vocabulary Object Detection0
Spuriosity Rankings for Free: A Simple Framework for Last Layer Retraining Based on Object Detection0
YOLOv8-Based Visual Detection of Road Hazards: Potholes, Sewer Covers, and Manholes0
Region-centric Image-Language Pretraining for Open-Vocabulary DetectionCode0
EdaDet: Open-Vocabulary Object Detection Using Early Dense Alignment0
Contrastive Feature Masking Open-Vocabulary Vision Transformer0
Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection0
Open-Vocabulary Object Detection via Scene Graph Discovery0
Scaling Open-Vocabulary Object DetectionCode0
DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment0
MaMMUT: A Simple Architecture for Joint Learning for MultiModal TasksCode0
Prompt-Guided Transformers for End-to-End Open-Vocabulary Object Detection0
Open-Vocabulary Object Detection using Pseudo Caption Labels0
Investigating the Role of Attribute Context in Vision-Language Models for Object Recognition and Detection0
Open-Vocabulary Object Detection With an Open Corpus0
Learning to Detect and Segment for Open Vocabulary Object Detection0
Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection0
F-VLM: Open-Vocabulary Object Detection upon Frozen Vision and Language ModelsCode0
Simple Open-Vocabulary Object Detection with Vision TransformersCode0
Show:102550
← PrevPage 3 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Cooperative Foundational ModelsAP 0.550.3Unverified
2DE-ViTAP 0.550Unverified
3Yolov8-nanoAP 0.547.2Unverified
4DITOAP 0.546.1Unverified
5OV-DQUO(RN50x4)AP 0.545.6Unverified
6LP-OVOD (OWL-ViT Proposals)AP 0.544.9Unverified
7CLIPSelfAP 0.544.3Unverified
8CORA+AP 0.543.1Unverified
9BARONAP 0.542.7Unverified
10SIA-OVD (RN50x4)AP 0.541.9Unverified
#ModelMetricClaimedVerifiedStatus
1LaMI-DETRAP novel-LVIS base training43.4Unverified
2DITOAP novel-LVIS base training40.4Unverified
3OV-DQUO(ViT-L/14)AP novel-LVIS base training39.3Unverified
4CoDet (EVA02-L)AP novel-LVIS base training37Unverified
5CLIPSelfAP novel-LVIS base training34.9Unverified
6OVMRAP novel-LVIS base training34.4Unverified
7DE-ViTAP novel-LVIS base training34.3Unverified
8CFM-ViTAP novel-LVIS base training33.9Unverified
9CLIM (RN50x64)AP novel-LVIS base training32.3Unverified
10RO-ViTAP novel-LVIS base training32.1Unverified
#ModelMetricClaimedVerifiedStatus
1Object-Centric-OVDmask AP5022.3Unverified
2ViLDmask AP5018.2Unverified
#ModelMetricClaimedVerifiedStatus
1Object-Centric-OVDmask AP5042.9Unverified
2Deticmask AP5042.2Unverified