SOTAVerified|Agents Browse Leaderboard About

Described Object Detection

Described Object Detection (DOD) detects all instances on each image in the dataset, based on a flexible reference. It is a superset of Open-Vocabulary Object Detection (OVD) and Referring Expression Comprehension (REC). It expands category names to flexible language expressions for OVD and overcomes the limitation of REC only grounding the pre-existing object. Works related to DOD are tracked in awesome-DOD list on github.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–8 of 8 papers

Title	Date	Tasks	Status	Hype
An Open and Comprehensive Pipeline for Unified Object Grounding and Detection	Jan 4, 2024	Described Object DetectionPhrase Grounding	CodeCode Available	1
SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models	Nov 13, 2023	Described Object DetectionLanguage Modeling	CodeCode Available	4
Described Object Detection: Liberating Object Detection with Flexible Expressions	Jul 24, 2023	Binary ClassificationDescribed Object Detection	CodeCode Available	1
CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching	Mar 23, 2023	Described Object Detectionobject-detection	CodeCode Available	1
Universal Instance Perception as Object Discovery and Retrieval	Mar 12, 2023	Described Object DetectionGeneralized Referring Expression Comprehension	CodeCode Available	3
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone	Jun 15, 2022	Described Object DetectionImage Captioning	CodeCode Available	1
Simple Open-Vocabulary Object Detection with Vision Transformers	May 12, 2022	Described Object Detectionimage-classification	CodeCode Available	0
Grounded Language-Image Pre-training	Dec 7, 2021	2D Object DetectionDescribed Object Detection	CodeCode Available	2

Show:10 25 50

No leaderboard results yet.