SOTAVerified

Zero-Shot Object Detection

Zero-shot object detection (ZSD) is the task of object detection where no visual training data is available for some of the target object classes.

( Image credit: Zero-Shot Object Detection: Learning to Simultaneously Recognize and Localize Novel Concepts )

Papers

Showing 125 of 57 papers

TitleStatusHype
YOLO-World: Real-Time Open-Vocabulary Object DetectionCode9
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt SynergyCode7
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object DetectionCode7
DINO-X: A Unified Vision Model for Open-World Object Detection and UnderstandingCode5
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object DetectionCode5
Real-time Transformer-based Open-Vocabulary Detection with Efficient Fusion HeadCode5
ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual ModelsCode4
VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement LearningCode4
OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective FusionCode3
Grounded Language-Image Pre-trainingCode2
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary DetectionCode2
Multi-modal Queried Object Detection in the WildCode2
ViLLA: Fine-Grained Vision-Language Representation Learning from Real-World DataCode1
DoUnseen: Tuning-Free Class-Adaptive Object Detection of Unseen Objects for Robotic GraspingCode1
Synthesizing the Unseen for Zero-shot Object DetectionCode1
Resolving Semantic Confusions for Improved Zero-Shot DetectionCode1
LangGas: Introducing Language in Selective Zero-Shot Background Subtraction for Semi-Transparent Gas Leak Detection with a New DatasetCode1
Robust Region Feature Synthesizer for Zero-Shot Object DetectionCode1
OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects SupervisionCode1
Learning Open-World Object Proposals without Learning to ClassifyCode1
Many Heads but One Brain: Fusion Brain -- a Competition and a Single Multimodal Multitask ArchitectureCode1
DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLMCode1
Polarity Loss for Zero-shot Object DetectionCode1
Background Learnable Cascade for Zero-Shot Object DetectionCode1
SeeDS: Semantic Separable Diffusion Synthesizer for Zero-shot Food DetectionCode1
Show:102550
← PrevPage 1 of 3Next →

No leaderboard results yet.