ATAS: Any-to-Any Self-Distillation for Enhanced Open-Vocabulary Dense Prediction Jun 10, 2025 object-detection Object Detection
— Unverified 0Gen-n-Val: Agentic Image Data Generation and Validation Jun 5, 2025 Image Harmonization Instance Segmentation
— Unverified 0From Data to Modeling: Fully Open-vocabulary Scene Graph Generation May 26, 2025 Graph Generation Knowledge Distillation
— Unverified 0FG-CLIP: Fine-Grained Visual and Textual Alignment May 8, 2025 Image-text Retrieval object-detection
Code Code Available 4VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model Apr 10, 2025 Language Modeling Language Modelling
Code Code Available 9Superpowering Open-Vocabulary Object Detectors for X-ray Vision Mar 21, 2025 object-detection Object Detection
Code Code Available 1An Iterative Feedback Mechanism for Improving Natural Language Class Descriptions in Open-Vocabulary Object Detection Mar 21, 2025 object-detection Object Detection
— Unverified 0Fine-Grained Open-Vocabulary Object Detection with Fined-Grained Prompts: Task, Dataset and Benchmark Mar 19, 2025 Object object-detection
— Unverified 0LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation Mar 18, 2025 Decoder Object
Code Code Available 0Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection Mar 14, 2025 object-detection Object Detection
Code Code Available 0A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection Mar 13, 2025 object-detection Object Detection
Code Code Available 1DitHub: A Modular Framework for Incremental Open-Vocabulary Object Detection Mar 12, 2025 object-detection Object Detection
— Unverified 0Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement Mar 9, 2025 Domain Generalization Object Detection
Code Code Available 4OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images Mar 8, 2025 Object object-detection
— Unverified 0Visual-RFT: Visual Reinforcement Fine-Tuning Mar 3, 2025 Few-Shot Object Detection Fine-Grained Image Classification
Code Code Available 7MQADet: A Plug-and-Play Paradigm for Enhancing Open-Vocabulary Object Detection via Multimodal Question Answering Feb 23, 2025 Object object-detection
— Unverified 0Learning the RoPEs: Better 2D and 3D Position Encodings with STRING Feb 4, 2025 object-detection Object Detection
— Unverified 0Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection Jan 28, 2025 object-detection Object Detection
— Unverified 0OW-OVD: Unified Open World and Open Vocabulary Object Detection Jan 1, 2025 Attribute Incremental Learning
Code Code Available 1Open-World Objectness Modeling Unifies Novel Object Detection Jan 1, 2025 Novel Object Detection object-detection
— Unverified 0Sampling Bag of Views for Open-Vocabulary Object Detection Dec 24, 2024 object-detection Object Detection
— Unverified 0Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object Detection Dec 23, 2024 object-detection Object Detection
Code Code Available 1DenseVLM: A Retrieval and Decoupled Alignment Framework for Open-Vocabulary Dense Prediction Dec 9, 2024 Image Segmentation object-detection
— Unverified 0From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects Nov 27, 2024 Autonomous Driving Object
Code Code Available 1Open Vocabulary Monocular 3D Object Detection Nov 25, 2024 3D Object Detection Monocular 3D Object Detection
Code Code Available 2Fine-Grained Open-Vocabulary Object Recognition via User-Guided Segmentation Nov 23, 2024 Object object-detection
— Unverified 0An Application-Agnostic Automatic Target Recognition System Using Vision Language Models Nov 5, 2024 object-detection Object Detection
— Unverified 0Open-Vocabulary Object Detection via Language Hierarchy Oct 27, 2024 Object object-detection
— Unverified 0OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking Oct 23, 2024 Multi-Object Tracking Object
Code Code Available 1Few-shot target-driven instance detection based on open-vocabulary object detection models Oct 21, 2024 Image Augmentation Object
— Unverified 0Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability Oct 20, 2024 Few-Shot Object Detection image-classification
Code Code Available 0LUDVIG: Learning-free Uplifting of 2D Visual features to Gaussian Splatting scenes Oct 18, 2024 3D geometry object-detection
— Unverified 0Boosting Open-Vocabulary Object Detection by Handling Background Samples Oct 11, 2024 object-detection Object Detection
— Unverified 0VOVTrack: Exploring the Potentiality in Videos for Open-Vocabulary Object Tracking Oct 11, 2024 Multi-Object Tracking Object
— Unverified 0SIA-OVD: Shape-Invariant Adapter for Bridging the Image-Region Gap in Open-Vocabulary Detection Oct 8, 2024 object-detection Object Detection
Code Code Available 1Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval Sep 26, 2024 Image Retrieval Object
— Unverified 0HA-FGOVD: Highlighting Fine-grained Attributes via Explicit Linear Composition for Open-Vocabulary Object Detection Sep 24, 2024 Attribute object-detection
— Unverified 0End-to-end Open-vocabulary Video Visual Relationship Detection using Multi-modal Prompting Sep 19, 2024 Decoder Object
— Unverified 0Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection Sep 13, 2024 Mamba Open Vocabulary Object Detection
Code Code Available 2A Lightweight Modular Framework for Low-Cost Open-Vocabulary Object Detection Training Aug 20, 2024 Autonomous Vehicles Computational Efficiency
Code Code Available 0On the Potential of Open-Vocabulary Models for Object Detection in Unusual Street Scenes Aug 20, 2024 Object object-detection
— Unverified 0Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community Aug 17, 2024 Novel Concepts Object
Code Code Available 3Query3D: LLM-Powered Open-Vocabulary Scene Segmentation with Language Embedded 3D Gaussian Aug 7, 2024 Autonomous Driving object-detection
Code Code Available 1MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection Jul 31, 2024 Language Modelling Object
Code Code Available 1LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction Jul 16, 2024 Language Modeling Language Modelling
Code Code Available 2Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion Jul 15, 2024 image-classification Image Classification
Code Code Available 0OVLW-DETR: Open-Vocabulary Light-Weighted Detection Transformer Jul 15, 2024 Language Modeling Language Modelling
Code Code Available 3DART: An Automated End-to-End Object Detection Pipeline with Data Diversification, Open-Vocabulary Bounding Box Annotation, Pseudo-Label Review, and Model Training Jul 12, 2024 Image Generation Object
Code Code Available 1BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs Jul 3, 2024 Image Captioning Image Generation
— Unverified 0V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results Jun 17, 2024 Object object-detection
— Unverified 0