Zero-Shot Grounding of Objects from Natural Language Queries Aug 20, 2019 Natural Language Queries object-detection
Code Code Available 05 Read, look and detect: Bounding box annotation from image-caption pairs Jun 9, 2023 Object object-detection
— Unverified 00 Detailed Annotations of Chest X-Rays via CT Projection for Report Understanding Oct 7, 2022 Anatomy Phrase Grounding
— Unverified 00 CXR-Agent: Vision-language models for chest X-ray interpretation with uncertainty aware radiology reporting Jul 11, 2024 Data Augmentation Phrase Grounding
— Unverified 00 Medical Phrase Grounding with Region-Phrase Context Contrastive Alignment Mar 14, 2023 Medical Image Analysis Phrase Grounding
— Unverified 00 MedRG: Medical Report Grounding with Multi-modal Large Language Model Apr 10, 2024 Decoder Language Modeling
— Unverified 00 Utilizing Every Image Object for Semi-supervised Phrase Grounding Nov 5, 2020 Phrase Grounding Referring Expression
— Unverified 00 ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation Dec 12, 2024 Phrase Grounding Question Answering
— Unverified 00 A Comparison of Object Detection and Phrase Grounding Models in Chest X-ray Abnormality Localization using Eye-tracking Data Mar 2, 2025 object-detection Object Detection
— Unverified 00 Catalog Phrase Grounding (CPG): Grounding of Product Textual Attributes in Product Images for e-commerce Vision-Language Applications Aug 30, 2023 Decoder object-detection
— Unverified 00 Neural Sequential Phrase Grounding (SeqGROUND) Mar 18, 2019 Phrase Grounding
— Unverified 00 Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training Mar 4, 2024 Math Phrase Grounding
— Unverified 00 Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment Mar 27, 2019 Image Retrieval Phrase Grounding
— Unverified 00 Unsupervised Vision-Language Grammar Induction with Shared Structure Modeling Sep 29, 2021 Contrastive Learning Phrase Grounding
— Unverified 00 Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding Jan 28, 2025 object-detection Object Detection
— Unverified 00 CAVL: Learning Contrastive and Adaptive Representations of Vision and Language Apr 10, 2023 Image Retrieval Phrase Grounding
— Unverified 00 Phrase Grounding-based Style Transfer for Single-Domain Generalized Object Detection Feb 2, 2024 object-detection Object Detection
— Unverified 00 Anatomy-Grounded Weakly Supervised Prompt Tuning for Chest X-ray Latent Diffusion Models Jun 12, 2025 Anatomy Image Generation
— Unverified 00 Enhancing the vision-language foundation model with key semantic knowledge-emphasized report refinement Jan 21, 2024 Medical Image Analysis Phrase Grounding
— Unverified 00 PIRC Net : Using Proposal Indexing, Relationships and Context for Phrase Grounding Dec 7, 2018 Phrase Grounding Sentence
— Unverified 00 Pre-Training Multimodal Hallucination Detectors with Corrupted Grounding Data Aug 30, 2024 Hallucination Phrase Grounding
— Unverified 00 Grounding Plural Phrases: Countering Evaluation Biases by Individuation Jun 1, 2021 Phrase Grounding
— Unverified 00 Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension Jan 2, 2025 Generalized Referring Expression Comprehension Generalized Referring Expression Segmentation
— Unverified 00 How to Understand "Support"? An Implicit-enhanced Causal Inference Approach for Weakly-supervised Phrase Grounding Feb 29, 2024 Causal Inference counterfactual
— Unverified 00 Improving Pre-trained Vision-and-Language Embeddings for Phrase Grounding Nov 1, 2021 Multimodal Reasoning Phrase Grounding
— Unverified 00 Investigating the Role of Attribute Context in Vision-Language Models for Object Recognition and Detection Mar 17, 2023 Attribute Contrastive Learning
— Unverified 00 Knowledge Aided Consistency for Weakly Supervised Phrase Grounding Mar 11, 2018 Phrase Grounding
— Unverified 00 ELVIS: Empowering Locality of Vision Language Pre-training with Intra-modal Similarity Apr 11, 2023 Phrase Grounding
— Unverified 00 Language Features Matter: Effective Language Representations for Vision-Language Tasks Aug 17, 2019 Image Captioning Language Modelling
— Unverified 00 Dynamic Conditional Networks for Few-Shot Learning Sep 1, 2018 Face Generation Few-Shot Learning
— Unverified 00 Learning Deep Structure-Preserving Image-Text Embeddings Nov 19, 2015 Image Retrieval Image to text
— Unverified 00 Progressive Local Alignment for Medical Multimodal Pre-training Feb 25, 2025 Contrastive Learning Image-text Retrieval
— Unverified 00 Propagating Over Phrase Relations for One-Stage Visual Grounding Aug 1, 2020 Phrase Grounding Relational Reasoning
— Unverified 00 Q-GroundCAM: Quantifying Grounding in Vision Language Models via GradCAM Apr 29, 2024 Phrase Grounding Scene Understanding
— Unverified 00 LIMITR: Leveraging Local Information for Medical Image-Text Representation Mar 21, 2023 Image Retrieval Phrase Grounding
— Unverified 00 Lite-MDETR: A Lightweight Multi-Modal Detector Jan 1, 2022 object-detection Object Detection
— Unverified 00 Query-guided Regression Network with Context Policy for Phrase Grounding Aug 4, 2017 Phrase Grounding regression
— Unverified 00 Disentangled Motif-aware Graph Learning for Phrase Grounding Apr 13, 2021 Diversity Graph Learning
— Unverified 00