Read, look and detect: Bounding box annotation from image-caption pairs Jun 9, 2023 Object object-detection
— Unverified 0Catalog Phrase Grounding (CPG): Grounding of Product Textual Attributes in Product Images for e-commerce Vision-Language Applications Aug 30, 2023 Decoder object-detection
— Unverified 0Investigating the Role of Attribute Context in Vision-Language Models for Object Recognition and Detection Mar 17, 2023 Attribute Contrastive Learning
— Unverified 0Enhancing the vision-language foundation model with key semantic knowledge-emphasized report refinement Jan 21, 2024 Medical Image Analysis Phrase Grounding
— Unverified 0Unsupervised Vision-Language Grammar Induction with Shared Structure Modeling Sep 29, 2021 Contrastive Learning Phrase Grounding
— Unverified 0Utilizing Every Image Object for Semi-supervised Phrase Grounding Nov 5, 2020 Phrase Grounding Referring Expression
— Unverified 0ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation Dec 12, 2024 Phrase Grounding Question Answering
— Unverified 0Zero-Shot Medical Phrase Grounding with Off-the-shelf Diffusion Models Apr 19, 2024 Contrastive Learning Phrase Grounding
Code Code Available 0A Joint Study of Phrase Grounding and Task Performance in Vision and Language Models Sep 6, 2023 Phrase Grounding
Code Code Available 0Anatomical grounding pre-training for medical phrase grounding Feb 23, 2025 Phrase Grounding Zero-Shot Learning
Code Code Available 0Augment the Pairs: Semantics-Preserving Image-Caption Pair Augmentation for Grounding-Based Vision and Language Models Nov 5, 2023 Data Augmentation Phrase Grounding
Code Code Available 0Box-based Refinement for Weakly Supervised and Unsupervised Localization Tasks Sep 7, 2023 Object Discovery Phrase Grounding
Code Code Available 0Conditional Image-Text Embedding Networks Nov 22, 2017 Phrase Grounding
Code Code Available 0Context-Infused Visual Grounding for Art Oct 16, 2024 object-detection Object Detection
Code Code Available 0Detector-Free Weakly Supervised Grounding by Separation Apr 20, 2021 Phrase Grounding
Code Code Available 0Disambiguating Reference in Visually Grounded Dialogues through Joint Modeling of Textual and Multimodal Semantic Structures May 16, 2025 coreference-resolution Coreference Resolution
Code Code Available 0Empathic Grounding: Explorations using Multimodal Interaction and Large Language Models with Conversational Agents Jul 1, 2024 Emotional Intelligence Emotion Classification
Code Code Available 0Extending Phrase Grounding with Pronouns in Visual Dialogues Oct 23, 2022 Phrase Grounding
Code Code Available 0Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring Mar 14, 2024 Object Object Counting
Code Code Available 0Grounding of Textual Phrases in Images by Reconstruction Nov 12, 2015 Language Modeling Language Modelling
Code Code Available 0Learning to Exploit Temporal Structure for Biomedical Vision-Language Processing Jan 11, 2023 Phrase Grounding Self-Supervised Learning
Code Code Available 0Learning to ground medical text in a 3D human atlas Nov 1, 2020 Phrase Grounding Visual Grounding
Code Code Available 0A Lightweight Modular Framework for Low-Cost Open-Vocabulary Object Detection Training Aug 20, 2024 Autonomous Vehicles Computational Efficiency
Code Code Available 0Localizing Active Objects from Egocentric Vision with Symbolic World Knowledge Oct 23, 2023 Phrase Grounding World Knowledge
Code Code Available 0Making the Most of Text Semantics to Improve Biomedical Vision--Language Processing Apr 21, 2022 Contrastive Learning Language Modeling
Code Code Available 0Modularized Textual Grounding for Counterfactual Resilience Apr 7, 2019 Attribute counterfactual
Code Code Available 0Multi-level Multimodal Common Semantic Space for Image-Phrase Grounding Nov 28, 2018 Language Modeling Language Modelling
Code Code Available 0Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding Jun 6, 2016 Phrase Grounding Visual Grounding
Code Code Available 0Natural Language Object Retrieval Nov 13, 2015 Image Captioning Image Retrieval
Code Code Available 0Revisiting Image-Language Networks for Open-ended Phrase Detection Nov 17, 2018 object-detection Object Detection
Code Code Available 0Trade-offs in Fine-tuned Diffusion Models Between Accuracy and Interpretability Mar 31, 2023 Conditional Image Generation Image Generation
Code Code Available 0Phrase Grounding by Soft-Label Chain Conditional Random Field Sep 1, 2019 Phrase Grounding Structured Prediction
Code Code Available 0Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding May 9, 2018 Diversity Phrase Grounding
Code Code Available 0Neural Parameter Allocation Search Jun 18, 2020 Image Classification Phrase Grounding
Code Code Available 0Similarity Maps for Self-Training Weakly-Supervised Phrase Grounding Jan 1, 2023 Phrase Grounding
Code Code Available 0Transformer with Controlled Attention for Synchronous Motion Captioning Sep 13, 2024 Action Localization Action Segmentation
Code Code Available 0VICCA: Visual Interpretation and Comprehension of Chest X-ray Anomalies in Generated Report Without Human Feedback Jan 29, 2025 Phrase Grounding
Code Code Available 0Zero-Shot Grounding of Objects from Natural Language Queries Aug 20, 2019 Natural Language Queries object-detection
Code Code Available 0