| Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing | Mar 3, 2019 | Referring Expression | —Unverified | 0 |
| Improving the generation of personalised descriptions | Sep 1, 2017 | Referring Expression, Referring expression generation | —Unverified | 0 |
| Improving the Naturalness and Diversity of Referring Expression Generation models using Minimum Risk Training | Dec 1, 2020 | Diversity, Referring Expression | —Unverified | 0 |
| Informativity in Image Captions vs. Referring Expressions | Jun 1, 2020 | Image Captioning, Object | —Unverified | 0 |
| Instance-Aware Generalized Referring Expression Segmentation | Nov 22, 2024 | Generalized Referring Expression Segmentation, Object | —Unverified | 0 |
| Intrinsic Task-based Evaluation for Referring Expression Generation | Feb 12, 2024 | Referring Expression, Referring expression generation | —Unverified | 0 |
| Justifying Corpus-Based Choices in Referring Expression Generation | Sep 1, 2013 | Referring Expression, Referring expression generation | —Unverified | 0 |
| Key-Word-Aware Network for Referring Expression Image Segmentation | Sep 1, 2018 | Image Segmentation, Object | —Unverified | 0 |
| Language Controls More Than Top-Down Attention: Modulating Bottom-Up Visual Processing with Referring Expressions | Jan 1, 2021 | Referring Expression | —Unverified | 0 |
| Language-Guided 3D Object Detection in Point Cloud for Autonomous Driving | May 25, 2023 | 3D Object Detection, Autonomous Driving | —Unverified | 0 |
| Language-Mediated, Object-Centric Representation Learning | Dec 31, 2020 | Object, Object Discovery | —Unverified | 0 |
| Learning Distributions over Logical Forms for Referring Expression Generation | Oct 1, 2013 | Density Estimation, Referring Expression | —Unverified | 0 |
| Learning Preferences for Referring Expression Generation: Effects of Domain, Language and Algorithm | May 1, 2012 | Referring Expression, Referring expression generation | —Unverified | 0 |
| Learning Pseudo-Labeler beyond Noun Concepts for Open-Vocabulary Object Detection | Dec 4, 2023 | Image to text, object-detection | —Unverified | 0 |
| Learning to Generate Unambiguous Spatial Referring Expressions for Real-World Environments | Apr 15, 2019 | Referring Expression | —Unverified | 0 |
| Learning to Reason and Navigate: Parameter Efficient Action Planning with Large Language Models | May 12, 2025 | Navigate, Referring Expression | —Unverified | 0 |
| Learning to Represent Image and Text with Denotation Graph | Oct 6, 2020 | Attribute, Image Retrieval | —Unverified | 0 |
| Learning Visual Grounding from Generative Vision and Language Model | Jul 18, 2024 | Attribute, Language Modeling | —Unverified | 0 |
| Lessons from Computational Modelling of Reference Production in Mandarin and English | Nov 14, 2020 | Referring Expression, Referring expression generation | —Unverified | 0 |
| Leveraging Non-Specialists for Accurate and Time Efficient AMR Annotation | May 1, 2020 | Referring Expression, Referring Expression Comprehension | —Unverified | 0 |
| Leveraging Past References for Robust Language Grounding | Nov 1, 2019 | Object, Referring Expression | —Unverified | 0 |
| LGD: Leveraging Generative Descriptions for Zero-Shot Referring Image Segmentation | Apr 20, 2025 | Attribute, Image Segmentation | —Unverified | 0 |
| Lite-MDETR: A Lightweight Multi-Modal Detector | Jan 1, 2022 | object-detection, Object Detection | —Unverified | 0 |
| Look Hear: Gaze Prediction for Speech-directed Human Attention | Jul 28, 2024 | Decoder, Gaze Prediction | —Unverified | 0 |
| M^2IST: Multi-Modal Interactive Side-Tuning for Efficient Referring Expression Comprehension | Jul 1, 2024 | GPU, Referring Expression | —Unverified | 0 |
| MAGNet: Multi-Region Attention-Assisted Grounding of Natural Language Queries at Phrase Level | Jun 6, 2020 | Attribute, Image Captioning | —Unverified | 0 |
| Make Graph-based Referring Expression Comprehension Great Again through Expression-guided Dynamic Gating and Regression | Sep 5, 2024 | Referring Expression, Referring Expression Comprehension | —Unverified | 0 |
| Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval | Jun 28, 2025 | Cross-Modal Retrieval, Image Captioning | —Unverified | 0 |
| MaskInversion: Localized Embeddings via Optimization of Explainability Maps | Jul 29, 2024 | Image Generation, Referring Expression | —Unverified | 0 |
| Meta Compositional Referring Expression Segmentation | Apr 10, 2023 | Meta-Learning, Referring Expression | —Unverified | 0 |
| Meteorologists and Students: A resource for language grounding of geographical descriptors | Sep 7, 2018 | Referring Expression, Referring expression generation | —Unverified | 0 |
| Modeling Semantic Expectation: Using Script Knowledge for Referent Prediction | Feb 10, 2017 | Common Sense Reasoning, Prediction | —Unverified | 0 |
| Modular Graph Attention Network for Complex Visual Relational Reasoning | Nov 22, 2020 | Graph Attention, Question Answering | —Unverified | 0 |
| MuDoCo: Corpus for Multidomain Coreference Resolution and Referring Expression Generation | May 1, 2020 | coreference-resolution, Coreference Resolution | —Unverified | 0 |
| Multi-modal Domain Adaptation for REG via Relation Transfer | Sep 23, 2023 | Domain Adaptation, image-classification | —Unverified | 0 |
| Referring Expression Generation in time-constrained communication | May 1, 2018 | Referring Expression, Referring expression generation | —Unverified | 0 |
| Referring Expression Generation under Uncertainty: Algorithm and Evaluation Framework | Sep 1, 2017 | Referring Expression, Referring expression generation | —Unverified | 0 |
| Joint Visual Grounding with Language Scene Graphs | Jun 9, 2019 | Referring Expression, Visual Grounding | —Unverified | 0 |
| Referring Expression Instance Retrieval and A Strong End-to-End Baseline | Jun 23, 2025 | Image Retrieval, Referring Expression | —Unverified | 0 |
| Referring Expressions with Rational Speech Act Framework: A Probabilistic Approach | May 16, 2022 | Deep Learning, Referring Expression | —Unverified | 0 |
| Referring Image Segmentation by Generative Adversarial Learning | Apr 20, 2020 | Image Segmentation, Referring Expression | —Unverified | 0 |
| Referring Segmentation in Images and Videos with Cross-Modal Self-Attention Network | Feb 9, 2021 | Referring Expression, Referring Expression Segmentation | —Unverified | 0 |
| Referring to what you know and do not know: Making Referring Expression Generation Models Generalize To Unseen Entities | Dec 1, 2020 | Decoder, Referring Expression | —Unverified | 0 |
| Refer to Anything with Vision-Language Prompts | Jun 5, 2025 | Benchmarking, Generalized Referring Expression Segmentation | —Unverified | 0 |
| RefTeacher: A Strong Baseline for Semi-Supervised Referring Expression Comprehension | Jan 1, 2023 | Imitation Learning, Pseudo Label | —Unverified | 0 |
| REMI: Mining Intuitive Referring Expressions on Knowledge Bases | Nov 4, 2019 | Inductive logic programming, Referring Expression | —Unverified | 0 |
| RESAnything: Attribute Prompting for Arbitrary Referring Segmentation | May 3, 2025 | Attribute, Image Segmentation | —Unverified | 0 |
| RESMatch: Referring Expression Segmentation in a Semi-Supervised Manner | Feb 8, 2024 | Image Segmentation, Pseudo Label | —Unverified | 0 |
| Resolving Referring Expressions in Images With Labeled Elements | Oct 24, 2018 | Optical Character Recognition, Optical Character Recognition (OCR) | —Unverified | 0 |
| Revisiting Multi-Modal LLM Evaluation | Aug 9, 2024 | Chart Understanding, Optical Character Recognition | —Unverified | 0 |