| Giving Commands to a Self-driving Car: A Multimodal Reasoner for Visual Grounding | Mar 19, 2020 | Object, Referring Expression Comprehension | —Unverified | 0 |
| MUTATT: Visual-Textual Mutual Guidance for Referring Expression Comprehension | Mar 18, 2020 | Referring Expression, Referring Expression Comprehension | —Unverified | 0 |
| Cops-Ref: A new Dataset and Task on Compositional Referring Expression Comprehension | Mar 1, 2020 | Referring Expression, Referring Expression Comprehension | —Unverified | 0 |
| A Real-time Global Inference Network for One-stage Referring Expression Comprehension | Dec 7, 2019 | Diversity, feature selection | Code Available | 0 |
| UNITER: Learning UNiversal Image-TExt Representations | Sep 25, 2019 | Image-text matching, Image-text Retrieval | —Unverified | 0 |
| Dynamic Graph Attention for Referring Expression Comprehension | Sep 18, 2019 | Graph Attention, Referring Expression | —Unverified | 0 |
| A Real-Time Cross-modality Correlation Filtering Method for Referring Expression Comprehension | Sep 16, 2019 | Referring Expression, Referring Expression Comprehension | —Unverified | 0 |
| Language-Conditioned Graph Networks for Relational Reasoning | May 10, 2019 | Object, Referring Expression Comprehension | Code Available | 0 |
| VQD: Visual Query Detection in Natural Scenes | Apr 4, 2019 | Referring Expression, Referring Expression Comprehension | —Unverified | 0 |
| CLEVR-Ref+: Diagnosing Visual Reasoning with Referring Expressions | Jan 3, 2019 | Diagnostic, Image Segmentation | Code Available | 0 |
| Neighbourhood Watch: Referring Expression Comprehension via Language-guided Graph Attention Networks | Dec 12, 2018 | Graph Attention, Object | —Unverified | 0 |
| Real-Time Referring Expression Comprehension by Single-Stage Grounding Network | Dec 9, 2018 | Attribute, Referring Expression | —Unverified | 0 |
| MAttNet: Modular Attention Network for Referring Expression Comprehension | Jan 24, 2018 | Generalized Referring Expression Segmentation, Referring Expression | Code Available | 0 |
| Parallel Attention: A Unified Framework for Visual Object Discovery through Dialogs and Queries | Nov 17, 2017 | Object, Object Discovery | —Unverified | 0 |
| A Joint Speaker-Listener-Reinforcer Model for Referring Expressions | Dec 30, 2016 | Referring Expression, Referring Expression Comprehension | Code Available | 0 |
| Natural Language Object Retrieval | Nov 13, 2015 | Image Captioning, Image Retrieval | Code Available | 0 |
| Deep Fragment Embeddings for Bidirectional Image Sentence Mapping | Jun 22, 2014 | Referring Expression Comprehension, Retrieval | —Unverified | 0 |