| ALFWorld: Aligning Text and Embodied Environments for Interactive Learning | Oct 8, 2020 | Natural Language Visual Grounding, Scene Understanding | Code Available | 1 |
| A Linguistic Analysis of Visually Grounded Dialogues Based on Spatial Expressions | Oct 7, 2020 | Coreference Resolution, Natural Language Visual Grounding | Code Available | 1 |
| Learning Cross-modal Context Graph for Visual Grounding | Feb 13, 2020 | Graph Matching, Graph Neural Network | Code Available | 1 |
| ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday Tasks | Dec 3, 2019 | Natural Language Visual Grounding | Code Available | 1 |
| Self-Monitoring Navigation Agent via Auxiliary Progress Estimation | Jan 10, 2019 | Natural Language Visual Grounding, Vision and Language Navigation | Code Available | 1 |
| Visual Writing Prompts: Character-Grounded Story Generation with Curated Image Sequences | Jan 20, 2023 | Coherence Evaluation, Grounded language learning | Unverified | 0 |
| Composing Pick-and-Place Tasks By Grounding Language | Feb 16, 2021 | Natural Language Visual Grounding, Robotic Grasping | Code Available | 0 |
| Searching for Ambiguous Objects in Videos using Relational Referring Expressions | Aug 3, 2019 | Deep Attention, Natural Language Visual Grounding | Code Available | 0 |
| Modularized Textual Grounding for Counterfactual Resilience | Apr 7, 2019 | Attribute, counterfactual | Code Available | 0 |
| Robust Change Captioning | Jan 8, 2019 | Natural Language Visual Grounding | Code Available | 0 |