| The Road to Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation | Apr 9, 2021 | Vision and Language NavigationVision-Language Navigation | CodeCode Available | 1 |
| Diagnosing Vision-and-Language Navigation: What Really Matters | Mar 30, 2021 | DiagnosticObject | CodeCode Available | 0 |
| CrossMap Transformer: A Crossmodal Masked Path Transformer Using Double Back-Translation for Vision-and-Language Navigation | Mar 1, 2021 | TranslationVision and Language Navigation | —Unverified | 0 |
| On the Evaluation of Vision-and-Language Navigation Instructions | Jan 26, 2021 | Vision and Language Navigation | —Unverified | 0 |
| Visual Perception Generalization for Vision-and-Language Navigation via Meta-Learning | Dec 10, 2020 | Meta-LearningNavigate | —Unverified | 0 |
| Topological Planning with Transformers for Vision-and-Language Navigation | Dec 9, 2020 | Vision and Language Navigation | —Unverified | 0 |
| Counterfactual Vision-and-Language Navigation: Unravelling the Unseen | Dec 1, 2020 | counterfactualEmbodied Question Answering | —Unverified | 0 |
| A Recurrent Vision-and-Language BERT for Navigation | Nov 26, 2020 | Decision MakingDecoder | CodeCode Available | 1 |
| Language-guided Navigation via Cross-Modal Grounding and Alternate Adversarial Learning | Nov 22, 2020 | Imitation LearningNavigate | —Unverified | 0 |
| ArraMon: A Joint Navigation-Assembly Instruction Interpretation Task in Dynamic Environments | Nov 15, 2020 | Referring ExpressionReferring Expression Comprehension | —Unverified | 0 |