| HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation | Mar 22, 2022 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Cross-modal Map Learning for Vision and Language Navigation | Mar 10, 2022 | Vision and Language Navigation | CodeCode Available | 1 |
| Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation | Mar 5, 2022 | Imitation LearningVision and Language Navigation | CodeCode Available | 1 |
| One Step at a Time: Long-Horizon Vision-and-Language Navigation with Milestones | Feb 14, 2022 | Vision and Language Navigation | CodeCode Available | 1 |
| Landmark-RxR: Solving Vision-and-Language Navigation with Fine-Grained Alignment Supervision | Dec 1, 2021 | cross-modal alignmentNavigate | CodeCode Available | 1 |
| Multimodal Transformer with Variable-length Memory for Vision-and-Language Navigation | Nov 10, 2021 | DecoderNavigate | CodeCode Available | 1 |
| History Aware Multimodal Transformer for Vision-and-Language Navigation | Oct 25, 2021 | Decision MakingNavigate | CodeCode Available | 1 |
| SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments | Aug 26, 2021 | Vision and Language Navigation | CodeCode Available | 1 |
| Airbert: In-domain Pretraining for Vision-and-Language Navigation | Aug 20, 2021 | NavigateReferring Expression | CodeCode Available | 1 |
| Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation | Jul 23, 2021 | Vision and Language NavigationVision-Language Navigation | CodeCode Available | 1 |