| KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation | Mar 28, 2023 | NavigateVision and Language Navigation | CodeCode Available | 1 |
| HOP+: History-enhanced and Order-aware Pre-training for Vision-and-Language Navigation | Mar 20, 2023 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene Object Spectrum Grounding | Mar 7, 2023 | Vision and Language NavigationVisual Navigation | —Unverified | 0 |
| MLANet: Multi-Level Attention Network with Sub-instruction for Continuous Vision-and-Language Navigation | Mar 2, 2023 | NavigateVision and Language Navigation | CodeCode Available | 0 |
| ESceme: Vision-and-Language Navigation with Episodic Scene Memory | Mar 2, 2023 | Vision and Language Navigation | CodeCode Available | 1 |
| VLN-Trans: Translator for the Vision and Language Navigation Agent | Feb 18, 2023 | Vision and Language Navigation | CodeCode Available | 1 |
| Graph based Environment Representation for Vision-and-Language Navigation in Continuous Environments | Jan 11, 2023 | Objectobject-detection | —Unverified | 0 |
| BEVBert: Multimodal Map Pre-training for Language-guided Navigation | Dec 8, 2022 | Vision and Language NavigationVisual Navigation | CodeCode Available | 2 |
| CLIP-Nav: Using CLIP for Zero-Shot Vision-and-Language Navigation | Nov 30, 2022 | DiversityInstruction Following | —Unverified | 0 |
| Navigation as Attackers Wish? Towards Building Robust Embodied Agents under Federated Learning | Nov 27, 2022 | Federated LearningNavigate | —Unverified | 0 |