| Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions | Mar 22, 2022 | Vision and Language Navigation | Code Available | 2 |
| Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation | Feb 23, 2022 | Efficient Exploration, Navigate | Code Available | 2 |
| Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation | May 27, 2025 | Large Language Model, Logical Reasoning | Code Available | 1 |
| CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory | May 8, 2025 | Large Language Model, Navigate | Code Available | 1 |
| Agent Journey Beyond RGB: Unveiling Hybrid Semantic-Spatial Environmental Representations for Vision-and-Language Navigation | Dec 9, 2024 | Object Localization, Vision and Language Navigation | Code Available | 1 |
| g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks | Nov 26, 2024 | Contrastive Learning, Question Answering | Code Available | 1 |
| Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed Environments | Jul 31, 2024 | graph construction, Navigate | Code Available | 1 |
| PRET: Planning with Directed Fidelity Trajectory for Vision and Language Navigation | Jul 16, 2024 | Navigate, Vision and Language Navigation | Code Available | 1 |
| MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation for Effective-and-Efficient Vision-and-Language Navigation | Jun 25, 2024 | Knowledge Distillation, Test unseen | Code Available | 1 |
| Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts | Jun 4, 2024 | Navigate, Vision and Language Navigation | Code Available | 1 |