| NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models | May 26, 2023 | Instruction FollowingVision and Language Navigation | CodeCode Available | 2 |
| NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous Environments | Jun 30, 2025 | Decision MakingVision and Language Navigation | CodeCode Available | 2 |
| Scaling Data Generation in Vision-and-Language Navigation | Jul 28, 2023 | Imitation LearningVision and Language Navigation | CodeCode Available | 2 |
| Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation | Jun 14, 2024 | NavigateVision and Language Navigation | CodeCode Available | 2 |
| BEVBert: Multimodal Map Pre-training for Language-guided Navigation | Dec 8, 2022 | Vision and Language NavigationVisual Navigation | CodeCode Available | 2 |
| Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation | May 16, 2025 | 3D geometryNavigate | CodeCode Available | 2 |
| FlightGPT: Towards Generalizable and Interpretable UAV Vision-and-Language Navigation with Vision-Language Models | May 19, 2025 | Disaster ResponseVision and Language Navigation | CodeCode Available | 2 |
| General Scene Adaptation for Vision-and-Language Navigation | Jan 29, 2025 | DiversityVision and Language Navigation | CodeCode Available | 2 |
| 1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022) | Jun 23, 2022 | Data AugmentationVision and Language Navigation | CodeCode Available | 2 |
| FLAME: Learning to Navigate with Multimodal LLM in Urban Environments | Aug 20, 2024 | NavigateVision and Language Navigation | CodeCode Available | 2 |