| Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs | Sep 27, 2024 | Decision MakingNavigate | —Unverified | 0 |
| Spatially-Aware Speaker for Vision-and-Language Navigation Instruction Generation | Sep 9, 2024 | Vision and Language Navigation | CodeCode Available | 0 |
| Seeing is Believing? Enhancing Vision-Language Navigation using Visual Perturbations | Sep 9, 2024 | Autonomous NavigationDiversity | —Unverified | 0 |
| FLAME: Learning to Navigate with Multimodal LLM in Urban Environments | Aug 20, 2024 | NavigateVision and Language Navigation | CodeCode Available | 2 |
| Narrowing the Gap between Vision and Action in Navigation | Aug 19, 2024 | DecoderSpatial Reasoning | CodeCode Available | 0 |
| Loc4Plan: Locating Before Planning for Outdoor Vision and Language Navigation | Aug 9, 2024 | NavigatePosition | —Unverified | 0 |
| Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed Environments | Jul 31, 2024 | graph constructionNavigate | CodeCode Available | 1 |
| NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models | Jul 17, 2024 | Instruction FollowingVision and Language Navigation | CodeCode Available | 3 |
| PRET: Planning with Directed Fidelity Trajectory for Vision and Language Navigation | Jul 16, 2024 | NavigateVision and Language Navigation | CodeCode Available | 1 |
| Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models | Jul 9, 2024 | Vision and Language Navigation | CodeCode Available | 3 |