| Observation-Graph Interaction and Key-Detail Guidance for Vision and Language Navigation | Mar 14, 2025 | cross-modal alignment, Navigate | —Unverified | 0 |
| On the Evaluation of Vision-and-Language Navigation Instructions | Jan 26, 2021 | Vision and Language Navigation | —Unverified | 0 |
| Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs | Sep 27, 2024 | Decision Making, Navigate | —Unverified | 0 |
| OVER-NAV: Elevating Iterative Vision-and-Language Navigation with Open-Vocabulary Detection and StructurEd Representation | Mar 26, 2024 | Vision and Language Navigation | —Unverified | 0 |
| PanoGen++: Domain-Adapted Text-Guided Panoramic Environment Generation for Vision-and-Language Navigation | Mar 13, 2025 | Image Inpainting, Image Outpainting | —Unverified | 0 |
| PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation | May 30, 2023 | Image Outpainting, Language Modelling | —Unverified | 0 |
| PASTS: Progress-Aware Spatio-Temporal Transformer Speaker For Vision-and-Language Navigation | May 19, 2023 | Data Augmentation, Vision and Language Navigation | —Unverified | 0 |
| Planning from Imagination: Episodic Simulation and Episodic Memory for Vision-and-Language Navigation | Nov 30, 2024 | Navigate, Vision and Language Navigation | —Unverified | 0 |
| Prompt-based Context- and Domain-aware Pretraining for Vision and Language Navigation | Sep 7, 2023 | Contrastive Learning, cross-modal alignment | —Unverified | 0 |
| Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities | Jul 17, 2025 | Large Language Model, Vision and Language Navigation | —Unverified | 0 |