| Counterfactual Vision-and-Language Navigation: Unravelling the Unseen | Dec 1, 2020 | counterfactual, Embodied Question Answering | —Unverified | 0 | 0 |
| CrossMap Transformer: A Crossmodal Masked Path Transformer Using Double Back-Translation for Vision-and-Language Navigation | Mar 1, 2021 | Translation, Vision and Language Navigation | —Unverified | 0 | 0 |
| Curriculum Learning for Vision-and-Language Navigation | Nov 14, 2021 | Vision and Language Navigation | —Unverified | 0 | 0 |
| DAP: Domain-aware Prompt Learning for Vision-and-Language Navigation | Nov 29, 2023 | cross-modal alignment, Navigate | —Unverified | 0 | 0 |
| Diagnosing Vision-and-Language Navigation: What Really Matters | Dec 17, 2021 | Diagnostic, Object | —Unverified | 0 | 0 |
| Disrupting Vision-Language Model-Driven Navigation Services via Adversarial Object Fusion | May 29, 2025 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Does VLN Pretraining Work with Nonsensical or Irrelevant Instructions? | Nov 28, 2023 | Data Augmentation, Translation | —Unverified | 0 | 0 |
| DOPE: Dual Object Perception-Enhancement Network for Vision-and-Language Navigation | Apr 30, 2025 | Navigate, Object | —Unverified | 0 | 0 |
| Do Visual Imaginations Improve Vision-and-Language Navigation Agents? | Mar 20, 2025 | Vision and Language Navigation | —Unverified | 0 | 0 |
| Endowing Embodied Agents with Spatial Reasoning Capabilities for Vision-and-Language Navigation | Apr 9, 2025 | Hallucination, Spatial Reasoning | —Unverified | 0 | 0 |