| Look Before You Leap: Bridging Model-Free and Model-Based Reinforcement Learning for Planned-Ahead Vision-and-Language Navigation | Mar 21, 2018 | Deep Reinforcement Learning, model | Code Available | 0 | 5 |
| FOAM: A Follower-aware Speaker Model For Vision-and-Language Navigation | Jun 9, 2022 | Vision and Language Navigation | Code Available | 0 | 5 |
| Spatially-Aware Speaker for Vision-and-Language Navigation Instruction Generation | Sep 9, 2024 | Vision and Language Navigation | Code Available | 0 | 5 |
| Speaker-Follower Models for Vision-and-Language Navigation | Jun 7, 2018 | Data Augmentation, Vision and Language Navigation | Code Available | 0 | 5 |
| Diagnosing Vision-and-Language Navigation: What Really Matters | Mar 30, 2021 | Diagnostic, Object | Code Available | 0 | 5 |
| CLEAR: Improving Vision-Language Navigation with Cross-Lingual, Environment-Agnostic Representations | Jul 5, 2022 | Navigate, Representation Learning | Code Available | 0 | 5 |
| Augmented Commonsense Knowledge for Remote Object Grounding | Jun 3, 2024 | Decision Making, Object | Code Available | 0 | 5 |
| Chasing Ghosts: Instruction Following as Bayesian State Tracking | Jul 3, 2019 | Instruction Following, Vision and Language Navigation | Code Available | 0 | 5 |
| Tactical Rewind: Self-Correction via Backtracking in Vision-and-Language Navigation | Mar 6, 2019 | Vision and Language Navigation, Vision-Language Navigation | Code Available | 0 | 5 |
| Into the Unknown: Generating Geospatial Descriptions for New Environments | Jun 28, 2024 | Language Modelling, Large Language Model | Code Available | 0 | 5 |
| LOViS: Learning Orientation and Visual Signals for Vision and Language Navigation | Sep 26, 2022 | Spatial Reasoning, Vision and Language Navigation | Code Available | 0 | 5 |
| DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning | Apr 2, 2024 | Contrastive Learning, Decision Making | Code Available | 0 | 5 |
| Narrowing the Gap between Vision and Action in Navigation | Aug 19, 2024 | Decoder, Spatial Reasoning | Code Available | 0 | 5 |
| The Regretful Agent: Heuristic-Aided Navigation through Progress Estimation | Mar 5, 2019 | Decision Making, Vision and Language Navigation | Code Available | 0 | 5 |
| Zero-Shot Vision-and-Language Navigation with Collision Mitigation in Continuous Environment | Oct 7, 2024 | Large Language Model, Vision and Language Navigation | —Unverified | 0 | 0 |
| A^2Nav: Action-Aware Zero-Shot Robot Navigation by Exploiting Vision-and-Language Ability of Foundation Models | Aug 15, 2023 | Navigate, Robot Navigation | —Unverified | 0 | 0 |
| Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning | Oct 11, 2024 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Aerial Vision-and-Language Navigation with Grid-based View Selection and Map Construction | Mar 14, 2025 | Navigate, Vision and Language Navigation | —Unverified | 0 | 0 |
| AIGeN: An Adversarial Approach for Instruction Generation in VLN | Apr 15, 2024 | Decoder, Vision and Language Navigation | —Unverified | 0 | 0 |
| A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning | Oct 6, 2022 | Imitation Learning, Instruction Following | —Unverified | 0 | 0 |
| Anticipating the Unseen Discrepancy for Vision and Language Navigation | Sep 10, 2022 | Data Augmentation, Decision Making | —Unverified | 0 | 0 |
| ArraMon: A Joint Navigation-Assembly Instruction Interpretation Task in Dynamic Environments | Nov 15, 2020 | Referring Expression, Referring Expression Comprehension | —Unverified | 0 | 0 |
| Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments – Extended Abstract | Jun 12, 2020 | Vision and Language Navigation | —Unverified | 0 | 0 |
| Causality-based Cross-Modal Representation Learning for Vision-and-Language Navigation | Mar 6, 2024 | Representation Learning, Vision and Language Navigation | —Unverified | 0 | 0 |
| CLIP-Nav: Using CLIP for Zero-Shot Vision-and-Language Navigation | Nov 30, 2022 | Diversity, Instruction Following | —Unverified | 0 | 0 |
| Continual Vision-and-Language Navigation | Mar 22, 2024 | Continual Learning, Navigate | —Unverified | 0 | 0 |
| Contrast Sets for Evaluating Language-Guided Robot Policies | Jun 19, 2024 | Vision and Language Navigation | —Unverified | 0 | 0 |
| COSMO: Combination of Selective Memorization for Low-cost Vision-and-Language Navigation | Mar 31, 2025 | Memorization, Vision and Language Navigation | —Unverified | 0 | 0 |
| Counterfactual Vision-and-Language Navigation via Adversarial Path Sampling | Nov 17, 2019 | counterfactual, Counterfactual Reasoning | —Unverified | 0 | 0 |
| Counterfactual Vision-and-Language Navigation via Adversarial Path Sampler | Aug 1, 2020 | counterfactual, Counterfactual Reasoning | —Unverified | 0 | 0 |
| Counterfactual Vision-and-Language Navigation: Unravelling the Unseen | Dec 1, 2020 | counterfactual, Embodied Question Answering | —Unverified | 0 | 0 |
| CrossMap Transformer: A Crossmodal Masked Path Transformer Using Double Back-Translation for Vision-and-Language Navigation | Mar 1, 2021 | Translation, Vision and Language Navigation | —Unverified | 0 | 0 |
| Curriculum Learning for Vision-and-Language Navigation | Nov 14, 2021 | Vision and Language Navigation | —Unverified | 0 | 0 |
| DAP: Domain-aware Prompt Learning for Vision-and-Language Navigation | Nov 29, 2023 | cross-modal alignment, Navigate | —Unverified | 0 | 0 |
| Diagnosing Vision-and-Language Navigation: What Really Matters | Dec 17, 2021 | Diagnostic, Object | —Unverified | 0 | 0 |
| Disrupting Vision-Language Model-Driven Navigation Services via Adversarial Object Fusion | May 29, 2025 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Does VLN Pretraining Work with Nonsensical or Irrelevant Instructions? | Nov 28, 2023 | Data Augmentation, Translation | —Unverified | 0 | 0 |
| DOPE: Dual Object Perception-Enhancement Network for Vision-and-Language Navigation | Apr 30, 2025 | Navigate, Object | —Unverified | 0 | 0 |
| Do Visual Imaginations Improve Vision-and-Language Navigation Agents? | Mar 20, 2025 | Vision and Language Navigation | —Unverified | 0 | 0 |
| Endowing Embodied Agents with Spatial Reasoning Capabilities for Vision-and-Language Navigation | Apr 9, 2025 | Hallucination, Spatial Reasoning | —Unverified | 0 | 0 |
| Evaluating Explanation Methods for Vision-and-Language Navigation | Oct 10, 2023 | Decision Making, Navigate | —Unverified | 0 | 0 |
| Evolving Graphical Planner: Contextual Global Planning for Vision-and-Language Navigation | Jul 11, 2020 | Decision Making, Imitation Learning | —Unverified | 0 | 0 |
| Explicit Object Relation Alignment for Vision and Language Navigation | Nov 16, 2021 | Instruction Following, Relation | —Unverified | 0 | 0 |
| Explore the Potential Performance of Vision-and-Language Navigation Model: a Snapshot Ensemble Method | Nov 28, 2021 | Vision and Language Navigation | —Unverified | 0 | 0 |
| Explore the Potential Performance of Vision-and-Language Navigation Model: a Snapshot Ensemble Method | Jan 16, 2022 | Vision and Language Navigation | —Unverified | 0 | 0 |
| Extended Abstract: Improving Vision-and-Language Navigation with Image-Text Pairs from the Web | Jun 12, 2020 | Vision and Language Navigation | —Unverified | 0 | 0 |
| Fine-Grained Alignment in Vision-and-Language Navigation through Bayesian Optimization | Nov 22, 2024 | Bayesian Optimization, Contrastive Learning | —Unverified | 0 | 0 |
| FlexVLN: Flexible Adaptation for Diverse Vision-and-Language Navigation Tasks | Mar 18, 2025 | Vision and Language Navigation | —Unverified | 0 | 0 |
| Generative Language-Grounded Policy in Vision-and-Language Navigation with Bayes' Rule | Sep 16, 2020 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Graph based Environment Representation for Vision-and-Language Navigation in Continuous Environments | Jan 11, 2023 | Object, object-detection | —Unverified | 0 | 0 |