| IVLMap: Instance-Aware Visual Language Grounding for Consumer Robot Navigation | Mar 28, 2024 | Attribute, Language Modelling | Unverified | 0 |
| Scaling Vision-and-Language Navigation With Offline RL | Mar 27, 2024 | Offline RL, Vision and Language Navigation | Unverified | 0 |
| OVER-NAV: Elevating Iterative Vision-and-Language Navigation with Open-Vocabulary Detection and StructurEd Representation | Mar 26, 2024 | Vision and Language Navigation | Unverified | 0 |
| Temporal-Spatial Object Relations Modeling for Vision-and-Language Navigation | Mar 23, 2024 | Navigate, Object | Unverified | 0 |
| Continual Vision-and-Language Navigation | Mar 22, 2024 | Continual Learning, Navigate | Unverified | 0 |
| Hierarchical Spatial Proximity Reasoning for Vision-and-Language Navigation | Mar 18, 2024 | Common Sense Reasoning, Efficient Exploration | Code Available | 0 |
| Mind the Error! Detection and Localization of Instruction Errors in Vision-and-Language Navigation | Mar 15, 2024 | Navigate, Vision and Language Navigation | Unverified | 0 |
| Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning | Mar 9, 2024 | Contrastive Learning, Navigate | Unverified | 0 |
| Causality-based Cross-Modal Representation Learning for Vision-and-Language Navigation | Mar 6, 2024 | Representation Learning, Vision and Language Navigation | Unverified | 0 |
| NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation | Feb 24, 2024 | Decision Making, Instruction Following | Unverified | 0 |
| VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation | Feb 5, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| NavHint: Vision and Language Navigation Agent with a Hint Generator | Feb 4, 2024 | Vision and Language Navigation | Code Available | 0 |
| MapGPT: Map-Guided Prompting with Adaptive Path Planning for Vision-and-Language Navigation | Jan 14, 2024 | Decision Making, Vision and Language Navigation | Unverified | 0 |
| Which way is 'right'?: Uncovering limitations of Vision-and-Language Navigation model | Nov 30, 2023 | Vision and Language Navigation | Unverified | 0 |
| DAP: Domain-aware Prompt Learning for Vision-and-Language Navigation | Nov 29, 2023 | cross-modal alignment, Navigate | Unverified | 0 |
| Does VLN Pretraining Work with Nonsensical or Irrelevant Instructions? | Nov 28, 2023 | Data Augmentation, Translation | Unverified | 0 |
| Vision and Language Navigation in the Real World via Online Visual Language Mapping | Oct 16, 2023 | Vision and Language Navigation | Unverified | 0 |
| LangNav: Language as a Perceptual Representation for Navigation | Oct 11, 2023 | Image Captioning, Language Modeling | Unverified | 0 |
| Evaluating Explanation Methods for Vision-and-Language Navigation | Oct 10, 2023 | Decision Making, Navigate | Unverified | 0 |
| Prompt-based Context- and Domain-aware Pretraining for Vision and Language Navigation | Sep 7, 2023 | Contrastive Learning, cross-modal alignment | Unverified | 0 |
| VLN-PETL: Parameter-Efficient Transfer Learning for Vision-and-Language Navigation | Aug 20, 2023 | Transfer Learning, Vision and Language Navigation | Code Available | 0 |
| A^2Nav: Action-Aware Zero-Shot Robot Navigation by Exploiting Vision-and-Language Ability of Foundation Models | Aug 15, 2023 | Navigate, Robot Navigation | Unverified | 0 |
| Mind the Gap: Improving Success Rate of Vision-and-Language Navigation by Revisiting Oracle Success Routes | Aug 7, 2023 | Navigate, Vision and Language Navigation | Unverified | 0 |
| Kefa: A Knowledge Enhanced and Fine-grained Aligned Speaker for Navigation Instruction Generation | Jul 25, 2023 | Vision and Language Navigation | Code Available | 0 |
| Behavioral Analysis of Vision-and-Language Navigation Agents | Jul 20, 2023 | Vision and Language Navigation | Code Available | 0 |