| Hierarchical Spatial Proximity Reasoning for Vision-and-Language Navigation | Mar 18, 2024 | Common Sense Reasoning, Efficient Exploration | Code Available | 0 |
| Mind the Error! Detection and Localization of Instruction Errors in Vision-and-Language Navigation | Mar 15, 2024 | Navigate, Vision and Language Navigation | Unverified | 0 |
| NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning | Mar 12, 2024 | Navigate, Vision and Language Navigation | Code Available | 2 |
| Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning | Mar 9, 2024 | Contrastive Learning, Navigate | Unverified | 0 |
| Causality-based Cross-Modal Representation Learning for Vision-and-Language Navigation | Mar 6, 2024 | Representation Learning, Vision and Language Navigation | Unverified | 0 |
| NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation | Feb 24, 2024 | Decision Making, Instruction Following | Unverified | 0 |
| WebLINX: Real-World Website Navigation with Multi-Turn Dialogue | Feb 8, 2024 | Conversational Web Navigation, Text Generation | Code Available | 5 |
| VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation | Feb 5, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| NavHint: Vision and Language Navigation Agent with a Hint Generator | Feb 4, 2024 | Vision and Language Navigation | Code Available | 0 |
| MapGPT: Map-Guided Prompting with Adaptive Path Planning for Vision-and-Language Navigation | Jan 14, 2024 | Decision Making, Vision and Language Navigation | Unverified | 0 |