| Speaker-Follower Models for Vision-and-Language Navigation | Jun 7, 2018 | Data AugmentationVision and Language Navigation | CodeCode Available | 0 | 5 |
| Ground then Navigate: Language-guided Navigation in Dynamic Scenes | Sep 24, 2022 | Autonomous DrivingNavigate | CodeCode Available | 0 | 5 |
| REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments | Apr 23, 2019 | Referring ExpressionVision and Language Navigation | CodeCode Available | 0 | 5 |
| NavHint: Vision and Language Navigation Agent with a Hint Generator | Feb 4, 2024 | Vision and Language Navigation | CodeCode Available | 0 | 5 |
| GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot Attention for Vision-and-Language Navigation | May 26, 2023 | Vision and Language Navigation | CodeCode Available | 0 | 5 |
| A Priority Map for Vision-and-Language Navigation with Trajectory Plans and Feature-Location Cues | Jul 24, 2022 | cross-modal alignmentTrajectory Planning | CodeCode Available | 0 | 5 |
| Narrowing the Gap between Vision and Action in Navigation | Aug 19, 2024 | DecoderSpatial Reasoning | CodeCode Available | 0 | 5 |
| FOAM: A Follower-aware Speaker Model For Vision-and-Language Navigation | Jun 9, 2022 | Vision and Language Navigation | CodeCode Available | 0 | 5 |
| A Navigation Framework Utilizing Vision-Language Models | Jun 11, 2025 | NavigatePrompt Engineering | CodeCode Available | 0 | 5 |
| MLANet: Multi-Level Attention Network with Sub-instruction for Continuous Vision-and-Language Navigation | Mar 2, 2023 | NavigateVision and Language Navigation | CodeCode Available | 0 | 5 |