| Narrowing the Gap between Vision and Action in Navigation | Aug 19, 2024 | Decoder, Spatial Reasoning | Code Available | 0 |
| GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot Attention for Vision-and-Language Navigation | May 26, 2023 | Vision and Language Navigation | Code Available | 0 |
| Augmented Commonsense Knowledge for Remote Object Grounding | Jun 3, 2024 | Decision Making, Object | Code Available | 0 |
| MLANet: Multi-Level Attention Network with Sub-instruction for Continuous Vision-and-Language Navigation | Mar 2, 2023 | Navigate, Vision and Language Navigation | Code Available | 0 |
| A Priority Map for Vision-and-Language Navigation with Trajectory Plans and Feature-Location Cues | Jul 24, 2022 | cross-modal alignment, Trajectory Planning | Code Available | 0 |
| ULN: Towards Underspecified Vision-and-Language Navigation | Oct 18, 2022 | Vision and Language Navigation | Code Available | 0 |
| LOViS: Learning Orientation and Visual Signals for Vision and Language Navigation | Sep 26, 2022 | Spatial Reasoning, Vision and Language Navigation | Code Available | 0 |
| Look Before You Leap: Bridging Model-Free and Model-Based Reinforcement Learning for Planned-Ahead Vision-and-Language Navigation | Mar 21, 2018 | Deep Reinforcement Learning, model | Code Available | 0 |
| Local Slot Attention for Vision-and-Language Navigation | Jun 17, 2022 | Navigate, Vision and Language Navigation | Code Available | 0 |
| VLN-PETL: Parameter-Efficient Transfer Learning for Vision-and-Language Navigation | Aug 20, 2023 | Transfer Learning, Vision and Language Navigation | Code Available | 0 |
| FOAM: A Follower-aware Speaker Model For Vision-and-Language Navigation | Jun 9, 2022 | Vision and Language Navigation | Code Available | 0 |
| Explicit Object Relation Alignment for Vision and Language Navigation | May 1, 2022 | Object, Relation | Code Available | 0 |
| Spatially-Aware Speaker for Vision-and-Language Navigation Instruction Generation | Sep 9, 2024 | Vision and Language Navigation | Code Available | 0 |
| Speaker-Follower Models for Vision-and-Language Navigation | Jun 7, 2018 | Data Augmentation, Vision and Language Navigation | Code Available | 0 |
| Chasing Ghosts: Instruction Following as Bayesian State Tracking | Jul 3, 2019 | Instruction Following, Vision and Language Navigation | Code Available | 0 |
| Embodied Vision-and-Language Navigation with Dynamic Convolutional Filters | Jul 5, 2019 | Vision and Language Navigation | Code Available | 0 |
| A Navigation Framework Utilizing Vision-Language Models | Jun 11, 2025 | Navigate, Prompt Engineering | Code Available | 0 |
| Kefa: A Knowledge Enhanced and Fine-grained Aligned Speaker for Navigation Instruction Generation | Jul 25, 2023 | Vision and Language Navigation | Code Available | 0 |
| Tactical Rewind: Self-Correction via Backtracking in Vision-and-Language Navigation | Mar 6, 2019 | Vision and Language Navigation, Vision-Language Navigation | Code Available | 0 |
| Diagnosing Vision-and-Language Navigation: What Really Matters | Mar 30, 2021 | Diagnostic, Object | Code Available | 0 |
| Behavioral Analysis of Vision-and-Language Navigation Agents | Jul 20, 2023 | Vision and Language Navigation | Code Available | 0 |
| DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning | Apr 2, 2024 | Contrastive Learning, Decision Making | Code Available | 0 |
| Into the Unknown: Generating Geospatial Descriptions for New Environments | Jun 28, 2024 | Language Modelling, Large Language Model | Code Available | 0 |