| Behavioral Analysis of Vision-and-Language Navigation Agents | Jul 20, 2023 | Vision and Language Navigation | CodeCode Available | 0 |
| VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View | Jul 12, 2023 | Decision MakingNatural Language Understanding | CodeCode Available | 1 |
| CorNav: Autonomous Agent with Self-Corrected Planning for Zero-Shot Vision-and-Language Navigation | Jun 17, 2023 | Decision MakingInstruction Following | —Unverified | 0 |
| PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation | May 30, 2023 | Image OutpaintingLanguage Modelling | —Unverified | 0 |
| GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot Attention for Vision-and-Language Navigation | May 26, 2023 | Vision and Language Navigation | CodeCode Available | 0 |
| NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models | May 26, 2023 | Instruction FollowingVision and Language Navigation | CodeCode Available | 2 |
| Masked Path Modeling for Vision-and-Language Navigation | May 23, 2023 | Action GenerationNavigate | —Unverified | 0 |
| PASTS: Progress-Aware Spatio-Temporal Transformer Speaker For Vision-and-Language Navigation | May 19, 2023 | Data AugmentationVision and Language Navigation | —Unverified | 0 |
| A Dual Semantic-Aware Recurrent Global-Adaptive Network For Vision-and-Language Navigation | May 5, 2023 | Vision and Language Navigation | CodeCode Available | 1 |
| Improving Vision-and-Language Navigation by Generating Future-View Image Semantics | Apr 11, 2023 | Image GenerationNavigate | —Unverified | 0 |