| NAVCON: A Cognitively Inspired and Linguistically Grounded Corpus for Vision and Language Navigation | Dec 17, 2024 | Few-Shot LearningVision and Language Navigation | —Unverified | 0 |
| RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation | Dec 11, 2024 | 3D ReconstructionDiversity | —Unverified | 0 |
| World-Consistent Data Generation for Vision-and-Language Navigation | Dec 9, 2024 | Data AugmentationNavigate | —Unverified | 0 |
| NaVILA: Legged Robot Vision-Language-Action Model for Navigation | Dec 5, 2024 | NavigateVision and Language Navigation | —Unverified | 0 |
| Hijacking Vision-and-Language Navigation Agents with Adversarial Environmental Attacks | Dec 3, 2024 | Adversarial AttackVision and Language Navigation | —Unverified | 0 |
| Planning from Imagination: Episodic Simulation and Episodic Memory for Vision-and-Language Navigation | Nov 30, 2024 | NavigateVision and Language Navigation | —Unverified | 0 |
| UnitedVLN: Generalizable Gaussian Splatting for Continuous Vision-Language Navigation | Nov 25, 2024 | 3DGSNavigate | —Unverified | 0 |
| Fine-Grained Alignment in Vision-and-Language Navigation through Bayesian Optimization | Nov 22, 2024 | Bayesian OptimizationContrastive Learning | —Unverified | 0 |
| NavAgent: Multi-scale Urban Street View Fusion For UAV Embodied Vision-and-Language Navigation | Nov 13, 2024 | NavigateVision and Language Navigation | —Unverified | 0 |
| Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning | Oct 11, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Zero-Shot Vision-and-Language Navigation with Collision Mitigation in Continuous Environment | Oct 7, 2024 | Large Language ModelVision and Language Navigation | —Unverified | 0 |
| Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs | Sep 27, 2024 | Decision MakingNavigate | —Unverified | 0 |
| MiniVLN: Efficient Vision-and-Language Navigation by Progressive Knowledge Distillation | Sep 27, 2024 | Knowledge DistillationVision and Language Navigation | —Unverified | 0 |
| Seeing is Believing? Enhancing Vision-Language Navigation using Visual Perturbations | Sep 9, 2024 | Autonomous NavigationDiversity | —Unverified | 0 |
| Spatially-Aware Speaker for Vision-and-Language Navigation Instruction Generation | Sep 9, 2024 | Vision and Language Navigation | CodeCode Available | 0 |
| Narrowing the Gap between Vision and Action in Navigation | Aug 19, 2024 | DecoderSpatial Reasoning | CodeCode Available | 0 |
| Loc4Plan: Locating Before Planning for Outdoor Vision and Language Navigation | Aug 9, 2024 | NavigatePosition | —Unverified | 0 |
| Into the Unknown: Generating Geospatial Descriptions for New Environments | Jun 28, 2024 | Language ModellingLarge Language Model | CodeCode Available | 0 |
| Contrast Sets for Evaluating Language-Guided Robot Policies | Jun 19, 2024 | Vision and Language Navigation | —Unverified | 0 |
| I2EDL: Interactive Instruction Error Detection and Localization | Jun 7, 2024 | Vision and Language Navigation | —Unverified | 0 |
| Augmented Commonsense Knowledge for Remote Object Grounding | Jun 3, 2024 | Decision MakingObject | CodeCode Available | 0 |
| Vision-and-Language Navigation Generative Pretrained Transformer | May 27, 2024 | DecoderImitation Learning | —Unverified | 0 |
| MC-GPT: Empowering Vision-and-Language Navigation with Memory Map and Reasoning Chains | May 17, 2024 | DiversityNavigate | —Unverified | 0 |
| AIGeN: An Adversarial Approach for Instruction Generation in VLN | Apr 15, 2024 | DecoderVision and Language Navigation | —Unverified | 0 |
| DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning | Apr 2, 2024 | Contrastive LearningDecision Making | CodeCode Available | 0 |