| Diagnosing Vision-and-Language Navigation: What Really Matters | Dec 17, 2021 | DiagnosticObject | —Unverified | 0 |
| Landmark-RxR: Solving Vision-and-Language Navigation with Fine-Grained Alignment Supervision | Dec 1, 2021 | cross-modal alignmentNavigate | CodeCode Available | 1 |
| Explore the Potential Performance of Vision-and-Language Navigation Model: a Snapshot Ensemble Method | Nov 28, 2021 | Vision and Language Navigation | —Unverified | 0 |
| Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions | Nov 16, 2021 | Vision and Language Navigation | —Unverified | 0 |
| Explicit Object Relation Alignment for Vision and Language Navigation | Nov 16, 2021 | Instruction FollowingRelation | —Unverified | 0 |
| Curriculum Learning for Vision-and-Language Navigation | Nov 14, 2021 | Vision and Language Navigation | —Unverified | 0 |
| Multimodal Transformer with Variable-length Memory for Vision-and-Language Navigation | Nov 10, 2021 | DecoderNavigate | CodeCode Available | 1 |
| SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language Navigation | Oct 27, 2021 | ObjectScene Classification | —Unverified | 0 |
| History Aware Multimodal Transformer for Vision-and-Language Navigation | Oct 25, 2021 | Decision MakingNavigate | CodeCode Available | 1 |
| Rethinking the Spatial Route Prior in Vision-and-Language Navigation | Oct 12, 2021 | NavigateVision and Language Navigation | —Unverified | 0 |
| Language-Aligned Waypoint (LAW) Supervision for Vision-and-Language Navigation in Continuous Environments | Sep 30, 2021 | Vision and Language Navigation | —Unverified | 0 |
| SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments | Aug 26, 2021 | Vision and Language Navigation | CodeCode Available | 1 |
| Airbert: In-domain Pretraining for Vision-and-Language Navigation | Aug 20, 2021 | NavigateReferring Expression | CodeCode Available | 1 |
| Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation | Jul 23, 2021 | Vision and Language NavigationVision-Language Navigation | CodeCode Available | 1 |
| Neighbor-view Enhanced Model for Vision and Language Navigation | Jul 15, 2021 | NavigateVision and Language Navigation | CodeCode Available | 1 |
| How Much Can CLIP Benefit Vision-and-Language Tasks? | Jul 13, 2021 | Question AnsweringVision and Language Navigation | CodeCode Available | 1 |
| VLN BERT: A Recurrent Vision-and-Language BERT for Navigation | Jun 19, 2021 | Decision MakingDecoder | —Unverified | 0 |
| VISITRON: Visual Semantics-Aligned Interactively Trained Object-Navigator | May 25, 2021 | Binary ClassificationImitation Learning | CodeCode Available | 0 |
| Pathdreamer: A World Model for Indoor Navigation | May 18, 2021 | modelSemantic Segmentation | CodeCode Available | 1 |
| Episodic Transformer for Vision-and-Language Navigation | May 13, 2021 | Vision and Language Navigation | CodeCode Available | 1 |
| The Road to Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation | Apr 9, 2021 | Vision and Language NavigationVision-Language Navigation | CodeCode Available | 1 |
| Diagnosing Vision-and-Language Navigation: What Really Matters | Mar 30, 2021 | DiagnosticObject | CodeCode Available | 0 |
| CrossMap Transformer: A Crossmodal Masked Path Transformer Using Double Back-Translation for Vision-and-Language Navigation | Mar 1, 2021 | TranslationVision and Language Navigation | —Unverified | 0 |
| On the Evaluation of Vision-and-Language Navigation Instructions | Jan 26, 2021 | Vision and Language Navigation | —Unverified | 0 |
| Visual Perception Generalization for Vision-and-Language Navigation via Meta-Learning | Dec 10, 2020 | Meta-LearningNavigate | —Unverified | 0 |
| Topological Planning with Transformers for Vision-and-Language Navigation | Dec 9, 2020 | Vision and Language Navigation | —Unverified | 0 |
| Counterfactual Vision-and-Language Navigation: Unravelling the Unseen | Dec 1, 2020 | counterfactualEmbodied Question Answering | —Unverified | 0 |
| A Recurrent Vision-and-Language BERT for Navigation | Nov 26, 2020 | Decision MakingDecoder | CodeCode Available | 1 |
| Language-guided Navigation via Cross-Modal Grounding and Alternate Adversarial Learning | Nov 22, 2020 | Imitation LearningNavigate | —Unverified | 0 |
| ArraMon: A Joint Navigation-Assembly Instruction Interpretation Task in Dynamic Environments | Nov 15, 2020 | Referring ExpressionReferring Expression Comprehension | —Unverified | 0 |
| Sim-to-Real Transfer for Vision-and-Language Navigation | Nov 7, 2020 | Vision and Language Navigation | CodeCode Available | 1 |
| Retouchdown: Releasing Touchdown on StreetLearn as a Public Resource for Language Grounding Tasks in Street View | Nov 1, 2020 | Vision and Language Navigation | —Unverified | 0 |
| Language and Visual Entity Relationship Graph for Agent Navigation | Oct 19, 2020 | Dynamic Time WarpingNavigate | CodeCode Available | 1 |
| Room-Across-Room: Multilingual Vision-and-Language Navigation with Dense Spatiotemporal Grounding | Oct 15, 2020 | Vision and Language Navigation | CodeCode Available | 1 |
| Learning to Stop: A Simple yet Effective Approach to Urban Vision-Language Navigation | Sep 28, 2020 | NavigateVision and Language Navigation | —Unverified | 0 |
| Generative Language-Grounded Policy in Vision-and-Language Navigation with Bayes' Rule | Sep 16, 2020 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Counterfactual Vision-and-Language Navigation via Adversarial Path Sampler | Aug 1, 2020 | counterfactualCounterfactual Reasoning | —Unverified | 0 |
| Object-and-Action Aware Model for Visual Language Navigation | Jul 29, 2020 | ObjectVision and Language Navigation | —Unverified | 0 |
| Soft Expert Reward Learning for Vision-and-Language Navigation | Jul 21, 2020 | Reinforcement Learning (RL)Vision and Language Navigation | —Unverified | 0 |
| Evolving Graphical Planner: Contextual Global Planning for Vision-and-Language Navigation | Jul 11, 2020 | Decision MakingImitation Learning | —Unverified | 0 |
| Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation | Jul 1, 2020 | Style TransferText Style Transfer | CodeCode Available | 1 |
| Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments – Extended Abstract | Jun 12, 2020 | Vision and Language Navigation | —Unverified | 0 |
| Extended Abstract: Improving Vision-and-Language Navigation with Image-Text Pairs from the Web | Jun 12, 2020 | Vision and Language Navigation | —Unverified | 0 |
| BabyWalk: Going Farther in Vision-and-Language Navigation by Taking Baby Steps | May 10, 2020 | Imitation LearningNavigate | CodeCode Available | 1 |
| Diagnosing the Environment Bias in Vision-and-Language Navigation | May 6, 2020 | Vision and Language Navigation | CodeCode Available | 1 |
| Improving Vision-and-Language Navigation with Image-Text Pairs from the Web | Apr 30, 2020 | Vision and Language Navigation | CodeCode Available | 1 |
| Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments | Apr 6, 2020 | Vision and Language Navigation | CodeCode Available | 1 |
| Sub-Instruction Aware Vision-and-Language Navigation | Apr 6, 2020 | NavigateVision and Language Navigation | CodeCode Available | 1 |
| Take the Scenic Route: Improving Generalization in Vision-and-Language Navigation | Mar 31, 2020 | Vision and Language Navigation | —Unverified | 0 |
| Multi-View Learning for Vision-and-Language Navigation | Mar 2, 2020 | MULTI-VIEW LEARNINGNavigate | —Unverified | 0 |