| Diagnosing Vision-and-Language Navigation: What Really Matters | Dec 17, 2021 | DiagnosticObject | —Unverified | 0 |
| Landmark-RxR: Solving Vision-and-Language Navigation with Fine-Grained Alignment Supervision | Dec 1, 2021 | cross-modal alignmentNavigate | CodeCode Available | 1 |
| Explore the Potential Performance of Vision-and-Language Navigation Model: a Snapshot Ensemble Method | Nov 28, 2021 | Vision and Language Navigation | —Unverified | 0 |
| Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions | Nov 16, 2021 | Vision and Language Navigation | —Unverified | 0 |
| Explicit Object Relation Alignment for Vision and Language Navigation | Nov 16, 2021 | Instruction FollowingRelation | —Unverified | 0 |
| Curriculum Learning for Vision-and-Language Navigation | Nov 14, 2021 | Vision and Language Navigation | —Unverified | 0 |
| Multimodal Transformer with Variable-length Memory for Vision-and-Language Navigation | Nov 10, 2021 | DecoderNavigate | CodeCode Available | 1 |
| SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language Navigation | Oct 27, 2021 | ObjectScene Classification | —Unverified | 0 |
| History Aware Multimodal Transformer for Vision-and-Language Navigation | Oct 25, 2021 | Decision MakingNavigate | CodeCode Available | 1 |
| Rethinking the Spatial Route Prior in Vision-and-Language Navigation | Oct 12, 2021 | NavigateVision and Language Navigation | —Unverified | 0 |