| Simple but Effective: CLIP Embeddings for Embodied AI | Nov 18, 2021 | Image ManipulationNavigate | CodeCode Available | 1 |
| Multimodal Transformer with Variable-length Memory for Vision-and-Language Navigation | Nov 10, 2021 | DecoderNavigate | CodeCode Available | 1 |
| History Aware Multimodal Transformer for Vision-and-Language Navigation | Oct 25, 2021 | Decision MakingNavigate | CodeCode Available | 1 |
| No RL, No Simulation: Learning to Navigate without Navigating | Oct 18, 2021 | NavigateReinforcement Learning (RL) | CodeCode Available | 1 |
| SGoLAM: Simultaneous Goal Localization and Mapping for Multi-Object Goal Navigation | Oct 14, 2021 | NavigateVisual Navigation | CodeCode Available | 1 |
| AMRA*: Anytime Multi-Resolution Multi-Heuristic A* | Oct 11, 2021 | Heuristic SearchMotion Planning | CodeCode Available | 1 |
| Enhancing Navigational Safety in Crowded Environments using Semantic-Deep-Reinforcement-Learning-based Navigation | Sep 23, 2021 | Deep Reinforcement LearningNavigate | CodeCode Available | 1 |
| DPMPC-Planner: A real-time UAV trajectory planning framework for complex static environments with dynamic obstacles | Sep 14, 2021 | Model Predictive ControlNavigate | CodeCode Available | 1 |
| Learning to Navigate Intersections with Unsupervised Driver Trait Inference | Sep 14, 2021 | Autonomous NavigationAutonomous Vehicles | CodeCode Available | 1 |
| TPH-YOLOv5: Improved YOLOv5 Based on Transformer Prediction Head for Object Detection on Drone-captured Scenarios | Aug 26, 2021 | Data AugmentationNavigate | CodeCode Available | 1 |
| Airbert: In-domain Pretraining for Vision-and-Language Navigation | Aug 20, 2021 | NavigateReferring Expression | CodeCode Available | 1 |
| Trans4Trans: Efficient Transformer for Transparent Object and Semantic Scene Segmentation in Real-World Navigation Assistance | Aug 20, 2021 | DecoderGPU | CodeCode Available | 1 |
| Embodied BERT: A Transformer Model for Embodied, Language-guided Visual Task Completion | Aug 10, 2021 | NavigateObject | CodeCode Available | 1 |
| Towards real-world navigation with deep differentiable planners | Aug 8, 2021 | Imitation LearningMotion Planning | CodeCode Available | 1 |
| Versatile modular neural locomotion control with fast learning | Jul 16, 2021 | Navigate | CodeCode Available | 1 |
| Neighbor-view Enhanced Model for Vision and Language Navigation | Jul 15, 2021 | NavigateVision and Language Navigation | CodeCode Available | 1 |
| Graphhopper: Multi-Hop Scene Graph Reasoning for Visual Question Answering | Jul 13, 2021 | NavigateQuestion Answering | CodeCode Available | 1 |
| Trans4Trans: Efficient Transformer for Transparent Object Segmentation to Help Visually Impaired People Navigate in the Real World | Jul 7, 2021 | DecoderNavigate | CodeCode Available | 1 |
| Collaborative Visual Navigation | Jul 2, 2021 | Multi-agent Reinforcement LearningNavigate | CodeCode Available | 1 |
| Room-and-Object Aware Knowledge Reasoning for Remote Embodied Referring Expression | Jun 19, 2021 | Instruction FollowingNavigate | CodeCode Available | 1 |
| Vision-Language Navigation with Random Environmental Mixup | Jun 15, 2021 | Data AugmentationNavigate | CodeCode Available | 1 |
| A Semi-Personalized System for User Cold Start Recommendation on Music Streaming Apps | Jun 7, 2021 | ClusteringNavigate | CodeCode Available | 1 |
| Towards mental time travel: a hierarchical memory for reinforcement learning agents | May 28, 2021 | Meta-LearningNavigate | CodeCode Available | 1 |
| Goal Misgeneralization in Deep Reinforcement Learning | May 28, 2021 | Deep Reinforcement LearningNavigate | CodeCode Available | 1 |
| Pushing it out of the Way: Interactive Visual Navigation | Apr 28, 2021 | NavigateVisual Navigation | CodeCode Available | 1 |