| Aerial Vision-and-Dialog Navigation | May 24, 2022 | Navigate | CodeCode Available | 1 |
| LeNSE: Learning To Navigate Subgraph Embeddings for Large-Scale Combinatorial Optimisation | May 20, 2022 | Navigate | CodeCode Available | 1 |
| VesNet-RL: Simulation-based Reinforcement Learning for Real-World US Probe Navigation | May 10, 2022 | DiagnosticNavigate | CodeCode Available | 1 |
| Reinforced Structured State-Evolution for Vision-Language Navigation | Apr 20, 2022 | NavigateVision and Language Navigation | CodeCode Available | 1 |
| Learning Forward Dynamics Model and Informed Trajectory Sampler for Safe Quadruped Navigation | Apr 19, 2022 | Autonomous NavigationNavigate | CodeCode Available | 1 |
| MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration | Apr 17, 2022 | NavigateRetrieval | CodeCode Available | 1 |
| Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language Navigation | Mar 30, 2022 | counterfactualData Augmentation | CodeCode Available | 1 |
| EnvEdit: Environment Editing for Vision-and-Language Navigation | Mar 29, 2022 | Data AugmentationDiversity | CodeCode Available | 1 |
| Adaptive Risk-Tendency: Nano Drone Navigation in Cluttered Environments with Distributional Reinforcement Learning | Mar 28, 2022 | Distributional Reinforcement LearningDrone navigation | CodeCode Available | 1 |
| Possibility Before Utility: Learning And Using Hierarchical Affordances | Mar 23, 2022 | Hierarchical Reinforcement LearningNavigate | CodeCode Available | 1 |
| NavDreams: Towards Camera-Only RL Navigation Among Humans | Mar 23, 2022 | Atari GamesNavigate | CodeCode Available | 1 |
| WayFAST: Navigation with Predictive Traversability in the Field | Mar 22, 2022 | Navigate | CodeCode Available | 1 |
| Pedestrian Stop and Go Forecasting with Hybrid Feature Fusion | Mar 4, 2022 | Autonomous Drivingmotion prediction | CodeCode Available | 1 |
| DialFRED: Dialogue-Enabled Agents for Embodied Instruction Following | Feb 27, 2022 | Instruction FollowingNavigate | CodeCode Available | 1 |
| Sound Adversarial Audio-Visual Navigation | Feb 22, 2022 | NavigateVisual Navigation | CodeCode Available | 1 |
| GPT-based Open-Ended Knowledge Tracing | Feb 21, 2022 | Code GenerationKnowledge Tracing | CodeCode Available | 1 |
| Navigating Local Minima in Quantized Spiking Neural Networks | Feb 15, 2022 | Navigate | CodeCode Available | 1 |
| PONI: Potential Functions for ObjectGoal Navigation with Interaction-free Learning | Jan 25, 2022 | NavigateObjectGoal Navigation | CodeCode Available | 1 |
| SQUIRE: A Sequence-to-sequence Framework for Multi-hop Knowledge Graph Reasoning | Jan 17, 2022 | DecoderNavigate | CodeCode Available | 1 |
| WebGPT: Browser-assisted question-answering with human feedback | Dec 17, 2021 | Imitation LearningNavigate | CodeCode Available | 1 |
| Do Pedestrians Pay Attention? Eye Contact Detection in the Wild | Dec 8, 2021 | Autonomous VehiclesContact Detection | CodeCode Available | 1 |
| FuseDream: Training-Free Text-to-Image Generation with Improved CLIP+GAN Space Optimization | Dec 2, 2021 | counterfactualImage Generation | CodeCode Available | 1 |
| Landmark-RxR: Solving Vision-and-Language Navigation with Fine-Grained Alignment Supervision | Dec 1, 2021 | cross-modal alignmentNavigate | CodeCode Available | 1 |
| Learning to automate cryo-electron microscopy data collection with Ptolemy | Dec 1, 2021 | Cryogenic Electron Microscopy (cryo-EM)Navigate | CodeCode Available | 1 |
| Catch Me If You Hear Me: Audio-Visual Navigation in Complex Unmapped Environments with Moving Sounds | Nov 29, 2021 | NavigateVisual Navigation | CodeCode Available | 1 |
| Simple but Effective: CLIP Embeddings for Embodied AI | Nov 18, 2021 | Image ManipulationNavigate | CodeCode Available | 1 |
| Multimodal Transformer with Variable-length Memory for Vision-and-Language Navigation | Nov 10, 2021 | DecoderNavigate | CodeCode Available | 1 |
| History Aware Multimodal Transformer for Vision-and-Language Navigation | Oct 25, 2021 | Decision MakingNavigate | CodeCode Available | 1 |
| No RL, No Simulation: Learning to Navigate without Navigating | Oct 18, 2021 | NavigateReinforcement Learning (RL) | CodeCode Available | 1 |
| SGoLAM: Simultaneous Goal Localization and Mapping for Multi-Object Goal Navigation | Oct 14, 2021 | NavigateVisual Navigation | CodeCode Available | 1 |
| AMRA*: Anytime Multi-Resolution Multi-Heuristic A* | Oct 11, 2021 | Heuristic SearchMotion Planning | CodeCode Available | 1 |
| Enhancing Navigational Safety in Crowded Environments using Semantic-Deep-Reinforcement-Learning-based Navigation | Sep 23, 2021 | Deep Reinforcement LearningNavigate | CodeCode Available | 1 |
| DPMPC-Planner: A real-time UAV trajectory planning framework for complex static environments with dynamic obstacles | Sep 14, 2021 | Model Predictive ControlNavigate | CodeCode Available | 1 |
| Learning to Navigate Intersections with Unsupervised Driver Trait Inference | Sep 14, 2021 | Autonomous NavigationAutonomous Vehicles | CodeCode Available | 1 |
| TPH-YOLOv5: Improved YOLOv5 Based on Transformer Prediction Head for Object Detection on Drone-captured Scenarios | Aug 26, 2021 | Data AugmentationNavigate | CodeCode Available | 1 |
| Airbert: In-domain Pretraining for Vision-and-Language Navigation | Aug 20, 2021 | NavigateReferring Expression | CodeCode Available | 1 |
| Trans4Trans: Efficient Transformer for Transparent Object and Semantic Scene Segmentation in Real-World Navigation Assistance | Aug 20, 2021 | DecoderGPU | CodeCode Available | 1 |
| Embodied BERT: A Transformer Model for Embodied, Language-guided Visual Task Completion | Aug 10, 2021 | NavigateObject | CodeCode Available | 1 |
| Towards real-world navigation with deep differentiable planners | Aug 8, 2021 | Imitation LearningMotion Planning | CodeCode Available | 1 |
| Versatile modular neural locomotion control with fast learning | Jul 16, 2021 | Navigate | CodeCode Available | 1 |
| Neighbor-view Enhanced Model for Vision and Language Navigation | Jul 15, 2021 | NavigateVision and Language Navigation | CodeCode Available | 1 |
| Graphhopper: Multi-Hop Scene Graph Reasoning for Visual Question Answering | Jul 13, 2021 | NavigateQuestion Answering | CodeCode Available | 1 |
| Trans4Trans: Efficient Transformer for Transparent Object Segmentation to Help Visually Impaired People Navigate in the Real World | Jul 7, 2021 | DecoderNavigate | CodeCode Available | 1 |
| Collaborative Visual Navigation | Jul 2, 2021 | Multi-agent Reinforcement LearningNavigate | CodeCode Available | 1 |
| Room-and-Object Aware Knowledge Reasoning for Remote Embodied Referring Expression | Jun 19, 2021 | Instruction FollowingNavigate | CodeCode Available | 1 |
| Vision-Language Navigation with Random Environmental Mixup | Jun 15, 2021 | Data AugmentationNavigate | CodeCode Available | 1 |
| A Semi-Personalized System for User Cold Start Recommendation on Music Streaming Apps | Jun 7, 2021 | ClusteringNavigate | CodeCode Available | 1 |
| Towards mental time travel: a hierarchical memory for reinforcement learning agents | May 28, 2021 | Meta-LearningNavigate | CodeCode Available | 1 |
| Goal Misgeneralization in Deep Reinforcement Learning | May 28, 2021 | Deep Reinforcement LearningNavigate | CodeCode Available | 1 |
| Pushing it out of the Way: Interactive Visual Navigation | Apr 28, 2021 | NavigateVisual Navigation | CodeCode Available | 1 |