| Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation | Feb 23, 2022 | Efficient ExplorationNavigate | CodeCode Available | 2 |
| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Dec 8, 2021 | Abstract AlgebraAnachronisms | CodeCode Available | 2 |
| Panoptic nuScenes: A Large-Scale Benchmark for LiDAR Panoptic Segmentation and Tracking | Sep 8, 2021 | BenchmarkingDiversity | CodeCode Available | 2 |
| The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind | Jun 25, 2025 | Multi-agent Reinforcement LearningNavigate | CodeCode Available | 1 |
| SEMNAV: A Semantic Segmentation-Driven Approach to Visual Semantic Navigation | Jun 2, 2025 | Domain AdaptationNavigate | CodeCode Available | 1 |
| Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation | May 27, 2025 | Large Language ModelLogical Reasoning | CodeCode Available | 1 |
| Large Language Models for Planning: A Comprehensive and Systematic Survey | May 26, 2025 | Logical ReasoningNavigate | CodeCode Available | 1 |
| Neural Brain: A Neuroscience-inspired Framework for Embodied Agents | May 12, 2025 | Navigate | CodeCode Available | 1 |
| CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory | May 8, 2025 | Large Language ModelNavigate | CodeCode Available | 1 |
| Future-Oriented Navigation: Dynamic Obstacle Avoidance with One-Shot Energy-Based Multimodal Motion Prediction | May 1, 2025 | Model Predictive ControlMotion Planning | CodeCode Available | 1 |