| Navigate through Enigmatic Labyrinth A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future | Sep 27, 2023 | Navigate | CodeCode Available | 2 |
| VidChapters-7M: Video Chapters at Scale | Sep 25, 2023 | Dense Video CaptioningNavigate | CodeCode Available | 2 |
| LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent | Sep 21, 2023 | 3D visual groundingLanguage Modeling | CodeCode Available | 2 |
| AerialVLN: Vision-and-Language Navigation for UAVs | Aug 13, 2023 | cross-modal alignmentNavigate | CodeCode Available | 2 |
| WizMap: Scalable Interactive Visualization for Exploring Large Machine Learning Embeddings | Jun 15, 2023 | Navigate | CodeCode Available | 2 |
| Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory | May 25, 2023 | Common Sense ReasoningCPU | CodeCode Available | 2 |
| ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments | Apr 6, 2023 | Autonomous NavigationNavigate | CodeCode Available | 2 |
| A vision-based autonomous UAV inspection framework for unknown tunnel construction sites with dynamic obstacles | Jan 20, 2023 | Navigate | CodeCode Available | 2 |
| Melting Pot 2.0 | Nov 24, 2022 | Artificial LifeNavigate | CodeCode Available | 2 |
| MuGER^2: Multi-Granularity Evidence Retrieval and Reasoning for Hybrid Question Answering | Oct 19, 2022 | NavigateQuestion Answering | CodeCode Available | 2 |
| Vision-aided UAV navigation and dynamic obstacle avoidance using gradient-based B-spline trajectory optimization | Sep 15, 2022 | Navigate | CodeCode Available | 2 |
| WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents | Jul 4, 2022 | Decision MakingImitation Learning | CodeCode Available | 2 |
| DayDreamer: World Models for Physical Robot Learning | Jun 28, 2022 | Deep Reinforcement LearningNavigate | CodeCode Available | 2 |
| VectorMapNet: End-to-end Vectorized HD Map Learning | Jun 17, 2022 | 3D Lane DetectionAutonomous Driving | CodeCode Available | 2 |
| Receding Moving Object Segmentation in 3D LiDAR Data Using Sparse 4D Convolutions | Jun 8, 2022 | Autonomous VehiclesNavigate | CodeCode Available | 2 |
| Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation | Feb 23, 2022 | Efficient ExplorationNavigate | CodeCode Available | 2 |
| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Dec 8, 2021 | Abstract AlgebraAnachronisms | CodeCode Available | 2 |
| Panoptic nuScenes: A Large-Scale Benchmark for LiDAR Panoptic Segmentation and Tracking | Sep 8, 2021 | BenchmarkingDiversity | CodeCode Available | 2 |
| The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind | Jun 25, 2025 | Multi-agent Reinforcement LearningNavigate | CodeCode Available | 1 |
| SEMNAV: A Semantic Segmentation-Driven Approach to Visual Semantic Navigation | Jun 2, 2025 | Domain AdaptationNavigate | CodeCode Available | 1 |
| Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation | May 27, 2025 | Large Language ModelLogical Reasoning | CodeCode Available | 1 |
| Large Language Models for Planning: A Comprehensive and Systematic Survey | May 26, 2025 | Logical ReasoningNavigate | CodeCode Available | 1 |
| Neural Brain: A Neuroscience-inspired Framework for Embodied Agents | May 12, 2025 | Navigate | CodeCode Available | 1 |
| CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory | May 8, 2025 | Large Language ModelNavigate | CodeCode Available | 1 |
| Future-Oriented Navigation: Dynamic Obstacle Avoidance with One-Shot Energy-Based Multimodal Motion Prediction | May 1, 2025 | Model Predictive ControlMotion Planning | CodeCode Available | 1 |