| Auxiliary Tasks Speed Up Learning PointGoal Navigation | Jul 9, 2020 | GPU, Navigate | Code Available | 1 | 5 |
| Accelerated Sim-to-Real Deep Reinforcement Learning: Learning Collision Avoidance from Human Player | Feb 21, 2021 | Collision Avoidance, Deep Reinforcement Learning | Code Available | 1 | 5 |
| DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames | Nov 1, 2019 | Autonomous Navigation, GPU | Code Available | 1 | 5 |
| Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization | Mar 24, 2025 | Navigate, Scheduling | Code Available | 1 | 5 |
| DroidBot-GPT: GPT-powered UI Automation for Android | Apr 14, 2023 | Navigate | Code Available | 1 | 5 |
| EndoUIC: Promptable Diffusion Transformer for Unified Illumination Correction in Capsule Endoscopy | Jun 19, 2024 | Exposure Correction, Image Enhancement | Code Available | 1 | 5 |
| Do Pedestrians Pay Attention? Eye Contact Detection in the Wild | Dec 8, 2021 | Autonomous Vehicles, Contact Detection | Code Available | 1 | 5 |
| MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration | Apr 17, 2022 | Navigate, Retrieval | Code Available | 1 | 5 |
| BioImage.IO Chatbot: A Community-Driven AI Assistant for Integrative Computational Bioimaging | Oct 23, 2023 | Chatbot, Information Retrieval | Code Available | 1 | 5 |
| DPMPC-Planner: A real-time UAV trajectory planning framework for complex static environments with dynamic obstacles | Sep 14, 2021 | Model Predictive Control, Navigate | Code Available | 1 | 5 |
| Multimodal Transformer with Variable-length Memory for Vision-and-Language Navigation | Nov 10, 2021 | Decoder, Navigate | Code Available | 1 | 5 |
| Continual Multimodal Knowledge Graph Construction | May 15, 2023 | Continual Learning, graph construction | Code Available | 1 | 5 |
| General Evaluation for Instruction Conditioned Navigation using Dynamic Time Warping | Jul 11, 2019 | Dynamic Time Warping, Navigate | Code Available | 1 | 5 |
| Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL | Sep 13, 2023 | Arithmetic Reasoning, Navigate | Code Available | 1 | 5 |
| Advances in 3D Neural Stylization: A Survey | Nov 30, 2023 | Navigate, Neural Stylization | Code Available | 1 | 5 |
| One-Shot Informed Robotic Visual Search in the Wild | Mar 22, 2020 | Navigate, Representation Learning | Code Available | 1 | 5 |
| Online Domain Adaptation for Occupancy Mapping | Jul 1, 2020 | Autonomous Driving, Domain Adaptation | Code Available | 1 | 5 |
| Embodied BERT: A Transformer Model for Embodied, Language-guided Visual Task Completion | Aug 10, 2021 | Navigate, Object | Code Available | 1 | 5 |
| Enhancing Safety in Mixed Traffic: Learning-Based Modeling and Efficient Control of Autonomous and Human-Driven Vehicles | Apr 10, 2024 | Autonomous Vehicles, Model Predictive Control | Code Available | 1 | 5 |
| Evaluating Language Models for Mathematics through Interactions | Jun 2, 2023 | Language Modelling, Mathematical Problem-Solving | Code Available | 1 | 5 |
| Context-Aware Entity Grounding with Open-Vocabulary 3D Scene Graphs | Sep 27, 2023 | Form, Navigate | Code Available | 1 | 5 |
| Entering Real Social World! Benchmarking the Social Intelligence of Large Language Models from a First-person Perspective | Oct 8, 2024 | Attribute, Benchmarking | Code Available | 1 | 5 |
| MASSW: A New Dataset and Benchmark Tasks for AI-Assisted Scientific Workflows | Jun 10, 2024 | Navigate | Code Available | 1 | 5 |
| OtoWorld: Towards Learning to Separate by Learning to Move | Jul 12, 2020 | Audio Source Separation, Navigate | Code Available | 1 | 5 |
| AutoTrans: Automating Transformer Design via Reinforced Architecture Search | Sep 4, 2020 | Natural Language Understanding, Navigate | Code Available | 1 | 5 |
| MARs: Multi-view Attention Regularizations for Patch-based Feature Recognition of Space Terrain | Oct 7, 2024 | Attribute, Metric Learning | Code Available | 1 | 5 |
| Expander Graph Propagation | Oct 6, 2022 | Graph Classification, Graph Representation Learning | Code Available | 1 | 5 |
| Evaluating Long-Term Memory in 3D Mazes | Oct 24, 2022 | Navigate, reinforcement-learning | Code Available | 1 | 5 |
| Mask4D: End-to-End Mask-Based 4D Panoptic Segmentation for LiDAR Sequences | Sep 18, 2023 | 3D Panoptic Segmentation, 4D Panoptic Segmentation | Code Available | 1 | 5 |
| CoNav: A Benchmark for Human-Centered Collaborative Navigation | Jun 4, 2024 | Navigate | Code Available | 1 | 5 |
| Map-based Modular Approach for Zero-shot Embodied Question Answering | May 26, 2024 | Embodied Question Answering, Navigate | Code Available | 1 | 5 |
| Maneuver-based Anchor Trajectory Hypotheses at Roundabouts | Apr 22, 2021 | Decoder, motion prediction | Code Available | 1 | 5 |
| Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment | Feb 29, 2024 | Navigate | Code Available | 1 | 5 |
| Can Large Language Models be Good Path Planners? A Benchmark and Investigation on Spatial-temporal Reasoning | Oct 5, 2023 | Navigate, Spatial Reasoning | Code Available | 1 | 5 |
| Manipulating Embeddings of Stable Diffusion Prompts | Aug 23, 2023 | Image Generation, Navigate | Code Available | 1 | 5 |
| Marathon Environments: Multi-Agent Continuous Control Benchmarks in a Modern Video Game Engine | Feb 25, 2019 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| MemoNav: Working Memory Model for Visual Navigation | Feb 29, 2024 | Decision Making, Graph Attention | Code Available | 1 | 5 |
| Possibility Before Utility: Learning And Using Hierarchical Affordances | Mar 23, 2022 | Hierarchical Reinforcement Learning, Navigate | Code Available | 1 | 5 |
| 3DCoMPaT200: Language-Grounded Compositional Understanding of Parts and Materials of 3D Shapes | Jan 12, 2025 | Navigate, Object | Code Available | 1 | 5 |
| Collaborative Visual Navigation | Jul 2, 2021 | Multi-agent Reinforcement Learning, Navigate | Code Available | 1 | 5 |
| Extracting a Knowledge Base of Mechanisms from COVID-19 Papers | Oct 8, 2020 | Navigate | Code Available | 1 | 5 |
| Catch Me If You Hear Me: Audio-Visual Navigation in Complex Unmapped Environments with Moving Sounds | Nov 29, 2021 | Navigate, Visual Navigation | Code Available | 1 | 5 |
| Look Further Ahead: Testing the Limits of GPT-4 in Path Planning | Jun 17, 2024 | Navigate | Code Available | 1 | 5 |
| FootstepNet: an Efficient Actor-Critic Method for Fast On-line Bipedal Footstep Planning and Forecasting | Mar 19, 2024 | Deep Reinforcement Learning, Navigate | Code Available | 1 | 5 |
| Long-term Human Motion Prediction with Scene Context | Jul 7, 2020 | Human motion prediction, motion prediction | Code Available | 1 | 5 |
| CFGPT: Chinese Financial Assistant with Large Language Model | Sep 19, 2023 | Decision Making, Financial Analysis | Code Available | 1 | 5 |
| Aerial Vision-and-Dialog Navigation | May 24, 2022 | Navigate | Code Available | 1 | 5 |
| Fleet of Agents: Coordinated Problem Solving with Large Language Models | May 7, 2024 | Navigate | Code Available | 1 | 5 |
| Machine learning as a model for cultural learning: Teaching an algorithm what it means to be fat | Mar 24, 2020 | Articles, Cultural Vocal Bursts Intensity Prediction | Code Available | 1 | 5 |
| Class-Level Code Generation from Natural Language Using Iterative, Tool-Enhanced Reasoning over Repository | Apr 22, 2024 | Class-level Code Generation, Code Generation | Code Available | 1 | 5 |