| GridMM: Grid Memory Map for Vision-and-Language Navigation | Jul 24, 2023 | Navigate, Vision and Language Navigation | Code Available | 1 | 5 |
| AIRCHITECT v2: Learning the Hardware Accelerator Design Space through Unified Representations | Jan 17, 2025 | Contrastive Learning, Navigate | Code Available | 1 | 5 |
| Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language Navigation | Mar 30, 2022 | counterfactual, Data Augmentation | Code Available | 1 | 5 |
| Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment | Feb 29, 2024 | Navigate | Code Available | 1 | 5 |
| Context-Aware Entity Grounding with Open-Vocabulary 3D Scene Graphs | Sep 27, 2023 | Form, Navigate | Code Available | 1 | 5 |
| DeblurGAN-v2: Deblurring (Orders-of-Magnitude) Faster and Better | Aug 10, 2019 | Blind Face Restoration, Deblurring | Code Available | 1 | 5 |
| Hierarchical Generative Adversarial Imitation Learning with Mid-level Input Generation for Autonomous Driving on Urban Environments | Feb 9, 2023 | Autonomous Driving, Autonomous Navigation | Code Available | 1 | 5 |
| Human POSEitioning System (HPS): 3D Human Pose Estimation and Self-localization in Large Scenes from Body-Mounted Sensors | Mar 31, 2021 | 3D Human Pose Estimation, 3D Pose Estimation | Code Available | 1 | 5 |
| AI-IMU Dead-Reckoning | Apr 12, 2019 | Dead-Reckoning Prediction, Navigate | Code Available | 1 | 5 |
| AidUI: Toward Automated Recognition of Dark Patterns in User Interfaces | Mar 12, 2023 | Navigate | Code Available | 1 | 5 |
| From Shadows to Safety: Occlusion Tracking and Risk Mitigation for Urban Autonomous Driving | Apr 2, 2025 | Autonomous Driving, Autonomous Vehicles | Code Available | 1 | 5 |
| Continual Multimodal Knowledge Graph Construction | May 15, 2023 | Continual Learning, graph construction | Code Available | 1 | 5 |
| CoNav: A Benchmark for Human-Centered Collaborative Navigation | Jun 4, 2024 | Navigate | Code Available | 1 | 5 |
| Airbert: In-domain Pretraining for Vision-and-Language Navigation | Aug 20, 2021 | Navigate, Referring Expression | Code Available | 1 | 5 |
| FuseDream: Training-Free Text-to-Image Generation with Improved CLIP+GAN Space Optimization | Dec 2, 2021 | counterfactual, Image Generation | Code Available | 1 | 5 |
| Future-Oriented Navigation: Dynamic Obstacle Avoidance with One-Shot Energy-Based Multimodal Motion Prediction | May 1, 2025 | Model Predictive Control, Motion Planning | Code Available | 1 | 5 |
| Go for a Walk and Arrive at the Answer: Reasoning Over Paths in Knowledge Bases using Reinforcement Learning | Nov 15, 2017 | Navigate, Reinforcement Learning | Code Available | 1 | 5 |
| Graphhopper: Multi-Hop Scene Graph Reasoning for Visual Question Answering | Jul 13, 2021 | Navigate, Question Answering | Code Available | 1 | 5 |
| AI2-THOR: An Interactive 3D Environment for Visual AI | Dec 14, 2017 | Deep Reinforcement Learning, Imitation Learning | Code Available | 1 | 5 |
| Collaborative Visual Navigation | Jul 2, 2021 | Multi-agent Reinforcement Learning, Navigate | Code Available | 1 | 5 |
| Crowd-Robot Interaction: Crowd-aware Robot Navigation with Attention-based Deep Reinforcement Learning | Sep 24, 2018 | Deep Reinforcement Learning, Human Dynamics | Code Available | 1 | 5 |
| CUAHN-VIO: Content-and-Uncertainty-Aware Homography Network for Visual-Inertial Odometry | Aug 30, 2022 | Motion Estimation, Navigate | Code Available | 1 | 5 |
| FootstepNet: an Efficient Actor-Critic Method for Fast On-line Bipedal Footstep Planning and Forecasting | Mar 19, 2024 | Deep Reinforcement Learning, Navigate | Code Available | 1 | 5 |
| CuriousLLM: Elevating Multi-Document QA with Reasoning-Infused Knowledge Graph Prompting | Apr 13, 2024 | Hallucination, Knowledge Graphs | Code Available | 1 | 5 |
| Adaptive Risk-Tendency: Nano Drone Navigation in Cluttered Environments with Distributional Reinforcement Learning | Mar 28, 2022 | Distributional Reinforcement Learning, Drone navigation | Code Available | 1 | 5 |
| From Commands to Prompts: LLM-based Semantic File System for AIOS | Sep 23, 2024 | Management, Navigate | Code Available | 1 | 5 |
| Digital Twin-Enhanced Wireless Indoor Navigation: Achieving Efficient Environment Sensing with Zero-Shot Reinforcement Learning | Jun 11, 2023 | Navigate, reinforcement-learning | Code Available | 1 | 5 |
| Extracting a Knowledge Base of Mechanisms from COVID-19 Papers | Oct 8, 2020 | Navigate | Code Available | 1 | 5 |
| Class-Level Code Generation from Natural Language Using Iterative, Tool-Enhanced Reasoning over Repository | Apr 22, 2024 | Class-level Code Generation, Code Generation | Code Available | 1 | 5 |
| FIERY: Future Instance Prediction in Bird's-Eye View from Surround Monocular Cameras | Apr 21, 2021 | Autonomous Driving, Bird's-Eye View Semantic Segmentation | Code Available | 1 | 5 |
| Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed Environments | Jul 31, 2024 | graph construction, Navigate | Code Available | 1 | 5 |
| Exploring Gradient-based Multi-directional Controls in GANs | Sep 1, 2022 | Attribute, Disentanglement | Code Available | 1 | 5 |
| CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory | May 8, 2025 | Large Language Model, Navigate | Code Available | 1 | 5 |
| DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames | Nov 1, 2019 | Autonomous Navigation, GPU | Code Available | 1 | 5 |
| AgentSense: Benchmarking Social Intelligence of Language Agents through Interactive Scenarios | Oct 25, 2024 | Benchmarking, Diversity | Code Available | 1 | 5 |
| 3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation | Aug 31, 2023 | Navigate, Referring Expression | Code Available | 1 | 5 |
| CFGPT: Chinese Financial Assistant with Large Language Model | Sep 19, 2023 | Decision Making, Financial Analysis | Code Available | 1 | 5 |
| Exploring Empty Spaces: Human-in-the-Loop Data Augmentation | Oct 1, 2024 | Data Augmentation, Diversity | Code Available | 1 | 5 |
| Fleet of Agents: Coordinated Problem Solving with Large Language Models | May 7, 2024 | Navigate | Code Available | 1 | 5 |
| IGDrivSim: A Benchmark for the Imitation Gap in Autonomous Driving | Nov 7, 2024 | Autonomous Driving, Autonomous Vehicles | Code Available | 1 | 5 |
| Can GPT-4 Perform Neural Architecture Search? | Apr 21, 2023 | Navigate, Neural Architecture Search | Code Available | 1 | 5 |
| A General Model for Aggregating Annotations Across Simple, Complex, and Multi-Object Annotation Tasks | Dec 20, 2023 | Model Selection, Navigate | Code Available | 1 | 5 |
| ONCE: Boosting Content-based Recommendation with Both Open- and Closed-source Large Language Models | May 11, 2023 | Navigate, News Generation | Code Available | 1 | 5 |
| Can Large Language Models be Good Path Planners? A Benchmark and Investigation on Spatial-temporal Reasoning | Oct 5, 2023 | Navigate, Spatial Reasoning | Code Available | 1 | 5 |
| AEye: A Visualization Tool for Image Datasets | Aug 7, 2024 | Navigate | Code Available | 1 | 5 |
| Evaluating Language Models for Mathematics through Interactions | Jun 2, 2023 | Language Modelling, Mathematical Problem-Solving | Code Available | 1 | 5 |
| EnvEdit: Environment Editing for Vision-and-Language Navigation | Mar 29, 2022 | Data Augmentation, Diversity | Code Available | 1 | 5 |
| Evaluating Long-Term Memory in 3D Mazes | Oct 24, 2022 | Navigate, reinforcement-learning | Code Available | 1 | 5 |
| Expander Graph Propagation | Oct 6, 2022 | Graph Classification, Graph Representation Learning | Code Available | 1 | 5 |
| Aerial Vision-and-Dialog Navigation | May 24, 2022 | Navigate | Code Available | 1 | 5 |