| Multi-hop Reading Comprehension via Deep Reinforcement Learning based Document Traversal | May 23, 2019 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| In a Nutshell, the Human Asked for This: Latent Goals for Following Temporal Specifications | Oct 18, 2021 | Deep Reinforcement Learning | CodeCode Available | 0 |
| XAI-N: Sensor-based Robot Navigation using Expert Policies and Decision Trees | Apr 22, 2021 | Deep Reinforcement LearningExplainable Artificial Intelligence (XAI) | CodeCode Available | 0 |
| Failures Are Fated, But Can Be Faded: Characterizing and Mitigating Unwanted Behaviors in Large-Scale Vision and Language Models | Jun 11, 2024 | Deep Reinforcement Learning | CodeCode Available | 0 |
| A Critical Investigation of Deep Reinforcement Learning for Navigation | Feb 7, 2018 | Deep Reinforcement LearningNavigate | CodeCode Available | 0 |
| Incorporating Graph Attention Mechanism into Geometric Problem Solving Based on Deep Reinforcement Learning | Mar 14, 2024 | Deep Reinforcement LearningGraph Attention | CodeCode Available | 0 |
| SUPERVISED POLICY UPDATE | May 1, 2019 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Supervised Policy Update for Deep Reinforcement Learning | May 29, 2018 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Increasing performance of electric vehicles in ride-hailing services using deep reinforcement learning | Dec 7, 2019 | Autonomous VehiclesDecision Making | CodeCode Available | 0 |
| R^3: On-device Real-Time Deep Reinforcement Learning for Autonomous Robotics | Aug 29, 2023 | Autonomous VehiclesDeep Reinforcement Learning | CodeCode Available | 0 |
| Balancing Value Underestimation and Overestimation with Realistic Actor-Critic | Oct 19, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game | Sep 19, 2018 | AI AgentDecision Making | CodeCode Available | 0 |
| Racing Control Variable Genetic Programming for Symbolic Regression | Sep 13, 2023 | Deep Reinforcement Learningregression | CodeCode Available | 0 |
| Suphx: Mastering Mahjong with Deep Reinforcement Learning | Mar 30, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| ScrofaZero: Mastering Trick-taking Poker Game Gongzhu by Deep Reinforcement Learning | Feb 15, 2021 | Bayesian InferenceDeep Reinforcement Learning | CodeCode Available | 0 |
| RadDQN: a Deep Q Learning-based Architecture for Finding Time-efficient Minimum Radiation Exposure Pathway | Feb 1, 2024 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning Framework for Thoracic Diseases Classification via Prior Knowledge Guidance | Jun 2, 2023 | Deep Reinforcement LearningDiagnostic | CodeCode Available | 0 |
| Conversational Recommender System | Jun 8, 2018 | Conversational RecommendationDeep Reinforcement Learning | CodeCode Available | 0 |
| An Exploration of Deep Learning Methods in Hungry Geese | Sep 5, 2021 | Deep LearningDeep Reinforcement Learning | CodeCode Available | 0 |
| Influence-aware Memory Architectures for Deep Reinforcement Learning | Nov 18, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Influencing Reinforcement Learning through Natural Language Guidance | Apr 4, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Randomized Prior Functions for Deep Reinforcement Learning | Jun 8, 2018 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Information-Directed Exploration for Deep Reinforcement Learning | Dec 18, 2018 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Information-Driven Adaptive Sensing Based on Deep Reinforcement Learning | Oct 8, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Adaptive and Robust DBSCAN with Multi-agent Reinforcement Learning | May 7, 2025 | ClusteringDeep Reinforcement Learning | CodeCode Available | 0 |
| Multi-Objective Deep Reinforcement Learning | Oct 9, 2016 | Deep Reinforcement LearningMulti-Objective Reinforcement Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning framework for Autonomous Driving | Apr 8, 2017 | Atari GamesAutonomous Driving | CodeCode Available | 0 |
| BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning | Oct 27, 2019 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 |
| Random Projection in Neural Episodic Control | Apr 3, 2019 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Inspector: Pixel-Based Automated Game Testing via Exploration, Detection, and Investigation | Jul 18, 2022 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 |
| Instance based Generalization in Reinforcement Learning | Nov 2, 2020 | Deep Reinforcement LearningGeneralization Bounds | CodeCode Available | 0 |
| Ranking for Relevance and Display Preferences in Complex Presentation Layouts | May 7, 2018 | Deep Reinforcement LearningLearning-To-Rank | CodeCode Available | 0 |
| Instance Selection for Dynamic Algorithm Configuration with Reinforcement Learning: Improving Generalization | Jul 18, 2024 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| Multi-objective Pointer Network for Combinatorial Optimization | Apr 25, 2022 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 0 |
| A DRL solution to help reduce the cost in waiting time of securing a traffic light for cyclists | Nov 23, 2023 | Deep Reinforcement Learning | CodeCode Available | 0 |
| Multi Objective Prioritized Workflow Scheduling Using Deep Reinforcement Based Learning in Cloud Computing | Jan 8, 2024 | Cloud ComputingDeep Reinforcement Learning | CodeCode Available | 0 |
| Surprising Negative Results for Generative Adversarial Tree Search | Jun 15, 2018 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations | Apr 12, 2019 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 |
| Extracting Diagnosis Pathways from Electronic Health Records Using Deep Reinforcement Learning | May 10, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Adaptive Action Duration with Contextual Bandits for Deep Reinforcement Learning in Dynamic Environments | Jun 17, 2025 | Atari GamesBoard Games | CodeCode Available | 0 |
| Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods | Feb 28, 2018 | Deep Reinforcement LearningDiversity | CodeCode Available | 0 |
| ABCP: Automatic Block-wise and Channel-wise Network Pruning via Joint Search | Oct 8, 2021 | Deep Reinforcement LearningNetwork Pruning | CodeCode Available | 0 |
| Rate-Splitting for Intelligent Reflecting Surface-Aided Multiuser VR Streaming | Oct 21, 2022 | Continuous ControlDeep Reinforcement Learning | CodeCode Available | 0 |
| Multi-Pass Q-Networks for Deep Reinforcement Learning with Parameterised Action Spaces | May 10, 2019 | Control with Prametrised ActionsDeep Reinforcement Learning | CodeCode Available | 0 |
| ADESSE: Advice Explanations in Complex Repeated Decision-Making Environments | May 31, 2024 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Exploring Unknown States with Action Balance | Mar 10, 2020 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Towards Disturbance-Free Visual Mobile Manipulation | Dec 17, 2021 | Collision AvoidanceDeep Reinforcement Learning | CodeCode Available | 0 |
| Verifiably Robust Conformal Prediction | May 29, 2024 | Conformal PredictionDeep Reinforcement Learning | CodeCode Available | 0 |
| Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning | Jun 8, 2016 | Deep Reinforcement Learningdialog state tracking | CodeCode Available | 0 |
| AACHER: Assorted Actor-Critic Deep Reinforcement Learning with Hindsight Experience Replay | Oct 24, 2022 | Deep Reinforcement LearningFetchPush-v1 | CodeCode Available | 0 |