| MAP Propagation Algorithm: Faster Learning with a Team of Reinforcement Learning Agents | Oct 15, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Targeted Attack on Deep RL-based Autonomous Driving with Learned Visual Patterns | Sep 16, 2021 | Autonomous DrivingDeep Reinforcement Learning | CodeCode Available | 0 |
| Where Did My Optimum Go?: An Empirical Analysis of Gradient Descent Optimization in Policy Gradient Methods | Oct 5, 2018 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Combining Reinforcement Learning and Optimal Transport for the Traveling Salesman Problem | Mar 2, 2022 | Combinatorial OptimizationDeep Learning | CodeCode Available | 0 |
| OmniHang: Learning to Hang Arbitrary Objects using Contact Point Correspondences and Neural Collision Estimation | Mar 26, 2021 | Deep Reinforcement LearningObject | CodeCode Available | 0 |
| Reinforcement learning for multi-item retrieval in the puzzle-based storage system | Feb 5, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| On Catastrophic Interference in Atari 2600 Games | Feb 28, 2020 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Adapting to Reward Progressivity via Spectral Reinforcement Learning | Apr 29, 2021 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Using State Predictions for Value Regularization in Curiosity Driven Deep Reinforcement Learning | Sep 30, 2018 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Reinforcement Learning for Physical Layer Communications | Jun 22, 2021 | Deep Reinforcement LearningMulti-Armed Bandits | CodeCode Available | 0 |
| SIGMA: Sheaf-Informed Geometric Multi-Agent Pathfinding | Feb 10, 2025 | Collision AvoidanceDeep Reinforcement Learning | CodeCode Available | 0 |
| Reinforcement Learning for Robot Navigation with Adaptive Forward Simulation Time (AFST) in a Semi-Markov Model | Aug 13, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning | Nov 19, 2015 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning for Multi-class Imbalanced Training | May 24, 2022 | Deep Reinforcement Learningimbalanced classification | CodeCode Available | 0 |
| Task-Agnostic Dynamics Priors for Deep Reinforcement Learning | May 13, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| An AI Chatbot for Explaining Deep Reinforcement Learning Decisions of Service-oriented Systems | Sep 25, 2023 | ChatbotDecision Making | CodeCode Available | 0 |
| Deep Reinforcement Learning for Mention-Ranking Coreference Models | Sep 27, 2016 | coreference-resolutionCoreference Resolution | CodeCode Available | 0 |
| On Improving Deep Reinforcement Learning for POMDPs | Apr 26, 2017 | Atari GamesDecision Making | CodeCode Available | 0 |
| Autonomous Management of Energy-Harvesting IoT Nodes Using Deep Reinforcement Learning | May 10, 2019 | Deep Reinforcement LearningManagement | CodeCode Available | 0 |
| Reinforcement Learning Generalization with Surprise Minimization | Apr 26, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| A Deep Reinforcement Learning Approach for Global Routing | Jun 20, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Deep reinforcement learning for irrigation scheduling using high-dimensional sensor feedback | Jan 2, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Emergent Linguistic Phenomena in Multi-Agent Communication Games | Jan 25, 2019 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Emergent Communication through Metropolis-Hastings Naming Game with Deep Generative Models | May 24, 2022 | Bayesian InferenceDeep Reinforcement Learning | CodeCode Available | 0 |
| Emergence of Numeric Concepts in Multi-Agent Autonomous Communication | Nov 4, 2019 | Deep Reinforcement LearningGrounded language learning | CodeCode Available | 0 |
| Emergence of Compositional Language with Deep Generational Transmission | Apr 19, 2019 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning for Industrial Insertion Tasks with Visual Inputs and Natural Rewards | Jun 13, 2019 | Deep Reinforcement LearningFriction | CodeCode Available | 0 |
| Autonomous Braking System via Deep Reinforcement Learning | Feb 8, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Actor-critic versus direct policy search: a comparison based on sample complexity | Jun 29, 2016 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Online Context Learning for Socially Compliant Navigation | Jun 17, 2024 | Deep Reinforcement LearningRobot Navigation | CodeCode Available | 0 |
| Reinforcement Learning of Musculoskeletal Control from Functional Simulations | Jul 13, 2020 | AnatomyDeep Reinforcement Learning | CodeCode Available | 0 |
| Emergence of Adaptive Circadian Rhythms in Deep Reinforcement Learning | Jul 22, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| SimpleDS: A Simple Deep Reinforcement Learning Dialogue System | Jan 18, 2016 | Deep Reinforcement LearningFeature Engineering | CodeCode Available | 0 |
| Efficient Symbolic Policy Learning with Differentiable Symbolic Expression | Nov 2, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Towards Real-World Efficiency: Domain Randomization in Reinforcement Learning for Pre-Capture of Free-Floating Moving Targets by Autonomous Robots | Jun 10, 2024 | Deep Reinforcement LearningNavigate | CodeCode Available | 0 |
| Combining Automated Optimisation of Hyperparameters and Reward Shape | Jun 26, 2024 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning | May 30, 2022 | Data PoisoningDeep Reinforcement Learning | CodeCode Available | 0 |
| Online Learning-based Adaptive Beam Switching for 6G Networks: Enhancing Efficiency and Resilience | May 12, 2025 | Deep Reinforcement LearningManagement | CodeCode Available | 0 |
| Efficient Parallel Methods for Deep Reinforcement Learning | May 13, 2017 | Deep Reinforcement LearningGPU | CodeCode Available | 0 |
| Reinforcement Learning to Disentangle Multiqubit Quantum States from Partial Observations | Jun 12, 2024 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 0 |
| A Deeper Look at Experience Replay | Dec 4, 2017 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Learning Efficient and Fair Policies for Uncertainty-Aware Collaborative Human-Robot Order Picking | Apr 9, 2024 | Deep Reinforcement LearningFairness | CodeCode Available | 0 |
| Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables | Mar 19, 2019 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 0 |
| Simplifying Deep Reinforcement Learning via Self-Supervision | Jun 10, 2021 | Deep Reinforcement Learningregression | CodeCode Available | 0 |
| Reinforcement Learning via Recurrent Convolutional Neural Networks | Jan 9, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Deep Reinforcement Learning for Imbalanced Classification | Jan 5, 2019 | ClassificationDecision Making | CodeCode Available | 0 |
| Automating Reinforcement Learning with Example-based Resets | Apr 5, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Combinational Q-Learning for Dou Di Zhu | Jan 24, 2019 | Atari GamesCard Games | CodeCode Available | 0 |
| Reinforcement Learning with Ensemble Model Predictive Safety Certification | Feb 6, 2024 | Deep Reinforcement Learningmodel | CodeCode Available | 0 |
| Learning Fair Policies in Multiobjective (Deep) Reinforcement Learning with Average and Discounted Rewards | Aug 18, 2020 | Deep Reinforcement LearningFairness | CodeCode Available | 0 |