| Efficient Object Detection in Large Images using Deep Reinforcement Learning | Dec 9, 2019 | Deep Reinforcement Learning, object-detection | Code Available | 0 |
| Towards Robust Deep Reinforcement Learning for Traffic Signal Control: Demand Surges, Incidents and Sensor Failures | Apr 17, 2019 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 0 |
| Reinforcement Learning with Low-Complexity Liquid State Machines | Jun 4, 2019 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Combining imitation and deep reinforcement learning to accomplish human-level performance on a virtual foraging task | Mar 11, 2022 | Deep Reinforcement Learning, Imitation Learning | Code Available | 0 |
| Reinforcement Learning with Perturbed Rewards | Oct 2, 2018 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Efficient Model-Based Deep Reinforcement Learning with Variational State Tabulation | Feb 12, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| ZPD Teaching Strategies for Deep Reinforcement Learning from Demonstrations | Oct 26, 2019 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Learning Goal-Oriented Visual Dialog via Tempered Policy Gradient | Jul 2, 2018 | Deep Reinforcement Learning, Policy Gradient Methods | Code Available | 0 |
| Efficient Collaborative Multi-Agent Deep Reinforcement Learning for Large-Scale Fleet Management | Feb 18, 2018 | Deep Reinforcement Learning, Management | Code Available | 0 |
| Learning Heuristics for Quantified Boolean Formulas through Deep Reinforcement Learning | Jul 20, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Reinforcement Learning with Quantum Variational Circuits | Aug 15, 2020 | BIG-bench Machine Learning, Deep Reinforcement Learning | Code Available | 0 |
| AdaStop: adaptive statistical testing for sound comparisons of Deep RL agents | Jun 19, 2023 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| Efficient Information Diffusion in Time-Varying Graphs through Deep Reinforcement Learning | Nov 27, 2020 | Deep Reinforcement Learning, Graph Embedding | Code Available | 0 |
| Learning Heuristics over Large Graphs via Deep Reinforcement Learning | Mar 8, 2019 | Combinatorial Optimization, Deep Reinforcement Learning | Code Available | 0 |
| Learning Hierarchical Control for Robust In-Hand Manipulation | Oct 24, 2019 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 0 |
| Reinforcement Learning with Unsupervised Auxiliary Tasks | Nov 16, 2016 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Learning how to Active Learn: A Deep Reinforcement Learning Approach | Aug 8, 2017 | Active Learning, Deep Reinforcement Learning | Code Available | 0 |
| Deep Reinforcement Learning for General Video Game AI | Jun 6, 2018 | Atari Games, Benchmarking | Code Available | 0 |
| Sim-to-Real Reinforcement Learning for Deformable Object Manipulation | Jun 20, 2018 | Deep Reinforcement Learning, Deformable Object Manipulation | Code Available | 0 |
| Learning human behaviors from motion capture by adversarial imitation | Jul 7, 2017 | Deep Reinforcement Learning, Imitation Learning | Code Available | 0 |
| Learning Humanoid Robot Running Skills through Proximal Policy Optimization | Oct 22, 2019 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 0 |
| On-Policy Trust Region Policy Optimisation with Replay Buffers | Jan 18, 2019 | Continuous Control, Deep Reinforcement Learning | Code Available | 0 |
| Reconciling λ-Returns with Experience Replay | Oct 23, 2018 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Learning in Cooperative Multiagent Systems Using Cognitive and Machine Models | Aug 18, 2023 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Reinforcement Replaces Supervision: Query focused Summarization using Deep Reinforcement Learning | Nov 29, 2023 | Deep Reinforcement Learning, Long Form Question Answering | Code Available | 0 |
| Deep reinforcement learning for feedback control in a collective flashing ratchet | Nov 20, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Task-Oriented Language Grounding for Language Input with Multiple Sub-Goals of Non-Linear Order | Oct 27, 2019 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| On Robust Reinforcement Learning with Lipschitz-Bounded Policy Networks | May 19, 2024 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| ReLAX: Reinforcement Learning Agent eXplainer for Arbitrary Predictive Models | Oct 22, 2021 | counterfactual, Decision Making | Code Available | 0 |
| Efficient Deep Reinforcement Learning with Predictive Processing Proximal Policy Optimization | Nov 11, 2022 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Relational Deep Reinforcement Learning | Jun 5, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| A Centralised Soft Actor Critic Deep Reinforcement Learning Approach to District Demand Side Management through CityLearn | Sep 22, 2020 | Deep Reinforcement Learning, Management | Code Available | 0 |
| Learning Local Search Heuristics for Boolean Satisfiability | Dec 1, 2019 | Deep Reinforcement Learning, Graph Neural Network | Code Available | 0 |
| On the consistency of hyper-parameter selection in value-based deep reinforcement learning | Jun 25, 2024 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Learning Low-Frequency Motion Control for Robust and Dynamic Robot Locomotion | Sep 29, 2022 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 0 |
| Relational Graph Learning for Crowd Navigation | Sep 28, 2019 | Deep Reinforcement Learning, Graph Learning | Code Available | 0 |
| Learning Manipulation Tasks in Dynamic and Shared 3D Spaces | Apr 26, 2024 | Deep Reinforcement Learning | Code Available | 0 |
| Efficient Deep Reinforcement Learning via Adaptive Policy Transfer | Feb 19, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Automatic Data Augmentation by Learning the Deterministic Policy | Oct 18, 2019 | Data Augmentation, Deep Reinforcement Learning | Code Available | 0 |
| Colored Noise in PPO: Improved Exploration and Performance through Correlated Action Sampling | Dec 18, 2023 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Task Relabelling for Multi-task Transfer using Successor Features | May 20, 2022 | Deep Reinforcement Learning | Code Available | 0 |
| Learning Model-Free Robust Precoding for Cooperative Multibeam Satellite Communications | Mar 13, 2023 | Deep Reinforcement Learning | Code Available | 0 |
| Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning | Oct 5, 2022 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Adaptive trajectory-constrained exploration strategy for deep reinforcement learning | Dec 27, 2023 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| Deep Reinforcement Learning for Event-Driven Multi-Agent Decision Processes | Sep 19, 2017 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Efficient and Scalable Deep Reinforcement Learning for Mean Field Control Games | Dec 28, 2024 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 0 |
| Colonoscopy Navigation using End-to-End Deep Visuomotor Control: A User Study | Jun 30, 2022 | Deep Reinforcement Learning | Code Available | 0 |
| Learning Multi-Objective Curricula for Robotic Policy Learning | Oct 6, 2021 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 0 |
| ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages | Jun 2, 2023 | Bayesian Inference, continuous-control | Code Available | 0 |
| Remember and Forget for Experience Replay | Jul 16, 2018 | Deep Reinforcement Learning, Policy Gradient Methods | Code Available | 0 |