| ModelicaGym: Applying Reinforcement Learning to Modelica Models | Sep 18, 2019 | Q-Learningreinforcement-learning | CodeCode Available | 1 |
| Split Deep Q-Learning for Robust Object Singulation | Sep 17, 2019 | feature selectionObject | —Unverified | 0 |
| ISL: A novel approach for deep exploration | Sep 13, 2019 | Q-LearningReinforcement Learning | CodeCode Available | 0 |
| Joint Inference of Reward Machines and Policies for Reinforcement Learning | Sep 12, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| SQLR: Short-Term Memory Q-Learning for Elastic Provisioning | Sep 12, 2019 | BlockingQ-Learning | —Unverified | 0 |
| Reinforcement Learning Models of Human Behavior: Reward Processing in Mental Disorders | Sep 11, 2019 | Decision MakingQ-Learning | —Unverified | 0 |
| Q-learning Assisted Energy-Aware Traffic Offloading and Cell Switching in Heterogeneous Networks | Sep 11, 2019 | Q-Learning | —Unverified | 0 |
| Mutual-Information Regularization in Markov Decision Processes and Actor-Critic Learning | Sep 11, 2019 | MuJoCoQ-Learning | —Unverified | 0 |
| A Deep Learning Approach to Grasping the Invisible | Sep 11, 2019 | Deep LearningQ-Learning | CodeCode Available | 0 |
| Q-Learning Based Aerial Base Station Placement for Fairness Enhancement in Mobile Networks | Sep 10, 2019 | FairnessQ-Learning | —Unverified | 0 |
| A Multistep Lyapunov Approach for Finite-Time Analysis of Biased Stochastic Approximation | Sep 10, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning | Sep 9, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Self-driving scale car trained by Deep reinforcement learning | Sep 8, 2019 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Multi Pseudo Q-learning Based Deterministic Policy Gradient for Tracking Control of Autonomous Underwater Vehicles | Sep 7, 2019 | Policy Gradient MethodsQ-Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Control of Probabilistic Boolean Networks | Sep 7, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Gradient Q(σ, λ): A Unified Algorithm with Function Approximation for Reinforcement Learning | Sep 6, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Encoders and Decoders for Quantum Expander Codes Using Machine Learning | Sep 6, 2019 | BIG-bench Machine LearningDecoder | —Unverified | 0 |
| Q-DATA: Enhanced Traffic Flow Monitoring in Software-Defined Networks applying Q-learning | Sep 4, 2019 | ManagementQ-Learning | —Unverified | 0 |
| rlpyt: A Research Code Base for Deep Reinforcement Learning in PyTorch | Sep 3, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 2 |
| Approximating two value functions instead of one: towards characterizing a new family of Deep Reinforcement Learning algorithms | Sep 1, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Solving Discounted Stochastic Two-Player Games with Near-Optimal Time and Sample Complexity | Aug 29, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| STMARL: A Spatio-Temporal Multi-Agent Reinforcement Learning Approach for Cooperative Traffic Light Control | Aug 28, 2019 | Graph Neural NetworkManagement | —Unverified | 0 |
| Networked Control of Nonlinear Systems under Partial Observation Using Continuous Deep Q-Learning | Aug 28, 2019 | Q-Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Foreign Exchange Trading | Aug 21, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Performing Deep Recurrent Double Q-Learning for Atari Games | Aug 16, 2019 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Learn How to Cook a New Recipe in a New House: Using Map Familiarization, Curriculum Learning, and Bandit Feedback to Learn Families of Text-Based Adventure Games | Aug 13, 2019 | Common Sense ReasoningQ-Learning | CodeCode Available | 0 |
| Large-Scale Traffic Signal Control Using a Novel Multi-Agent Reinforcement Learning | Aug 10, 2019 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Batch Recurrent Q-Learning for Backchannel Generation Towards Engaging Agents | Aug 6, 2019 | Imitation LearningQ-Learning | —Unverified | 0 |
| Control of nonlinear, complex and black-boxed greenhouse system with reinforcement learning | Jul 30, 2019 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Q-MIND: Defeating Stealthy DoS Attacks in SDN with a Machine-learning based Defense Framework | Jul 27, 2019 | Anomaly DetectionBIG-bench Machine Learning | —Unverified | 0 |
| Towards Model-based Reinforcement Learning for Industry-near Environments | Jul 27, 2019 | Deep Reinforcement LearningModel-based Reinforcement Learning | CodeCode Available | 0 |
| Potential-Based Advice for Stochastic Policy Learning | Jul 20, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Photonic architecture for reinforcement learning | Jul 17, 2019 | Active LearningQ-Learning | —Unverified | 0 |
| Model-free Control of Chaos with Continuous Deep Q-learning | Jul 16, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| An Optimistic Perspective on Offline Reinforcement Learning | Jul 10, 2019 | Atari GamesDiversity | CodeCode Available | 1 |
| An intelligent financial portfolio trading strategy using deep Q-learning | Jul 8, 2019 | Q-Learning | CodeCode Available | 0 |
| Q-learning pour la r\'esolution des anaphores pronominales en langue arabe (Q-learning for pronominal anaphora resolution in Arabic texts) | Jul 1, 2019 | Q-Learning | —Unverified | 0 |
| Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog | Jun 30, 2019 | Deep Reinforcement LearningOpen-Domain Dialog | —Unverified | 0 |
| QFlip: An Adaptive Reinforcement Learning Strategy for the FlipIt Security Game | Jun 27, 2019 | OpenAI GymQ-Learning | CodeCode Available | 0 |
| Q-Learning Inspired Self-Tuning for Energy Efficiency in HPC | Jun 26, 2019 | Q-Learning | —Unverified | 0 |
| Towards Empathic Deep Q-Learning | Jun 26, 2019 | EthicsQ-Learning | CodeCode Available | 0 |
| Deceptive Reinforcement Learning Under Adversarial Manipulations on Cost Signals | Jun 24, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Optimal Use of Experience in First Person Shooter Environments | Jun 24, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| In Hindsight: A Smooth Reward for Steady Exploration | Jun 24, 2019 | Atari GamesQ-Learning | —Unverified | 0 |
| Neural networks with motivation | Jun 23, 2019 | Hierarchical Reinforcement LearningNavigate | —Unverified | 0 |
| Reinforcement Learning-Based Trajectory Design for the Aerial Base Stations | Jun 23, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A Story of Two Streams: Reinforcement Learning Models from Human Behavior and Neuropsychiatry | Jun 21, 2019 | Decision MakingLifelong learning | CodeCode Available | 1 |
| Split Q Learning: Reinforcement Learning with Two-Stream Rewards | Jun 21, 2019 | Decision MakingQ-Learning | CodeCode Available | 1 |
| Cache-Aided NOMA Mobile Edge Computing: A Reinforcement Learning Approach | Jun 20, 2019 | Edge-computingQ-Learning | —Unverified | 0 |
| Reward Prediction Error as an Exploration Objective in Deep RL | Jun 19, 2019 | Atari GamesContinuous Control | —Unverified | 0 |