| A Dual-Hormone Closed-Loop Delivery System for Type 1 Diabetes Using Deep Reinforcement Learning | Oct 9, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Toward Synergic Learning for Autonomous Manipulation of Deformable Tissues via Surgical Robots: An Approximate Q-Learning Approach | Oct 8, 2019 | Q-Learning | —Unverified | 0 |
| Tactical Reward Shaping: Bypassing Reinforcement Learning with Strategy-Based Goals | Oct 8, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Reinforcement Learning with Structured Hierarchical Grammar Representations of Actions | Oct 7, 2019 | Atari GamesQ-Learning | —Unverified | 0 |
| Combining No-regret and Q-learning | Oct 7, 2019 | counterfactualQ-Learning | CodeCode Available | 0 |
| I'm sorry Dave, I'm afraid I can't do that, Deep Q-learning from forbidden action | Oct 4, 2019 | Industrial RobotsQ-Learning | —Unverified | 0 |
| Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping | Oct 1, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Fair Loss: Margin-Aware Reinforcement Learning for Deep Face Recognition | Oct 1, 2019 | Face RecognitionQ-Learning | —Unverified | 0 |
| Composite Q-learning: Multi-scale Q-function Decomposition and Separable Optimization | Sep 30, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Meta-Q-Learning | Sep 30, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Q-learning for POMDP: An application to learning locomotion gaits | Sep 30, 2019 | Q-Learning | —Unverified | 0 |
| Deep Coordination Graphs | Sep 27, 2019 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Visual Exploration and Energy-aware Path Planning via Reinforcement Learning | Sep 26, 2019 | Autonomous Vehiclesobject-detection | CodeCode Available | 0 |
| CAQL: Continuous Action Q-Learning | Sep 26, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Long-term planning, short-term adjustments | Sep 25, 2019 | Deep Reinforcement LearningPrediction | —Unverified | 0 |
| CAN ALTQ LEARN FASTER: EXPERIMENTS AND THEORY | Sep 25, 2019 | Atari GamesQ-Learning | —Unverified | 0 |
| Policy Tree Network | Sep 25, 2019 | Model-based Reinforcement LearningMuJoCo | —Unverified | 0 |
| Striving for Simplicity in Off-Policy Deep Reinforcement Learning | Sep 25, 2019 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| QXplore: Q-Learning Exploration by Maximizing Temporal Difference Error | Sep 25, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Off-policy Multi-step Q-learning | Sep 25, 2019 | Q-Learning | —Unverified | 0 |
| Modeling Fake News in Social Networks with Deep Multi-Agent Reinforcement Learning | Sep 25, 2019 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Active inference: demystified and compared | Sep 24, 2019 | Atari GamesOpenAI Gym | CodeCode Available | 0 |
| On the Convergence of Approximate and Regularized Policy Iteration Schemes | Sep 20, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Dependency-Aware Computation Offloading in Mobile Edge Computing: A Reinforcement Learning Approach | Sep 18, 2019 | Cloud ComputingEdge-computing | —Unverified | 0 |
| Split Deep Q-Learning for Robust Object Singulation | Sep 17, 2019 | feature selectionObject | —Unverified | 0 |
| ISL: A novel approach for deep exploration | Sep 13, 2019 | Q-LearningReinforcement Learning | CodeCode Available | 0 |
| Joint Inference of Reward Machines and Policies for Reinforcement Learning | Sep 12, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| SQLR: Short-Term Memory Q-Learning for Elastic Provisioning | Sep 12, 2019 | BlockingQ-Learning | —Unverified | 0 |
| Reinforcement Learning Models of Human Behavior: Reward Processing in Mental Disorders | Sep 11, 2019 | Decision MakingQ-Learning | —Unverified | 0 |
| Q-learning Assisted Energy-Aware Traffic Offloading and Cell Switching in Heterogeneous Networks | Sep 11, 2019 | Q-Learning | —Unverified | 0 |
| A Deep Learning Approach to Grasping the Invisible | Sep 11, 2019 | Deep LearningQ-Learning | CodeCode Available | 0 |
| Mutual-Information Regularization in Markov Decision Processes and Actor-Critic Learning | Sep 11, 2019 | MuJoCoQ-Learning | —Unverified | 0 |
| A Multistep Lyapunov Approach for Finite-Time Analysis of Biased Stochastic Approximation | Sep 10, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Q-Learning Based Aerial Base Station Placement for Fairness Enhancement in Mobile Networks | Sep 10, 2019 | FairnessQ-Learning | —Unverified | 0 |
| Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning | Sep 9, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Self-driving scale car trained by Deep reinforcement learning | Sep 8, 2019 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Multi Pseudo Q-learning Based Deterministic Policy Gradient for Tracking Control of Autonomous Underwater Vehicles | Sep 7, 2019 | Policy Gradient MethodsQ-Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Control of Probabilistic Boolean Networks | Sep 7, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Encoders and Decoders for Quantum Expander Codes Using Machine Learning | Sep 6, 2019 | BIG-bench Machine LearningDecoder | —Unverified | 0 |
| Gradient Q(σ, λ): A Unified Algorithm with Function Approximation for Reinforcement Learning | Sep 6, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Q-DATA: Enhanced Traffic Flow Monitoring in Software-Defined Networks applying Q-learning | Sep 4, 2019 | ManagementQ-Learning | —Unverified | 0 |
| Approximating two value functions instead of one: towards characterizing a new family of Deep Reinforcement Learning algorithms | Sep 1, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Solving Discounted Stochastic Two-Player Games with Near-Optimal Time and Sample Complexity | Aug 29, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Networked Control of Nonlinear Systems under Partial Observation Using Continuous Deep Q-Learning | Aug 28, 2019 | Q-Learning | —Unverified | 0 |
| STMARL: A Spatio-Temporal Multi-Agent Reinforcement Learning Approach for Cooperative Traffic Light Control | Aug 28, 2019 | Graph Neural NetworkManagement | —Unverified | 0 |
| Deep Reinforcement Learning for Foreign Exchange Trading | Aug 21, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Performing Deep Recurrent Double Q-Learning for Atari Games | Aug 16, 2019 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Learn How to Cook a New Recipe in a New House: Using Map Familiarization, Curriculum Learning, and Bandit Feedback to Learn Families of Text-Based Adventure Games | Aug 13, 2019 | Common Sense ReasoningQ-Learning | CodeCode Available | 0 |
| Large-Scale Traffic Signal Control Using a Novel Multi-Agent Reinforcement Learning | Aug 10, 2019 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Batch Recurrent Q-Learning for Backchannel Generation Towards Engaging Agents | Aug 6, 2019 | Imitation LearningQ-Learning | —Unverified | 0 |