| Quantum Observables for continuous control of the Quantum Approximate Optimization Algorithm via Reinforcement Learning | Nov 21, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Efficient Drone Mobility Support Using Reinforcement Learning | Nov 21, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Asymptotics of Reinforcement Learning with Neural Networks | Nov 13, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Modelling Bahdanau Attention using Election methods aided by Q-Learning | Nov 10, 2019 | DecoderMachine Translation | —Unverified | 0 |
| Two-stage WECC Composite Load Modeling: A Double Deep Q-Learning Networks Approach | Nov 8, 2019 | Q-Learning | —Unverified | 0 |
| Challenging On Car Racing Problem from OpenAI gym | Nov 2, 2019 | Car Racingcontinuous-control | —Unverified | 0 |
| On Solving the 2-Dimensional Greedy Shooter Problem for UAVs | Nov 2, 2019 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Generalized Speedy Q-learning | Nov 1, 2019 | Q-LearningReinforcement Learning | CodeCode Available | 0 |
| Model-Free Mean-Field Reinforcement Learning: Mean-Field MDP and Mean-Field Q-Learning | Oct 28, 2019 | General Reinforcement LearningQ-Learning | —Unverified | 0 |
| Biomimetic Ultra-Broadband Perfect Absorbers Optimised with Reinforcement Learning | Oct 28, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning | Oct 27, 2019 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 |
| ZPD Teaching Strategies for Deep Reinforcement Learning from Demonstrations | Oct 26, 2019 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| D-Point Trigonometric Path Planning based on Q-Learning in Uncertain Environments | Oct 26, 2019 | PositionQ-Learning | —Unverified | 0 |
| Deep Q-Learning for Same-Day Delivery with Vehicles and Drones | Oct 25, 2019 | Decision MakingQ-Learning | —Unverified | 0 |
| Momentum-based Accelerated Q-learning | Oct 23, 2019 | Atari GamesQ-Learning | CodeCode Available | 0 |
| Partially Detected Intelligent Traffic Signal Control: Environmental Adaptation | Oct 23, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Policy Learning for Malaria Control | Oct 20, 2019 | Bayesian OptimizationDecision Making | CodeCode Available | 0 |
| Reverse Experience Replay | Oct 19, 2019 | Q-Learning | —Unverified | 0 |
| Automatic Data Augmentation by Learning the Deterministic Policy | Oct 18, 2019 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 0 |
| Adaptive Discretization for Episodic Reinforcement Learning in Metric Spaces | Oct 17, 2019 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference | Oct 15, 2019 | Q-LearningReinforcement Learning | CodeCode Available | 0 |
| On the Reduction of Variance and Overestimation of Deep Q-Learning | Oct 14, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Zap Q-Learning With Nonlinear Function Approximation | Oct 11, 2019 | OpenAI GymQ-Learning | —Unverified | 0 |
| Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments | Oct 9, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A Dual-Hormone Closed-Loop Delivery System for Type 1 Diabetes Using Deep Reinforcement Learning | Oct 9, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Tactical Reward Shaping: Bypassing Reinforcement Learning with Strategy-Based Goals | Oct 8, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Toward Synergic Learning for Autonomous Manipulation of Deformable Tissues via Surgical Robots: An Approximate Q-Learning Approach | Oct 8, 2019 | Q-Learning | —Unverified | 0 |
| Combining No-regret and Q-learning | Oct 7, 2019 | counterfactualQ-Learning | CodeCode Available | 0 |
| Reinforcement Learning with Structured Hierarchical Grammar Representations of Actions | Oct 7, 2019 | Atari GamesQ-Learning | —Unverified | 0 |
| I'm sorry Dave, I'm afraid I can't do that, Deep Q-learning from forbidden action | Oct 4, 2019 | Industrial RobotsQ-Learning | —Unverified | 0 |
| Benchmarking Batch Deep Reinforcement Learning Algorithms | Oct 3, 2019 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 1 |
| Fair Loss: Margin-Aware Reinforcement Learning for Deep Face Recognition | Oct 1, 2019 | Face RecognitionQ-Learning | —Unverified | 0 |
| Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping | Oct 1, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Q-learning for POMDP: An application to learning locomotion gaits | Sep 30, 2019 | Q-Learning | —Unverified | 0 |
| Composite Q-learning: Multi-scale Q-function Decomposition and Separable Optimization | Sep 30, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Meta-Q-Learning | Sep 30, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Deep Coordination Graphs | Sep 27, 2019 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| CAQL: Continuous Action Q-Learning | Sep 26, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver? | Sep 26, 2019 | Feature EngineeringQ-Learning | CodeCode Available | 1 |
| Visual Exploration and Energy-aware Path Planning via Reinforcement Learning | Sep 26, 2019 | Autonomous Vehiclesobject-detection | CodeCode Available | 0 |
| QXplore: Q-Learning Exploration by Maximizing Temporal Difference Error | Sep 25, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Off-policy Multi-step Q-learning | Sep 25, 2019 | Q-Learning | —Unverified | 0 |
| Modeling Fake News in Social Networks with Deep Multi-Agent Reinforcement Learning | Sep 25, 2019 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Long-term planning, short-term adjustments | Sep 25, 2019 | Deep Reinforcement LearningPrediction | —Unverified | 0 |
| Striving for Simplicity in Off-Policy Deep Reinforcement Learning | Sep 25, 2019 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| CAN ALTQ LEARN FASTER: EXPERIMENTS AND THEORY | Sep 25, 2019 | Atari GamesQ-Learning | —Unverified | 0 |
| Policy Tree Network | Sep 25, 2019 | Model-based Reinforcement LearningMuJoCo | —Unverified | 0 |
| Active inference: demystified and compared | Sep 24, 2019 | Atari GamesOpenAI Gym | CodeCode Available | 0 |
| On the Convergence of Approximate and Regularized Policy Iteration Schemes | Sep 20, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Dependency-Aware Computation Offloading in Mobile Edge Computing: A Reinforcement Learning Approach | Sep 18, 2019 | Cloud ComputingEdge-computing | —Unverified | 0 |