| Hamilton-Jacobi-Bellman Equations for Q-Learning in Continuous Time | Dec 23, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Soft Q Network | Dec 20, 2019 | Q-Learning | Unverified | 0 |
| Sepsis World Model: A MIMIC-based OpenAI Gym "World Model" Simulator for Sepsis Treatment | Dec 15, 2019 | model, OpenAI Gym | Unverified | 0 |
| High dimensional precision medicine from patient-derived xenografts | Dec 13, 2019 | Q-Learning, Vocal Bursts Intensity Prediction | Unverified | 0 |
| Provably Efficient Reinforcement Learning with Aggregated States | Dec 13, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |
| A Finite-Time Analysis of Q-Learning with Neural Network Function Approximation | Dec 10, 2019 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Value-of-Information based Arbitration between Model-based and Model-free Control | Dec 8, 2019 | Computational Efficiency, model | Unverified | 0 |
| Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery | Dec 7, 2019 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 0 |
| Reinforcement Learning with Non-Markovian Rewards | Dec 5, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Combining Q-Learning and Search with Amortized Value Estimates | Dec 5, 2019 | Q-Learning | Unverified | 0 |
| A Unified Switching System Perspective and O.D.E. Analysis of Q-Learning Algorithms | Dec 4, 2019 | Q-Learning | Unverified | 0 |
| Learning to Dynamically Coordinate Multi-Robot Teams in Graph Attention Networks | Dec 4, 2019 | Combinatorial Optimization, Graph Attention | Unverified | 0 |
| Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning | Dec 3, 2019 | Multi-agent Reinforcement Learning, Q-Learning | Unverified | 0 |
| Modelling the Dynamics of Multiagent Q-Learning in Repeated Symmetric Games: a Mean Field Theoretic Approach | Dec 1, 2019 | Q-Learning | Unverified | 0 |
| Propagating Uncertainty in Reinforcement Learning via Wasserstein Barycenters | Dec 1, 2019 | Atari Games, Q-Learning | Code Available | 0 |
| Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle | Dec 1, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Privacy-Preserving Q-Learning with Functional Noise in Continuous Spaces | Dec 1, 2019 | Privacy Preserving, Q-Learning | Code Available | 0 |
| Neural Temporal-Difference Learning Converges to Global Optima | Dec 1, 2019 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Quadratic Q-network for Learning Continuous Control for Autonomous Vehicles | Nov 29, 2019 | Autonomous Driving, Autonomous Vehicles | Unverified | 0 |
| QMR:Q-learning based Multi-objective optimization Routing protocol for Flying Ad Hoc Networks | Nov 27, 2019 | Q-Learning | Code Available | 0 |
| Control-Tutored Reinforcement Learning: an application to the Herding Problem | Nov 26, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Join Query Optimization with Deep Reinforcement Learning Algorithms | Nov 26, 2019 | Attribute, Deep Reinforcement Learning | Code Available | 0 |
| Adaptive Modulation and Coding based on Reinforcement Learning for 5G Networks | Nov 25, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |
| A Deep Reinforcement Learning Architecture for Multi-stage Optimal Control | Nov 25, 2019 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Mitigate Bias in Face Recognition using Skewness-Aware Reinforcement Learning | Nov 25, 2019 | Face Recognition, Fairness | Unverified | 0 |
| Which Channel to Ask My Question? Personalized Customer Service Request Stream Routing using Deep Reinforcement Learning | Nov 24, 2019 | Chatbot, Deep Reinforcement Learning | Unverified | 0 |
| Efficient Drone Mobility Support Using Reinforcement Learning | Nov 21, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Quantum Observables for continuous control of the Quantum Approximate Optimization Algorithm via Reinforcement Learning | Nov 21, 2019 | continuous-control, Continuous Control | Unverified | 0 |
| Asymptotics of Reinforcement Learning with Neural Networks | Nov 13, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Modelling Bahdanau Attention using Election methods aided by Q-Learning | Nov 10, 2019 | Decoder, Machine Translation | Unverified | 0 |
| Two-stage WECC Composite Load Modeling: A Double Deep Q-Learning Networks Approach | Nov 8, 2019 | Q-Learning | Unverified | 0 |
| Challenging On Car Racing Problem from OpenAI gym | Nov 2, 2019 | Car Racing, continuous-control | Unverified | 0 |
| On Solving the 2-Dimensional Greedy Shooter Problem for UAVs | Nov 2, 2019 | Q-Learning, reinforcement-learning | Code Available | 0 |
| Generalized Speedy Q-learning | Nov 1, 2019 | Q-Learning, Reinforcement Learning | Code Available | 0 |
| Model-Free Mean-Field Reinforcement Learning: Mean-Field MDP and Mean-Field Q-Learning | Oct 28, 2019 | General Reinforcement Learning, Q-Learning | Unverified | 0 |
| Biomimetic Ultra-Broadband Perfect Absorbers Optimised with Reinforcement Learning | Oct 28, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |
| BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning | Oct 27, 2019 | Deep Reinforcement Learning, Imitation Learning | Code Available | 0 |
| D-Point Trigonometric Path Planning based on Q-Learning in Uncertain Environments | Oct 26, 2019 | Position, Q-Learning | Unverified | 0 |
| ZPD Teaching Strategies for Deep Reinforcement Learning from Demonstrations | Oct 26, 2019 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Deep Q-Learning for Same-Day Delivery with Vehicles and Drones | Oct 25, 2019 | Decision Making, Q-Learning | Unverified | 0 |
| Momentum-based Accelerated Q-learning | Oct 23, 2019 | Atari Games, Q-Learning | Code Available | 0 |
| Partially Detected Intelligent Traffic Signal Control: Environmental Adaptation | Oct 23, 2019 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| Policy Learning for Malaria Control | Oct 20, 2019 | Bayesian Optimization, Decision Making | Code Available | 0 |
| Reverse Experience Replay | Oct 19, 2019 | Q-Learning | Unverified | 0 |
| Automatic Data Augmentation by Learning the Deterministic Policy | Oct 18, 2019 | Data Augmentation, Deep Reinforcement Learning | Code Available | 0 |
| Adaptive Discretization for Episodic Reinforcement Learning in Metric Spaces | Oct 17, 2019 | Q-Learning, reinforcement-learning | Code Available | 0 |
| SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference | Oct 15, 2019 | Q-Learning, Reinforcement Learning | Code Available | 0 |
| On the Reduction of Variance and Overestimation of Deep Q-Learning | Oct 14, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Zap Q-Learning With Nonlinear Function Approximation | Oct 11, 2019 | OpenAI Gym, Q-Learning | Unverified | 0 |
| Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments | Oct 9, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |