| A Generalized Minimax Q-learning Algorithm for Two-Player Zero-Sum Stochastic Games | Jun 16, 2019 | Q-Learning | Unverified | 0 |
| Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle | Jun 14, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Variance-reduced Q-learning is minimax optimal | Jun 11, 2019 | Q-Learning | Unverified | 0 |
| Deep Reinforcement Learning with Discrete Normalized Advantage Functions for Resource Management in Network Slicing | Jun 10, 2019 | Deep Reinforcement Learning, Management | Unverified | 0 |
| "Did You Hear That?" Learning to Play Video Games from Audio Cues | Jun 10, 2019 | Game Design, Navigate | Unverified | 0 |
| Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past | Jun 10, 2019 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 |
| Escaping the State of Nature: A Hobbesian Approach to Cooperation in Multi-agent Reinforcement Learning | Jun 5, 2019 | Multi-agent Reinforcement Learning, Philosophy | Unverified | 0 |
| Exploration with Unreliable Intrinsic Reward in Multi-Agent Reinforcement Learning | Jun 5, 2019 | Multi-agent Reinforcement Learning, Q-Learning | Unverified | 0 |
| Risk-Sensitive Compact Decision Trees for Autonomous Execution in Presence of Simulated Market Response | Jun 5, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Deep Q-Learning for Directed Acyclic Graph Generation | Jun 5, 2019 | Deep Reinforcement Learning, Graph Generation | Unverified | 0 |
| On-board Deep Q-Network for UAV-assisted Online Power Transfer and Data Collection | Jun 4, 2019 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Reinforcement Learning with Low-Complexity Liquid State Machines | Jun 4, 2019 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction | Jun 3, 2019 | continuous-control, Continuous Control | Code Available | 0 |
| Feature-Based Q-Learning for Two-Player Stochastic Games | Jun 2, 2019 | Q-Learning, Vocal Bursts Valence Prediction | Unverified | 0 |
| RSS-Based Q-Learning for Indoor UAV Navigation | May 31, 2019 | Q-Learning | Unverified | 0 |
| Provably Efficient Q-Learning with Low Switching Cost | May 30, 2019 | Q-Learning | Unverified | 0 |
| Learning NP-Hard Multi-Agent Assignment Planning using GNN: Inference on a Random Graph and Provable Auction-Fitted Q-learning | May 29, 2019 | Combinatorial Optimization, Decision Making | Unverified | 0 |
| Reinforcement Learning for Slate-based Recommender Systems: A Tractable Decomposition and Practical Methodology | May 29, 2019 | Q-Learning, Recommendation Systems | Unverified | 0 |
| A General Markov Decision Process Framework for Directly Learning Optimal Control Policies | May 28, 2019 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| Solving NP-Hard Problems on Graphs with Extended AlphaGo Zero | May 28, 2019 | Combinatorial Optimization, Graph Neural Network | Code Available | 0 |
| Finite-Sample Analysis of Nonlinear Stochastic Approximation with Applications in Reinforcement Learning | May 27, 2019 | Q-Learning, reinforcement-learning | Code Available | 0 |
| SQIL: Imitation Learning via Reinforcement Learning with Sparse Rewards | May 27, 2019 | Imitation Learning, MuJoCo | Code Available | 1 |
| Prioritized Sequence Experience Replay | May 25, 2019 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| A Kernel Loss for Solving the Bellman Equation | May 25, 2019 | Q-Learning, Reinforcement Learning | Code Available | 0 |
| MQLV: Optimal Policy of Money Management in Retail Banking with Q-Learning | May 24, 2019 | Decision Making, Management | Unverified | 0 |
| Neural Temporal-Difference and Q-Learning Provably Converge to Global Optima | May 24, 2019 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Adaptive Symmetric Reward Noising for Reinforcement Learning | May 24, 2019 | Autonomous Driving, Q-Learning | Code Available | 0 |
| Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire Evacuation Environment | May 23, 2019 | OpenAI Gym, Q-Learning | Unverified | 0 |
| Stochastic Variance Reduction for Deep Q-learning | May 20, 2019 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Deep Reinforcement Learning Based Parameter Control in Differential Evolution | May 20, 2019 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Reinforcement Learning for Learning of Dynamical Systems in Uncertain Environment: a Tutorial | May 19, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |
| QBSO-FS: A Reinforcement Learning Based Bee Swarm Optimization Metaheuristic for Feature Selection | May 16, 2019 | feature selection, Multi-agent Reinforcement Learning | Code Available | 0 |
| Reinforcement Learning for Robotics and Control with Active Uncertainty Reduction | May 15, 2019 | Management, OpenAI Gym | Unverified | 0 |
| Autonomous Penetration Testing using Reinforcement Learning | May 15, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Stochastic approximation with cone-contractive operators: Sharp $\ell_\infty$-bounds for Q-learning | May 15, 2019 | Q-Learning, Reinforcement Learning | Code Available | 0 |
| Domain Adversarial Reinforcement Learning for Partial Domain Adaptation | May 10, 2019 | Domain Adaptation, Partial Domain Adaptation | Unverified | 0 |
| Design of Artificial Intelligence Agents for Games using Deep Reinforcement Learning | May 10, 2019 | Deep Reinforcement Learning, OpenAI Gym | Unverified | 0 |
| Pretrain Soft Q-Learning with Imperfect Demonstrations | May 9, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |
| A Reinforcement Learning Perspective on the Optimal Control of Mutation Probabilities for the (1+1) Evolutionary Algorithm: First Results on the OneMax Problem | May 9, 2019 | Evolutionary Algorithms, Q-Learning | Unverified | 0 |
| Toward Packet Routing with Fully-distributed Multi-agent Deep Reinforcement Learning | May 9, 2019 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Accelerated Target Updates for Q-learning | May 7, 2019 | Atari Games, Q-Learning | Unverified | 0 |
| Comprehensible Context-driven Text Game Playing | May 6, 2019 | Q-Learning | Code Available | 0 |
| Deep Ordinal Reinforcement Learning | May 6, 2019 | Deep Reinforcement Learning, OpenAI Gym | Code Available | 0 |
| Efficient Model-free Reinforcement Learning in Metric Spaces | May 1, 2019 | Q-Learning, reinforcement-learning | Code Available | 0 |
| Learning agents with prioritization and parameter noise in continuous state and action space | May 1, 2019 | Autonomous Vehicles, Q-Learning | Unverified | 0 |
| Two-Timescale Networks for Nonlinear Value Function Approximation | May 1, 2019 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| Soft Q-Learning with Mutual-Information Regularization | May 1, 2019 | Decision Making, Q-Learning | Unverified | 0 |
| A Deep Q-Learning Method for Downlink Power Allocation in Multi-Cell Networks | Apr 30, 2019 | Benchmarking, Deep Reinforcement Learning | Unverified | 0 |
| Zap Q-Learning for Optimal Stopping Time Problems | Apr 25, 2019 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| Target-Based Temporal Difference Learning | Apr 24, 2019 | Q-Learning, Reinforcement Learning | Unverified | 0 |