| Active Deep Q-learning with Demonstration | Dec 6, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Revisiting the Softmax Bellman Operator: New Benefits and New Perspective | Dec 2, 2018 | Atari Games, Q-Learning | Code Available | 0 |
| Non-delusional Q-learning and value-iteration | Dec 1, 2018 | Q-Learning | Unverified | 0 |
| Urban Driving with Multi-Objective Deep Reinforcement Learning | Nov 21, 2018 | Autonomous Driving, Deep Reinforcement Learning | Code Available | 0 |
| Switch-based Active Deep Dyna-Q: Efficient Adaptive Planning for Task-Completion Dialogue Policy Learning | Nov 19, 2018 | Active Learning, Q-Learning | Code Available | 0 |
| Reinforcement Learning with A* and a Deep Heuristic | Nov 19, 2018 | Q-Learning, reinforcement-learning | Code Available | 0 |
| Emergence of Addictive Behaviors in Reinforcement Learning Agents | Nov 14, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Deep Q learning for fooling neural networks | Nov 13, 2018 | Q-Learning, Reinforcement Learning | Code Available | 0 |
| Managing App Install Ad Campaigns in RTB: A Q-Learning Approach | Nov 11, 2018 | Q-Learning | Unverified | 0 |
| Deep Reinforcement Learning via L-BFGS Optimization | Nov 6, 2018 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| Deep Reinforcement Learning for Green Security Games with Real-Time Information | Nov 6, 2018 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Reinforcement Learning based Dynamic Model Selection for Short-Term Load Forecasting | Nov 5, 2018 | BIG-bench Machine Learning, Load Forecasting | Unverified | 0 |
| Double Q-PID algorithm for mobile robot control | Nov 1, 2018 | Active Learning, Q-Learning | Code Available | 0 |
| Approximate Dynamic Oracle for Dependency Parsing with Reinforcement Learning | Nov 1, 2018 | Dependency Parsing, Imitation Learning | Unverified | 0 |
| Structure Learning of Deep Neural Networks with Q-Learning | Oct 31, 2018 | image-classification, Image Classification | Unverified | 0 |
| Distributive Dynamic Spectrum Access through Deep Reinforcement Learning: A Reservoir Computing Based Approach | Oct 28, 2018 | BIG-bench Machine Learning, Deep Reinforcement Learning | Unverified | 0 |
| Multi-Agent Reinforcement Learning Based Resource Allocation for UAV Networks | Oct 24, 2018 | Multi-agent Reinforcement Learning, Q-Learning | Unverified | 0 |
| Learning Negotiating Behavior Between Cars in Intersections using Deep Q-Learning | Oct 24, 2018 | Q-Learning | Unverified | 0 |
| Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement | Oct 22, 2018 | Policy Gradient Methods, Q-Learning | Code Available | 0 |
| Optimization of Molecules via Deep Reinforcement Learning | Oct 19, 2018 | Deep Reinforcement Learning, Molecular Graph Generation | Code Available | 1 |
| Finding the best design parameters for optical nanostructures using reinforcement learning | Oct 18, 2018 | BIG-bench Machine Learning, Q-Learning | Unverified | 0 |
| Assessing the Potential of Classical Q-learning in General Game Playing | Oct 14, 2018 | Board Games, Deep Reinforcement Learning | Code Available | 0 |
| Learning to Sketch with Deep Q Networks and Demonstrated Strokes | Oct 14, 2018 | Q-Learning | Unverified | 0 |
| Learning to Reason | Oct 12, 2018 | Automated Theorem Proving, Q-Learning | Unverified | 0 |
| Reinforcement Evolutionary Learning Method for self-learning | Oct 7, 2018 | Incremental Learning, Marketing | Unverified | 0 |
| Scaling All-Goals Updates in Reinforcement Learning Using Convolutional Neural Networks | Oct 6, 2018 | All, Montezuma's Revenge | Code Available | 0 |
| Deep Quality-Value (DQV) Learning | Sep 30, 2018 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Reinforcement Learning in R | Sep 29, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Hybrid Policies Using Inverse Rewards for Reinforcement Learning | Sep 27, 2018 | OpenAI Gym, Q-Learning | Unverified | 0 |
| What Would pi* Do?: Imitation Learning via Off-Policy Reinforcement Learning | Sep 27, 2018 | Imitation Learning, Q-Learning | Unverified | 0 |
| Accelerated Value Iteration via Anderson Mixing | Sep 27, 2018 | Atari Games, Q-Learning | Unverified | 0 |
| Convergent Reinforcement Learning with Function Approximation: A Bilevel Optimization Perspective | Sep 27, 2018 | Bilevel Optimization, Q-Learning | Unverified | 0 |
| A Convergent Variant of the Boltzmann Softmax Operator in Reinforcement Learning | Sep 27, 2018 | Atari Games, Q-Learning | Unverified | 0 |
| The wisdom of the crowd: reliable deep reinforcement learning through ensembles of Q-functions | Sep 27, 2018 | Deep Reinforcement Learning, Policy Gradient Methods | Unverified | 0 |
| Learning through Probing: a decentralized reinforcement learning architecture for social dilemmas | Sep 26, 2018 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Unverified | 0 |
| Floyd-Warshall Reinforcement Learning: Learning from Past Experiences to Reach New Goals | Sep 25, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Target Transfer Q-Learning and Its Convergence Analysis | Sep 21, 2018 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| Model-Free Adaptive Optimal Control of Episodic Fixed-Horizon Manufacturing Processes using Reinforcement Learning | Sep 18, 2018 | Model Predictive Control, Q-Learning | Code Available | 0 |
| Hidden Markov Model Estimation-Based Q-learning for Partially Observable Markov Decision Process | Sep 17, 2018 | Q-Learning | Unverified | 0 |
| Optimal Matrix Momentum Stochastic Approximation and Applications to Q-learning | Sep 17, 2018 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| Deterministic Implementations for Reproducibility in Deep Reinforcement Learning | Sep 15, 2018 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Sampled Policy Gradient for Learning to Play the Game Agar.io | Sep 15, 2018 | Game Design, Q-Learning | Code Available | 0 |
| Towards Better Interpretability in Deep Q-Networks | Sep 15, 2018 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Negative Update Intervals in Deep Multi-Agent Reinforcement Learning | Sep 13, 2018 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 1 |
| Directed Exploration in PAC Model-Free Reinforcement Learning | Aug 31, 2018 | Efficient Exploration, model | Unverified | 0 |
| MARL-FWC: Optimal Coordination of Freeway Traffic Control Measures | Aug 27, 2018 | Multi-agent Reinforcement Learning, Q-Learning | Unverified | 0 |
| BlockQNN: Efficient Block-wise Neural Network Architecture Generation | Aug 16, 2018 | GPU, image-classification | Code Available | 0 |
| Automatic Derivation Of Formulas Using Reforcement Learning | Aug 15, 2018 | Q-Learning | Unverified | 0 |
| A Framework for Automated Cellular Network Tuning with Reinforcement Learning | Aug 13, 2018 | Management, Q-Learning | Code Available | 0 |
| Multi-Agent Deep Reinforcement Learning for Dynamic Power Allocation in Wireless Networks | Aug 1, 2018 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |