| State-Augmentation Transformations for Risk-Sensitive Reinforcement Learning | Apr 16, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| CytonRL: an Efficient Reinforcement Learning Open-source Toolkit Implemented in C++ | Apr 14, 2018 | GPU, Q-Learning | Code Available | 0 |
| Hierarchical Modular Reinforcement Learning Method and Knowledge Acquisition of State-Action Rule for Multi-target Problem | Apr 8, 2018 | Position, Q-Learning | Unverified | 0 |
| Information Maximizing Exploration with a Latent Dynamics Model | Apr 4, 2018 | continuous-control, Continuous Control | Unverified | 0 |
| Joint Learning of Interactive Spoken Content Retrieval and Trainable User Simulator | Apr 1, 2018 | Information Retrieval, Q-Learning | Unverified | 0 |
| Deep Reinforcement Learning for Traffic Light Control in Vehicular Networks | Mar 29, 2018 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Natural Gradient Deep Q-learning | Mar 20, 2018 | Deep Reinforcement Learning, Hyperparameter Optimization | Unverified | 0 |
| Composable Deep Reinforcement Learning for Robotic Manipulation | Mar 19, 2018 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Learning to Explore with Meta-Policy Gradient | Mar 13, 2018 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| Deep reinforcement learning for time series: playing idealized trading games | Mar 11, 2018 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Multi-Armed Bandits for Correlated Markovian Environments with Smoothed Reward Feedback | Mar 11, 2018 | Multi-Armed Bandits, Q-Learning | Unverified | 0 |
| SA-IGA: A Multiagent Reinforcement Learning Method Towards Socially Optimal Outcomes | Mar 8, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Smoothed Action Value Functions for Learning Gaussian Policies | Mar 6, 2018 | continuous-control, Continuous Control | Unverified | 0 |
| Q-CP: Learning Action Values for Cooperative Planning | Mar 1, 2018 | Model-based Reinforcement Learning, Q-Learning | Unverified | 0 |
| Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods | Feb 28, 2018 | Deep Reinforcement Learning, Diversity | Code Available | 0 |
| Variance Reduction Methods for Sublinear Reinforcement Learning | Feb 26, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Temporal Difference Models: Model-Free Deep RL for Model-Based Control | Feb 25, 2018 | continuous-control, Continuous Control | Unverified | 0 |
| Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments | Feb 23, 2018 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Efficient Collaborative Multi-Agent Deep Reinforcement Learning for Large-Scale Fleet Management | Feb 18, 2018 | Deep Reinforcement Learning, Management | Code Available | 0 |
| A Deep Q-Learning Agent for the L-Game with Variable Batch Training | Feb 17, 2018 | Q-Learning, Self-Learning | Code Available | 0 |
| Monte Carlo Q-learning for General Game Playing | Feb 16, 2018 | Board Games, Q-Learning | Code Available | 0 |
| Prioritized Sweeping Neural DynaQ with Multiple Predecessors, and Hippocampal Replays | Feb 15, 2018 | Hippocampus, Q-Learning | Unverified | 0 |
| Q-learning with Nearest Neighbors | Feb 12, 2018 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| M-Walk: Learning to Walk over Graphs using Monte Carlo Tree Search | Feb 12, 2018 | Knowledge Base Completion, Link Prediction | Unverified | 0 |
| Balancing Two-Player Stochastic Games with Soft Q-Learning | Feb 9, 2018 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| Using deep Q-learning to understand the tax evasion behavior of risk-averse firms | Jan 29, 2018 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Deep Reinforcement Learning using Capsules in Advanced Game Environments | Jan 29, 2018 | Deep Reinforcement Learning, General Classification | Unverified | 0 |
| The QLBS Q-Learner Goes NuQLear: Fitted Q Iteration, Inverse RL, and Option Portfolios | Jan 17, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Deep Reinforcement Fuzzing | Jan 14, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| DeepTraffic: Crowdsourced Hyperparameter Tuning of Deep Reinforcement Learning Systems for Multi-Agent Dense Traffic Navigation | Jan 9, 2018 | Autonomous Driving, Autonomous Navigation | Code Available | 0 |
| Trading the Twitter Sentiment with Reinforcement Learning | Jan 7, 2018 | BIG-bench Machine Learning, Q-Learning | Unverified | 0 |
| Faster Deep Q-learning using Neural Episodic Control | Jan 6, 2018 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| ScreenerNet: Learning Self-Paced Curriculum for Deep Neural Networks | Jan 3, 2018 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| ViZDoom: DRQN with Prioritized Experience Replay, Double-Q Learning, & Snapshot Ensembling | Jan 3, 2018 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| Learning Gaussian Policies from Smoothed Action Value Functions | Jan 1, 2018 | continuous-control, Continuous Control | Unverified | 0 |
| TD Learning with Constrained Gradients | Jan 1, 2018 | Q-Learning | Unverified | 0 |
| Avoiding Catastrophic States with Intrinsic Fear | Jan 1, 2018 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| Autonomous Vehicle Fleet Coordination With Deep Reinforcement Learning | Jan 1, 2018 | Autonomous Vehicles, Decision Making | Unverified | 0 |
| Representing Entropy : A short proof of the equivalence between soft Q-learning and policy gradients | Jan 1, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation | Dec 29, 2017 | Q-Learning, reinforcement-learning | Unverified | 0 |
| A short variational proof of equivalence between policy gradients and soft Q learning | Dec 22, 2017 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Scale-invariant temporal history (SITH): optimal slicing of the past in an uncertain world | Dec 19, 2017 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning | Dec 18, 2017 | Deep Reinforcement Learning, Evolutionary Algorithms | Code Available | 0 |
| Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents | Dec 18, 2017 | Deep Reinforcement Learning, Policy Gradient Methods | Code Available | 0 |
| Towards a Deep Reinforcement Learning Approach for Tower Line Wars | Dec 17, 2017 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| QLBS: Q-Learner in the Black-Scholes(-Merton) Worlds | Dec 13, 2017 | Benchmarking, Model-based Reinforcement Learning | Code Available | 0 |
| Robust Deep Reinforcement Learning with Adversarial Attacks | Dec 11, 2017 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Assumed Density Filtering Q-learning | Dec 9, 2017 | Atari Games, Bayesian Inference | Code Available | 0 |
| Deep Primal-Dual Reinforcement Learning: Accelerating Actor-Critic using Bellman Duality | Dec 7, 2017 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Q-LDA: Uncovering Latent Patterns in Text-based Sequential Decision Processes | Dec 1, 2017 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |