| Learning Gaussian Policies from Smoothed Action Value Functions | Jan 1, 2018 | continuous-controlContinuous Control | —Unverified | 0 |
| Representing Entropy : A short proof of the equivalence between soft Q-learning and policy gradients | Jan 1, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 |
| TD Learning with Constrained Gradients | Jan 1, 2018 | Q-Learning | —Unverified | 0 |
| Avoiding Catastrophic States with Intrinsic Fear | Jan 1, 2018 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation | Dec 29, 2017 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A short variational proof of equivalence between policy gradients and soft Q learning | Dec 22, 2017 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Scale-invariant temporal history (SITH): optimal slicing of the past in an uncertain world | Dec 19, 2017 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning | Dec 18, 2017 | Deep Reinforcement LearningEvolutionary Algorithms | CodeCode Available | 0 |
| Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents | Dec 18, 2017 | Deep Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 |
| Towards a Deep Reinforcement Learning Approach for Tower Line Wars | Dec 17, 2017 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| QLBS: Q-Learner in the Black-Scholes(-Merton) Worlds | Dec 13, 2017 | BenchmarkingModel-based Reinforcement Learning | CodeCode Available | 0 |
| Robust Deep Reinforcement Learning with Adversarial Attacks | Dec 11, 2017 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Assumed Density Filtering Q-learning | Dec 9, 2017 | Atari GamesBayesian Inference | CodeCode Available | 0 |
| Deep Primal-Dual Reinforcement Learning: Accelerating Actor-Critic using Bellman Duality | Dec 7, 2017 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Zap Q-Learning | Dec 1, 2017 | Q-Learning | —Unverified | 0 |
| Q-LDA: Uncovering Latent Patterns in Text-based Sequential Decision Processes | Dec 1, 2017 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Curriculum Q-Learning for Visual Vocabulary Acquisition | Nov 29, 2017 | Q-LearningReinforcement Learning | —Unverified | 0 |
| A reinforcement learning algorithm for building collaboration in multi-agent systems | Nov 28, 2017 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Classification with Costly Features using Deep Reinforcement Learning | Nov 20, 2017 | ClassificationClassification with Costly Features | CodeCode Available | 0 |
| Neural Network Based Reinforcement Learning for Audio-Visual Gaze Control in Human-Robot Interaction | Nov 18, 2017 | parameter estimationQ-Learning | —Unverified | 0 |
| BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems | Nov 15, 2017 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |
| A unified decision making framework for supply and demand management in microgrid networks | Nov 14, 2017 | Decision MakingManagement | —Unverified | 0 |
| Double Q(σ) and Q(σ, λ): Unifying Reinforcement Learning Control Algorithms | Nov 5, 2017 | Q-Learningreinforcement-learning | —Unverified | 0 |
| The Effects of Memory Replay in Reinforcement Learning | Oct 18, 2017 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning: Framework, Applications, and Embedded Implementations | Oct 10, 2017 | Cloud ComputingDeep Reinforcement Learning | —Unverified | 0 |