| Using deep Q-learning to understand the tax evasion behavior of risk-averse firms | Jan 29, 2018 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Deep Reinforcement Learning using Capsules in Advanced Game Environments | Jan 29, 2018 | Deep Reinforcement Learning, General Classification | Unverified | 0 |
| The QLBS Q-Learner Goes NuQLear: Fitted Q Iteration, Inverse RL, and Option Portfolios | Jan 17, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Deep Reinforcement Fuzzing | Jan 14, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| DeepTraffic: Crowdsourced Hyperparameter Tuning of Deep Reinforcement Learning Systems for Multi-Agent Dense Traffic Navigation | Jan 9, 2018 | Autonomous Driving, Autonomous Navigation | Code Available | 0 |
| Trading the Twitter Sentiment with Reinforcement Learning | Jan 7, 2018 | BIG-bench Machine Learning, Q-Learning | Unverified | 0 |
| Faster Deep Q-learning using Neural Episodic Control | Jan 6, 2018 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| ScreenerNet: Learning Self-Paced Curriculum for Deep Neural Networks | Jan 3, 2018 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| ViZDoom: DRQN with Prioritized Experience Replay, Double-Q Learning, & Snapshot Ensembling | Jan 3, 2018 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| Learning Gaussian Policies from Smoothed Action Value Functions | Jan 1, 2018 | continuous-control, Continuous Control | Unverified | 0 |
| TD Learning with Constrained Gradients | Jan 1, 2018 | Q-Learning | Unverified | 0 |
| Avoiding Catastrophic States with Intrinsic Fear | Jan 1, 2018 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| Autonomous Vehicle Fleet Coordination With Deep Reinforcement Learning | Jan 1, 2018 | Autonomous Vehicles, Decision Making | Unverified | 0 |
| Representing Entropy : A short proof of the equivalence between soft Q-learning and policy gradients | Jan 1, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation | Dec 29, 2017 | Q-Learning, reinforcement-learning | Unverified | 0 |
| A short variational proof of equivalence between policy gradients and soft Q learning | Dec 22, 2017 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Scale-invariant temporal history (SITH): optimal slicing of the past in an uncertain world | Dec 19, 2017 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning | Dec 18, 2017 | Deep Reinforcement Learning, Evolutionary Algorithms | Code Available | 0 |
| Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents | Dec 18, 2017 | Deep Reinforcement Learning, Policy Gradient Methods | Code Available | 0 |
| Towards a Deep Reinforcement Learning Approach for Tower Line Wars | Dec 17, 2017 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| QLBS: Q-Learner in the Black-Scholes(-Merton) Worlds | Dec 13, 2017 | Benchmarking, Model-based Reinforcement Learning | Code Available | 0 |
| Robust Deep Reinforcement Learning with Adversarial Attacks | Dec 11, 2017 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Assumed Density Filtering Q-learning | Dec 9, 2017 | Atari Games, Bayesian Inference | Code Available | 0 |
| Deep Primal-Dual Reinforcement Learning: Accelerating Actor-Critic using Bellman Duality | Dec 7, 2017 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Q-LDA: Uncovering Latent Patterns in Text-based Sequential Decision Processes | Dec 1, 2017 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |