| Q-CP: Learning Action Values for Cooperative Planning | Mar 1, 2018 | Model-based Reinforcement Learning, Q-Learning | Unverified | 0 |
| Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods | Feb 28, 2018 | Deep Reinforcement Learning, Diversity | Code Available | 0 |
| Variance Reduction Methods for Sublinear Reinforcement Learning | Feb 26, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Addressing Function Approximation Error in Actor-Critic Methods | Feb 26, 2018 | Continuous Control, OpenAI Gym | Code Available | 1 |
| Temporal Difference Models: Model-Free Deep RL for Model-Based Control | Feb 25, 2018 | continuous-control, Continuous Control | Unverified | 0 |
| Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments | Feb 23, 2018 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Efficient Collaborative Multi-Agent Deep Reinforcement Learning for Large-Scale Fleet Management | Feb 18, 2018 | Deep Reinforcement Learning, Management | Code Available | 0 |
| A Deep Q-Learning Agent for the L-Game with Variable Batch Training | Feb 17, 2018 | Q-Learning, Self-Learning | Code Available | 0 |
| Monte Carlo Q-learning for General Game Playing | Feb 16, 2018 | Board Games, Q-Learning | Code Available | 0 |
| Prioritized Sweeping Neural DynaQ with Multiple Predecessors, and Hippocampal Replays | Feb 15, 2018 | Hippocampus, Q-Learning | Unverified | 0 |
| Mean Field Multi-Agent Reinforcement Learning | Feb 15, 2018 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 1 |
| Q-learning with Nearest Neighbors | Feb 12, 2018 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| M-Walk: Learning to Walk over Graphs using Monte Carlo Tree Search | Feb 12, 2018 | Knowledge Base Completion, Link Prediction | Unverified | 0 |
| Balancing Two-Player Stochastic Games with Soft Q-Learning | Feb 9, 2018 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| Deep Reinforcement Learning using Capsules in Advanced Game Environments | Jan 29, 2018 | Deep Reinforcement Learning, General Classification | Unverified | 0 |
| Using deep Q-learning to understand the tax evasion behavior of risk-averse firms | Jan 29, 2018 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| The QLBS Q-Learner Goes NuQLear: Fitted Q Iteration, Inverse RL, and Option Portfolios | Jan 17, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Deep Reinforcement Fuzzing | Jan 14, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| DeepTraffic: Crowdsourced Hyperparameter Tuning of Deep Reinforcement Learning Systems for Multi-Agent Dense Traffic Navigation | Jan 9, 2018 | Autonomous Driving, Autonomous Navigation | Code Available | 0 |
| Trading the Twitter Sentiment with Reinforcement Learning | Jan 7, 2018 | BIG-bench Machine Learning, Q-Learning | Unverified | 0 |
| Faster Deep Q-learning using Neural Episodic Control | Jan 6, 2018 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor | Jan 4, 2018 | Continuous Control, Decision Making | Code Available | 1 |
| ViZDoom: DRQN with Prioritized Experience Replay, Double-Q Learning, & Snapshot Ensembling | Jan 3, 2018 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| ScreenerNet: Learning Self-Paced Curriculum for Deep Neural Networks | Jan 3, 2018 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Autonomous Vehicle Fleet Coordination With Deep Reinforcement Learning | Jan 1, 2018 | Autonomous Vehicles, Decision Making | Unverified | 0 |
| Learning Gaussian Policies from Smoothed Action Value Functions | Jan 1, 2018 | continuous-control, Continuous Control | Unverified | 0 |
| Representing Entropy : A short proof of the equivalence between soft Q-learning and policy gradients | Jan 1, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| TD Learning with Constrained Gradients | Jan 1, 2018 | Q-Learning | Unverified | 0 |
| Avoiding Catastrophic States with Intrinsic Fear | Jan 1, 2018 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation | Dec 29, 2017 | Q-Learning, reinforcement-learning | Unverified | 0 |
| A short variational proof of equivalence between policy gradients and soft Q learning | Dec 22, 2017 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Scale-invariant temporal history (SITH): optimal slicing of the past in an uncertain world | Dec 19, 2017 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning | Dec 18, 2017 | Deep Reinforcement Learning, Evolutionary Algorithms | Code Available | 0 |
| Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents | Dec 18, 2017 | Deep Reinforcement Learning, Policy Gradient Methods | Code Available | 0 |
| Towards a Deep Reinforcement Learning Approach for Tower Line Wars | Dec 17, 2017 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| QLBS: Q-Learner in the Black-Scholes(-Merton) Worlds | Dec 13, 2017 | Benchmarking, Model-based Reinforcement Learning | Code Available | 0 |
| Robust Deep Reinforcement Learning with Adversarial Attacks | Dec 11, 2017 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Assumed Density Filtering Q-learning | Dec 9, 2017 | Atari Games, Bayesian Inference | Code Available | 0 |
| Deep Primal-Dual Reinforcement Learning: Accelerating Actor-Critic using Bellman Duality | Dec 7, 2017 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Zap Q-Learning | Dec 1, 2017 | Q-Learning | Unverified | 0 |
| Q-LDA: Uncovering Latent Patterns in Text-based Sequential Decision Processes | Dec 1, 2017 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Curriculum Q-Learning for Visual Vocabulary Acquisition | Nov 29, 2017 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| A reinforcement learning algorithm for building collaboration in multi-agent systems | Nov 28, 2017 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Classification with Costly Features using Deep Reinforcement Learning | Nov 20, 2017 | Classification, Classification with Costly Features | Code Available | 0 |
| Neural Network Based Reinforcement Learning for Audio-Visual Gaze Control in Human-Robot Interaction | Nov 18, 2017 | parameter estimation, Q-Learning | Unverified | 0 |
| BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems | Nov 15, 2017 | Deep Reinforcement Learning, Efficient Exploration | Unverified | 0 |
| A unified decision making framework for supply and demand management in microgrid networks | Nov 14, 2017 | Decision Making, Management | Unverified | 0 |
| Double Q(σ) and Q(σ, λ): Unifying Reinforcement Learning Control Algorithms | Nov 5, 2017 | Q-Learning, reinforcement-learning | Unverified | 0 |
| The Effects of Memory Replay in Reinforcement Learning | Oct 18, 2017 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Deep Reinforcement Learning: Framework, Applications, and Embedded Implementations | Oct 10, 2017 | Cloud Computing, Deep Reinforcement Learning | Unverified | 0 |