| Q-CP: Learning Action Values for Cooperative Planning | Mar 1, 2018 | Model-based Reinforcement LearningQ-Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods | Feb 28, 2018 | Deep Reinforcement LearningDiversity | CodeCode Available | 0 |
| Variance Reduction Methods for Sublinear Reinforcement Learning | Feb 26, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Addressing Function Approximation Error in Actor-Critic Methods | Feb 26, 2018 | Continuous ControlOpenAI Gym | CodeCode Available | 1 |
| Temporal Difference Models: Model-Free Deep RL for Model-Based Control | Feb 25, 2018 | continuous-controlContinuous Control | —Unverified | 0 |
| Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments | Feb 23, 2018 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Efficient Collaborative Multi-Agent Deep Reinforcement Learning for Large-Scale Fleet Management | Feb 18, 2018 | Deep Reinforcement LearningManagement | CodeCode Available | 0 |
| A Deep Q-Learning Agent for the L-Game with Variable Batch Training | Feb 17, 2018 | Q-LearningSelf-Learning | CodeCode Available | 0 |
| Monte Carlo Q-learning for General Game Playing | Feb 16, 2018 | Board GamesQ-Learning | CodeCode Available | 0 |
| Prioritized Sweeping Neural DynaQ with Multiple Predecessors, and Hippocampal Replays | Feb 15, 2018 | HippocampusQ-Learning | —Unverified | 0 |
| Mean Field Multi-Agent Reinforcement Learning | Feb 15, 2018 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Q-learning with Nearest Neighbors | Feb 12, 2018 | Q-LearningReinforcement Learning | —Unverified | 0 |
| M-Walk: Learning to Walk over Graphs using Monte Carlo Tree Search | Feb 12, 2018 | Knowledge Base CompletionLink Prediction | —Unverified | 0 |
| Balancing Two-Player Stochastic Games with Soft Q-Learning | Feb 9, 2018 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning using Capsules in Advanced Game Environments | Jan 29, 2018 | Deep Reinforcement LearningGeneral Classification | —Unverified | 0 |
| Using deep Q-learning to understand the tax evasion behavior of risk-averse firms | Jan 29, 2018 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| The QLBS Q-Learner Goes NuQLear: Fitted Q Iteration, Inverse RL, and Option Portfolios | Jan 17, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Deep Reinforcement Fuzzing | Jan 14, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 |
| DeepTraffic: Crowdsourced Hyperparameter Tuning of Deep Reinforcement Learning Systems for Multi-Agent Dense Traffic Navigation | Jan 9, 2018 | Autonomous DrivingAutonomous Navigation | CodeCode Available | 0 |
| Trading the Twitter Sentiment with Reinforcement Learning | Jan 7, 2018 | BIG-bench Machine LearningQ-Learning | —Unverified | 0 |
| Faster Deep Q-learning using Neural Episodic Control | Jan 6, 2018 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor | Jan 4, 2018 | Continuous ControlDecision Making | CodeCode Available | 1 |
| ViZDoom: DRQN with Prioritized Experience Replay, Double-Q Learning, & Snapshot Ensembling | Jan 3, 2018 | Q-LearningReinforcement Learning | —Unverified | 0 |
| ScreenerNet: Learning Self-Paced Curriculum for Deep Neural Networks | Jan 3, 2018 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Autonomous Vehicle Fleet Coordination With Deep Reinforcement Learning | Jan 1, 2018 | Autonomous VehiclesDecision Making | —Unverified | 0 |