| Robbins-Monro conditions for persistent exploration learning strategies | Aug 1, 2018 | Q-Learning | Unverified | 0 |
| A Reinforcement Learning Approach to Target Tracking in a Camera Network | Jul 26, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Variational Bayesian Reinforcement Learning with Regret Bounds | Jul 25, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Accelerated Structure-Aware Reinforcement Learning for Delay-Sensitive Energy Harvesting Wireless Sensors | Jul 22, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Remember and Forget for Experience Replay | Jul 16, 2018 | Deep Reinforcement Learning, Policy Gradient Methods | Code Available | 0 |
| Discrete linear-complexity reinforcement learning in continuous action spaces for Q-learning algorithms | Jul 16, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Is Q-learning Provably Efficient? | Jul 10, 2018 | Q-Learning, Reinforcement Learning | Code Available | 1 |
| Video Summarisation by Classification with Deep Reinforcement Learning | Jul 9, 2018 | Classification, Decision Making | Unverified | 0 |
| Playing against Nature: causal discovery for decision making under uncertainty | Jul 3, 2018 | Causal Discovery, Decision Making | Unverified | 0 |
| Learning to Coordinate with Coordination Graphs in Repeated Single-Stage Multi-Agent Decision Problems | Jul 1, 2018 | Multi-Armed Bandits, Q-Learning | Unverified | 0 |
| Using Reward Machines for High-Level Task Specification and Decomposition in Reinforcement Learning | Jul 1, 2018 | Hierarchical Reinforcement Learning, Q-Learning | Code Available | 0 |
| Learning to Explore via Meta-Policy Gradient | Jul 1, 2018 | continuous-control, Continuous Control | Unverified | 0 |
| Many-Goals Reinforcement Learning | Jun 22, 2018 | All, Q-Learning | Unverified | 0 |
| Reinforcement Learning using Augmented Neural Networks | Jun 20, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Action Learning for 3D Point Cloud Based Organ Segmentation | Jun 14, 2018 | Organ Segmentation, Q-Learning | Unverified | 0 |
| Automatic formation of the structure of abstract machines in hierarchical reinforcement learning with state clustering | Jun 13, 2018 | Clustering, Hierarchical Reinforcement Learning | Unverified | 0 |
| Distributional Advantage Actor-Critic | Jun 10, 2018 | Q-Learning, quantile regression | Unverified | 0 |
| Fidelity-based Probabilistic Q-learning for Control of Quantum Systems | Jun 8, 2018 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| A Finite Time Analysis of Temporal Difference Learning With Linear Function Approximation | Jun 6, 2018 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| Hyperparameter Optimization for Tracking With Continuous Deep Q-Learning | Jun 1, 2018 | Hyperparameter Optimization, Object Tracking | Unverified | 0 |
| Depth and nonlinearity induce implicit exploration for RL | May 29, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Hierarchical clustering with deep Q-learning | May 28, 2018 | Clustering, Q-Learning | Unverified | 0 |
| Learning Self-Imitating Diverse Policies | May 25, 2018 | continuous-control, Continuous Control | Unverified | 0 |
| When Simple Exploration is Sample Efficient: Identifying Sufficient Conditions for Random Exploration to Yield PAC RL Algorithms | May 23, 2018 | Efficient Exploration, Q-Learning | Unverified | 0 |
| Learning Sampling Policies for Domain Adaptation | May 19, 2018 | Classification, Domain Adaptation | Unverified | 0 |
| Algorithmic Trading with Fitted Q Iteration and Heston Model | May 18, 2018 | Algorithmic Trading, Q-Learning | Unverified | 0 |
| GAN Q-learning | May 13, 2018 | Distributional Reinforcement Learning, OpenAI Gym | Code Available | 0 |
| Stochastic Approximation for Risk-aware Markov Decision Processes | May 11, 2018 | Q-Learning | Unverified | 0 |
| Planning and Learning with Stochastic Action Sets | May 7, 2018 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| A Hybrid Q-Learning Sine-Cosine-based Strategy for Addressing the Combinatorial Test Suite Minimization Problem | Apr 27, 2018 | Q-Learning | Unverified | 0 |
| Multiagent Soft Q-Learning | Apr 25, 2018 | Policy Gradient Methods, Q-Learning | Unverified | 0 |
| Towards Symbolic Reinforcement Learning with Common Sense | Apr 23, 2018 | Common Sense Reasoning, Deep Reinforcement Learning | Code Available | 0 |
| Benchmarking projective simulation in navigation problems | Apr 23, 2018 | Benchmarking, Q-Learning | Unverified | 0 |
| State Distribution-aware Sampling for Deep Q-learning | Apr 23, 2018 | Atari Games, OpenAI Gym | Unverified | 0 |
| Nonparametric Stochastic Compositional Gradient Descent for Q-Learning in Continuous Markov Decision Problems | Apr 19, 2018 | Q-Learning, Stochastic Optimization | Code Available | 0 |
| Reinforced Co-Training | Apr 17, 2018 | Clickbait Detection, General Classification | Unverified | 0 |
| State-Augmentation Transformations for Risk-Sensitive Reinforcement Learning | Apr 16, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| CytonRL: an Efficient Reinforcement Learning Open-source Toolkit Implemented in C++ | Apr 14, 2018 | GPU, Q-Learning | Code Available | 0 |
| Hierarchical Modular Reinforcement Learning Method and Knowledge Acquisition of State-Action Rule for Multi-target Problem | Apr 8, 2018 | Position, Q-Learning | Unverified | 0 |
| Information Maximizing Exploration with a Latent Dynamics Model | Apr 4, 2018 | continuous-control, Continuous Control | Unverified | 0 |
| Joint Learning of Interactive Spoken Content Retrieval and Trainable User Simulator | Apr 1, 2018 | Information Retrieval, Q-Learning | Unverified | 0 |
| Deep Reinforcement Learning for Traffic Light Control in Vehicular Networks | Mar 29, 2018 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Learning Synergies between Pushing and Grasping with Self-supervised Deep Reinforcement Learning | Mar 27, 2018 | Deep Reinforcement Learning, Q-Learning | Code Available | 1 |
| Natural Gradient Deep Q-learning | Mar 20, 2018 | Deep Reinforcement Learning, Hyperparameter Optimization | Unverified | 0 |
| Composable Deep Reinforcement Learning for Robotic Manipulation | Mar 19, 2018 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Learning to Explore with Meta-Policy Gradient | Mar 13, 2018 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| Multi-Armed Bandits for Correlated Markovian Environments with Smoothed Reward Feedback | Mar 11, 2018 | Multi-Armed Bandits, Q-Learning | Unverified | 0 |
| Deep reinforcement learning for time series: playing idealized trading games | Mar 11, 2018 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| SA-IGA: A Multiagent Reinforcement Learning Method Towards Socially Optimal Outcomes | Mar 8, 2018 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Smoothed Action Value Functions for Learning Gaussian Policies | Mar 6, 2018 | continuous-control, Continuous Control | Unverified | 0 |