| Learning through Probing: a decentralized reinforcement learning architecture for social dilemmas | Sep 26, 2018 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Floyd-Warshall Reinforcement Learning: Learning from Past Experiences to Reach New Goals | Sep 25, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Target Transfer Q-Learning and Its Convergence Analysis | Sep 21, 2018 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Model-Free Adaptive Optimal Control of Episodic Fixed-Horizon Manufacturing Processes using Reinforcement Learning | Sep 18, 2018 | Model Predictive ControlQ-Learning | CodeCode Available | 0 |
| Optimal Matrix Momentum Stochastic Approximation and Applications to Q-learning | Sep 17, 2018 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Hidden Markov Model Estimation-Based Q-learning for Partially Observable Markov Decision Process | Sep 17, 2018 | Q-Learning | —Unverified | 0 |
| Deterministic Implementations for Reproducibility in Deep Reinforcement Learning | Sep 15, 2018 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Sampled Policy Gradient for Learning to Play the Game Agar.io | Sep 15, 2018 | Game DesignQ-Learning | CodeCode Available | 0 |
| Towards Better Interpretability in Deep Q-Networks | Sep 15, 2018 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Directed Exploration in PAC Model-Free Reinforcement Learning | Aug 31, 2018 | Efficient Explorationmodel | —Unverified | 0 |
| MARL-FWC: Optimal Coordination of Freeway Traffic Control Measures | Aug 27, 2018 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| BlockQNN: Efficient Block-wise Neural Network Architecture Generation | Aug 16, 2018 | GPUimage-classification | CodeCode Available | 0 |
| Automatic Derivation Of Formulas Using Reforcement Learning | Aug 15, 2018 | Q-Learning | —Unverified | 0 |
| A Framework for Automated Cellular Network Tuning with Reinforcement Learning | Aug 13, 2018 | ManagementQ-Learning | CodeCode Available | 0 |
| Multi-Agent Deep Reinforcement Learning for Dynamic Power Allocation in Wireless Networks | Aug 1, 2018 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Robbins-Monro conditions for persistent exploration learning strategies | Aug 1, 2018 | Q-Learning | —Unverified | 0 |
| A Reinforcement Learning Approach to Target Tracking in a Camera Network | Jul 26, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Variational Bayesian Reinforcement Learning with Regret Bounds | Jul 25, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Accelerated Structure-Aware Reinforcement Learning for Delay-Sensitive Energy Harvesting Wireless Sensors | Jul 22, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Discrete linear-complexity reinforcement learning in continuous action spaces for Q-learning algorithms | Jul 16, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Remember and Forget for Experience Replay | Jul 16, 2018 | Deep Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 |
| Video Summarisation by Classification with Deep Reinforcement Learning | Jul 9, 2018 | ClassificationDecision Making | —Unverified | 0 |
| Playing against Nature: causal discovery for decision making under uncertainty | Jul 3, 2018 | Causal DiscoveryDecision Making | —Unverified | 0 |
| Learning to Explore via Meta-Policy Gradient | Jul 1, 2018 | continuous-controlContinuous Control | —Unverified | 0 |
| Using Reward Machines for High-Level Task Specification and Decomposition in Reinforcement Learning | Jul 1, 2018 | Hierarchical Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Learning to Coordinate with Coordination Graphs in Repeated Single-Stage Multi-Agent Decision Problems | Jul 1, 2018 | Multi-Armed BanditsQ-Learning | —Unverified | 0 |
| Many-Goals Reinforcement Learning | Jun 22, 2018 | AllQ-Learning | —Unverified | 0 |
| Reinforcement Learning using Augmented Neural Networks | Jun 20, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Action Learning for 3D Point Cloud Based Organ Segmentation | Jun 14, 2018 | Organ SegmentationQ-Learning | —Unverified | 0 |
| Automatic formation of the structure of abstract machines in hierarchical reinforcement learning with state clustering | Jun 13, 2018 | ClusteringHierarchical Reinforcement Learning | —Unverified | 0 |
| Distributional Advantage Actor-Critic | Jun 10, 2018 | Q-Learningquantile regression | —Unverified | 0 |
| Fidelity-based Probabilistic Q-learning for Control of Quantum Systems | Jun 8, 2018 | Q-LearningReinforcement Learning | —Unverified | 0 |
| A Finite Time Analysis of Temporal Difference Learning With Linear Function Approximation | Jun 6, 2018 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Hyperparameter Optimization for Tracking With Continuous Deep Q-Learning | Jun 1, 2018 | Hyperparameter OptimizationObject Tracking | —Unverified | 0 |
| Depth and nonlinearity induce implicit exploration for RL | May 29, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Hierarchical clustering with deep Q-learning | May 28, 2018 | ClusteringQ-Learning | —Unverified | 0 |
| Learning Self-Imitating Diverse Policies | May 25, 2018 | continuous-controlContinuous Control | —Unverified | 0 |
| When Simple Exploration is Sample Efficient: Identifying Sufficient Conditions for Random Exploration to Yield PAC RL Algorithms | May 23, 2018 | Efficient ExplorationQ-Learning | —Unverified | 0 |
| Learning Sampling Policies for Domain Adaptation | May 19, 2018 | ClassificationDomain Adaptation | —Unverified | 0 |
| Algorithmic Trading with Fitted Q Iteration and Heston Model | May 18, 2018 | Algorithmic TradingQ-Learning | —Unverified | 0 |
| GAN Q-learning | May 13, 2018 | Distributional Reinforcement LearningOpenAI Gym | CodeCode Available | 0 |
| Stochastic Approximation for Risk-aware Markov Decision Processes | May 11, 2018 | Q-Learning | —Unverified | 0 |
| Planning and Learning with Stochastic Action Sets | May 7, 2018 | Q-LearningReinforcement Learning | —Unverified | 0 |
| A Hybrid Q-Learning Sine-Cosine-based Strategy for Addressing the Combinatorial Test Suite Minimization Problem | Apr 27, 2018 | Q-Learning | —Unverified | 0 |
| Multiagent Soft Q-Learning | Apr 25, 2018 | Policy Gradient MethodsQ-Learning | —Unverified | 0 |
| Benchmarking projective simulation in navigation problems | Apr 23, 2018 | BenchmarkingQ-Learning | —Unverified | 0 |
| Towards Symbolic Reinforcement Learning with Common Sense | Apr 23, 2018 | Common Sense ReasoningDeep Reinforcement Learning | CodeCode Available | 0 |
| State Distribution-aware Sampling for Deep Q-learning | Apr 23, 2018 | Atari GamesOpenAI Gym | —Unverified | 0 |
| Nonparametric Stochastic Compositional Gradient Descent for Q-Learning in Continuous Markov Decision Problems | Apr 19, 2018 | Q-LearningStochastic Optimization | CodeCode Available | 0 |
| Reinforced Co-Training | Apr 17, 2018 | Clickbait DetectionGeneral Classification | —Unverified | 0 |