| Supervised Q-walk for Learning Vector Representation of Nodes in Networks | Oct 3, 2017 | ClassificationGeneral Classification | —Unverified | 0 |
| Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation | Sep 29, 2017 | Deep Reinforcement LearningNavigate | CodeCode Available | 0 |
| A Simple Reinforcement Learning Mechanism for Resource Allocation in LTE-A Networks with Markov Decision Process and Q-Learning | Sep 27, 2017 | Q-Learningreinforcement-learning | —Unverified | 0 |
| An Optimal Online Method of Selecting Source Policies for Reinforcement Learning | Sep 24, 2017 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Improving Search through A3C Reinforcement Learning based Conversational Agent | Sep 17, 2017 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Automated Cloud Provisioning on AWS using Deep Reinforcement Learning | Sep 13, 2017 | Cloud ComputingDeep Reinforcement Learning | CodeCode Available | 1 |
| BIBI System Description: Building with CNNs and Breaking with Deep Reinforcement Learning | Sep 1, 2017 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Constructing narrative using a generative model and continuous action policies | Sep 1, 2017 | Paraphrase IdentificationQ-Learning | —Unverified | 0 |
| Multi-Agent Q-Learning for Minimizing Demand-Supply Power Deficit in Microgrids | Aug 25, 2017 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Practical Block-wise Neural Network Architecture Generation | Aug 18, 2017 | image-classificationImage Classification | CodeCode Available | 0 |
| Investigating Reinforcement Learning Agents for Continuous State Space Environments | Aug 8, 2017 | OpenAI GymQ-Learning | —Unverified | 0 |
| Guiding Reinforcement Learning Exploration Using Natural Language | Jul 26, 2017 | DecoderMachine Translation | —Unverified | 0 |
| On-line Building Energy Optimization using Deep Reinforcement Learning | Jul 18, 2017 | Deep Reinforcement Learningenergy management | —Unverified | 0 |
| Empirical evaluation of a Q-Learning Algorithm for Model-free Autonomous Soaring | Jul 18, 2017 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Fastest Convergence for Q-learning | Jul 12, 2017 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Q-Learning Algorithm for VoLTE Closed-Loop Power Control in Indoor Small Cells | Jul 10, 2017 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Deep Q-Learning for Self-Organizing Networks Fault Management and Radio Performance Improvement | Jul 10, 2017 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| A Self-Adaptive Proposal Model for Temporal Action Detection based on Reinforcement Learning | Jun 22, 2017 | Action DetectionPosition | CodeCode Available | 0 |
| Reinforcement Learning under Model Mismatch | Jun 15, 2017 | modelQ-Learning | —Unverified | 0 |
| Learning to Learn from Noisy Web Videos | Jun 9, 2017 | Action RecognitionQ-Learning | —Unverified | 0 |
| Generalized Value Iteration Networks: Life Beyond Lattices | Jun 8, 2017 | Q-Learning | CodeCode Available | 0 |
| Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments | Jun 7, 2017 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| UCB Exploration via Q-Ensembles | Jun 5, 2017 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Implications of Decentralized Q-learning Resource Allocation in Wireless Networks | May 30, 2017 | Q-LearningReinforcement Learning | CodeCode Available | 0 |
| Learning to Factor Policies and Action-Value Functions: Factored Action Space Representations for Deep Reinforcement learning | May 20, 2017 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| A Comparison of Reinforcement Learning Techniques for Fuzzy Cloud Auto-Scaling | May 19, 2017 | ManagementQ-Learning | —Unverified | 0 |
| Identification and Off-Policy Learning of Multiple Objectives Using Adaptive Clustering | May 17, 2017 | ClusteringQ-Learning | —Unverified | 0 |
| Learning to Represent Haptic Feedback for Partially-Observable Tasks | May 17, 2017 | Q-Learning | —Unverified | 0 |
| Learning Hard Alignments with Variational Inference | May 16, 2017 | Hard AttentionImage Captioning | —Unverified | 0 |
| Discrete Sequential Prediction of Continuous Actions for Deep RL | May 14, 2017 | continuous-controlContinuous Control | —Unverified | 0 |
| Policy Iterations for Reinforcement Learning Problems in Continuous Time and Space -- Fundamental Theory and Methods | May 9, 2017 | Decision MakingQ-Learning | CodeCode Available | 0 |
| Deep Episodic Value Iteration for Model-based Meta-Reinforcement Learning | May 9, 2017 | Meta Reinforcement LearningModel-based Reinforcement Learning | —Unverified | 0 |
| Equivalence Between Policy Gradients and Soft Q-Learning | Apr 21, 2017 | Policy Gradient MethodsQ-Learning | —Unverified | 0 |
| Reinforcement Learning with External Knowledge and Two-Stage Q-functions for Predicting Popular Reddit Threads | Apr 20, 2017 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Deep Q-learning from Demonstrations | Apr 12, 2017 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Data-efficient Deep Reinforcement Learning for Dexterous Manipulation | Apr 10, 2017 | continuous-controlContinuous Control | —Unverified | 0 |
| Pseudorehearsal in value function approximation | Mar 21, 2017 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Online Learning for Offloading and Autoscaling in Energy Harvesting Mobile Edge Computing | Mar 17, 2017 | Edge-computingManagement | —Unverified | 0 |
| Evolution Strategies as a Scalable Alternative to Reinforcement Learning | Mar 10, 2017 | Atari GamesMuJoCo | CodeCode Available | 1 |
| Multi-step Reinforcement Learning: A Unifying Algorithm | Mar 3, 2017 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Bridging the Gap Between Value and Policy Based Reinforcement Learning | Feb 28, 2017 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning | Feb 28, 2017 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Reinforcement Learning with Deep Energy-Based Policies | Feb 27, 2017 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Learning Control for Air Hockey Striking using Deep Reinforcement Learning | Feb 26, 2017 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Collaborative Deep Reinforcement Learning for Joint Object Search | Feb 18, 2017 | Active Object LocalizationDeep Reinforcement Learning | —Unverified | 0 |
| The Game Imitation: Deep Supervised Convolutional Networks for Quick Video Game AI | Feb 18, 2017 | Decision MakingImitation Learning | —Unverified | 0 |
| FPGA Architecture for Deep Learning and its application to Planetary Robotics | Jan 26, 2017 | CPUQ-Learning | —Unverified | 0 |
| Learning to predict where to look in interactive environments using deep recurrent q-learning | Dec 17, 2016 | Atari GamesQ-Learning | —Unverified | 0 |
| Playing Doom with SLAM-Augmented Deep Reinforcement Learning | Dec 1, 2016 | Deep Reinforcement Learningobject-detection | CodeCode Available | 0 |
| Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic | Nov 7, 2016 | continuous-controlContinuous Control | CodeCode Available | 0 |