| Zap Q-Learning | Dec 1, 2017 | Q-Learning | Unverified | 0 |
| Curriculum Q-Learning for Visual Vocabulary Acquisition | Nov 29, 2017 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| A reinforcement learning algorithm for building collaboration in multi-agent systems | Nov 28, 2017 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Classification with Costly Features using Deep Reinforcement Learning | Nov 20, 2017 | Classification, Classification with Costly Features | Code Available | 0 |
| Neural Network Based Reinforcement Learning for Audio-Visual Gaze Control in Human-Robot Interaction | Nov 18, 2017 | parameter estimation, Q-Learning | Unverified | 0 |
| BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems | Nov 15, 2017 | Deep Reinforcement Learning, Efficient Exploration | Unverified | 0 |
| A unified decision making framework for supply and demand management in microgrid networks | Nov 14, 2017 | Decision Making, Management | Unverified | 0 |
| Double Q(σ) and Q(σ, λ): Unifying Reinforcement Learning Control Algorithms | Nov 5, 2017 | Q-Learning, reinforcement-learning | Unverified | 0 |
| The Effects of Memory Replay in Reinforcement Learning | Oct 18, 2017 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Deep Reinforcement Learning: Framework, Applications, and Embedded Implementations | Oct 10, 2017 | Cloud Computing, Deep Reinforcement Learning | Unverified | 0 |
| Supervised Q-walk for Learning Vector Representation of Nodes in Networks | Oct 3, 2017 | Classification, General Classification | Unverified | 0 |
| Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation | Sep 29, 2017 | Deep Reinforcement Learning, Navigate | Code Available | 0 |
| A Simple Reinforcement Learning Mechanism for Resource Allocation in LTE-A Networks with Markov Decision Process and Q-Learning | Sep 27, 2017 | Q-Learning, reinforcement-learning | Unverified | 0 |
| An Optimal Online Method of Selecting Source Policies for Reinforcement Learning | Sep 24, 2017 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Improving Search through A3C Reinforcement Learning based Conversational Agent | Sep 17, 2017 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Constructing narrative using a generative model and continuous action policies | Sep 1, 2017 | Paraphrase Identification, Q-Learning | Unverified | 0 |
| BIBI System Description: Building with CNNs and Breaking with Deep Reinforcement Learning | Sep 1, 2017 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Multi-Agent Q-Learning for Minimizing Demand-Supply Power Deficit in Microgrids | Aug 25, 2017 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| Practical Block-wise Neural Network Architecture Generation | Aug 18, 2017 | image-classification, Image Classification | Code Available | 0 |
| Investigating Reinforcement Learning Agents for Continuous State Space Environments | Aug 8, 2017 | OpenAI Gym, Q-Learning | Unverified | 0 |
| Guiding Reinforcement Learning Exploration Using Natural Language | Jul 26, 2017 | Decoder, Machine Translation | Unverified | 0 |
| Empirical evaluation of a Q-Learning Algorithm for Model-free Autonomous Soaring | Jul 18, 2017 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| On-line Building Energy Optimization using Deep Reinforcement Learning | Jul 18, 2017 | Deep Reinforcement Learning, energy management | Unverified | 0 |
| Fastest Convergence for Q-learning | Jul 12, 2017 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Deep Q-Learning for Self-Organizing Networks Fault Management and Radio Performance Improvement | Jul 10, 2017 | Deep Reinforcement Learning, Management | Unverified | 0 |
| Q-Learning Algorithm for VoLTE Closed-Loop Power Control in Indoor Small Cells | Jul 10, 2017 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| A Self-Adaptive Proposal Model for Temporal Action Detection based on Reinforcement Learning | Jun 22, 2017 | Action Detection, Position | Code Available | 0 |
| Reinforcement Learning under Model Mismatch | Jun 15, 2017 | model, Q-Learning | Unverified | 0 |
| Learning to Learn from Noisy Web Videos | Jun 9, 2017 | Action Recognition, Q-Learning | Unverified | 0 |
| Generalized Value Iteration Networks: Life Beyond Lattices | Jun 8, 2017 | Q-Learning | Code Available | 0 |
| UCB Exploration via Q-Ensembles | Jun 5, 2017 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Implications of Decentralized Q-learning Resource Allocation in Wireless Networks | May 30, 2017 | Q-Learning, Reinforcement Learning | Code Available | 0 |
| Learning to Factor Policies and Action-Value Functions: Factored Action Space Representations for Deep Reinforcement learning | May 20, 2017 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| A Comparison of Reinforcement Learning Techniques for Fuzzy Cloud Auto-Scaling | May 19, 2017 | Management, Q-Learning | Unverified | 0 |
| Learning to Represent Haptic Feedback for Partially-Observable Tasks | May 17, 2017 | Q-Learning | Unverified | 0 |
| Identification and Off-Policy Learning of Multiple Objectives Using Adaptive Clustering | May 17, 2017 | Clustering, Q-Learning | Unverified | 0 |
| Learning Hard Alignments with Variational Inference | May 16, 2017 | Hard Attention, Image Captioning | Unverified | 0 |
| Discrete Sequential Prediction of Continuous Actions for Deep RL | May 14, 2017 | continuous-control, Continuous Control | Unverified | 0 |
| Policy Iterations for Reinforcement Learning Problems in Continuous Time and Space -- Fundamental Theory and Methods | May 9, 2017 | Decision Making, Q-Learning | Code Available | 0 |
| Deep Episodic Value Iteration for Model-based Meta-Reinforcement Learning | May 9, 2017 | Meta Reinforcement Learning, Model-based Reinforcement Learning | Unverified | 0 |
| Equivalence Between Policy Gradients and Soft Q-Learning | Apr 21, 2017 | Policy Gradient Methods, Q-Learning | Unverified | 0 |
| Reinforcement Learning with External Knowledge and Two-Stage Q-functions for Predicting Popular Reddit Threads | Apr 20, 2017 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Deep Q-learning from Demonstrations | Apr 12, 2017 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Data-efficient Deep Reinforcement Learning for Dexterous Manipulation | Apr 10, 2017 | continuous-control, Continuous Control | Unverified | 0 |
| Pseudorehearsal in value function approximation | Mar 21, 2017 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Online Learning for Offloading and Autoscaling in Energy Harvesting Mobile Edge Computing | Mar 17, 2017 | Edge-computing, Management | Unverified | 0 |
| Multi-step Reinforcement Learning: A Unifying Algorithm | Mar 3, 2017 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Bridging the Gap Between Value and Policy Based Reinforcement Learning | Feb 28, 2017 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Reinforcement Learning with Deep Energy-Based Policies | Feb 27, 2017 | Q-Learning, reinforcement-learning | Code Available | 0 |
| Learning Control for Air Hockey Striking using Deep Reinforcement Learning | Feb 26, 2017 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |