SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 18011850 of 1918 papers

TitleStatusHype
Zap Q-Learning0
Curriculum Q-Learning for Visual Vocabulary Acquisition0
A reinforcement learning algorithm for building collaboration in multi-agent systems0
Classification with Costly Features using Deep Reinforcement LearningCode0
Neural Network Based Reinforcement Learning for Audio-Visual Gaze Control in Human-Robot Interaction0
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems0
A unified decision making framework for supply and demand management in microgrid networks0
Double Q(σ) and Q(σ, λ): Unifying Reinforcement Learning Control Algorithms0
The Effects of Memory Replay in Reinforcement LearningCode0
Deep Reinforcement Learning: Framework, Applications, and Embedded Implementations0
Supervised Q-walk for Learning Vector Representation of Nodes in Networks0
Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot NavigationCode0
A Simple Reinforcement Learning Mechanism for Resource Allocation in LTE-A Networks with Markov Decision Process and Q-Learning0
An Optimal Online Method of Selecting Source Policies for Reinforcement Learning0
Improving Search through A3C Reinforcement Learning based Conversational Agent0
Constructing narrative using a generative model and continuous action policies0
BIBI System Description: Building with CNNs and Breaking with Deep Reinforcement Learning0
Multi-Agent Q-Learning for Minimizing Demand-Supply Power Deficit in Microgrids0
Practical Block-wise Neural Network Architecture GenerationCode0
Investigating Reinforcement Learning Agents for Continuous State Space Environments0
Guiding Reinforcement Learning Exploration Using Natural Language0
Empirical evaluation of a Q-Learning Algorithm for Model-free Autonomous Soaring0
On-line Building Energy Optimization using Deep Reinforcement Learning0
Fastest Convergence for Q-learning0
Deep Q-Learning for Self-Organizing Networks Fault Management and Radio Performance Improvement0
Q-Learning Algorithm for VoLTE Closed-Loop Power Control in Indoor Small Cells0
A Self-Adaptive Proposal Model for Temporal Action Detection based on Reinforcement LearningCode0
Reinforcement Learning under Model Mismatch0
Learning to Learn from Noisy Web Videos0
Generalized Value Iteration Networks: Life Beyond LatticesCode0
UCB Exploration via Q-Ensembles0
Implications of Decentralized Q-learning Resource Allocation in Wireless NetworksCode0
Learning to Factor Policies and Action-Value Functions: Factored Action Space Representations for Deep Reinforcement learning0
A Comparison of Reinforcement Learning Techniques for Fuzzy Cloud Auto-Scaling0
Learning to Represent Haptic Feedback for Partially-Observable Tasks0
Identification and Off-Policy Learning of Multiple Objectives Using Adaptive Clustering0
Learning Hard Alignments with Variational Inference0
Discrete Sequential Prediction of Continuous Actions for Deep RL0
Policy Iterations for Reinforcement Learning Problems in Continuous Time and Space -- Fundamental Theory and MethodsCode0
Deep Episodic Value Iteration for Model-based Meta-Reinforcement Learning0
Equivalence Between Policy Gradients and Soft Q-Learning0
Reinforcement Learning with External Knowledge and Two-Stage Q-functions for Predicting Popular Reddit Threads0
Deep Q-learning from DemonstrationsCode0
Data-efficient Deep Reinforcement Learning for Dexterous Manipulation0
Pseudorehearsal in value function approximation0
Online Learning for Offloading and Autoscaling in Energy Harvesting Mobile Edge Computing0
Multi-step Reinforcement Learning: A Unifying Algorithm0
Bridging the Gap Between Value and Policy Based Reinforcement Learning0
Reinforcement Learning with Deep Energy-Based PoliciesCode0
Learning Control for Air Hockey Striking using Deep Reinforcement Learning0
Show:102550
← PrevPage 37 of 39Next →

No leaderboard results yet.