SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 14011425 of 1918 papers

TitleStatusHype
Untangling Braids with Multi-agent Q-Learning0
Urban traffic dynamic rerouting framework: A DRL-based model with fog-cloud architecture0
User Tampering in Reinforcement Learning Recommender Systems0
Using a Deep Reinforcement Learning Agent for Traffic Signal Control0
Using Deep Q-Learning to Control Optimization Hyperparameters0
Using Deep Q-Learning to Dynamically Toggle between Push/Pull Actions in Computational Trust Mechanisms0
Using Machine Teaching to Investigate Human Assumptions when Teaching Reinforcement Learners0
Using Reinforcement Learning to Herd a Robotic Swarm to a Target Distribution0
Using Reinforcement Learning to Optimize Responses in Care Processes: A Case Study on Aggression Incidents0
Utilizing Maximum Mean Discrepancy Barycenter for Propagating the Uncertainty of Value Functions in Reinforcement Learning0
VA-learning as a more efficient alternative to Q-learning0
Value-Based Reinforcement Learning for Continuous Control Robotic Manipulation in Multi-Task Sparse Reward Settings0
Value function interference and greedy action selection in value-based multi-objective reinforcement learning0
Value-of-Information based Arbitration between Model-based and Model-free Control0
Value Penalized Q-Learning for Recommender Systems0
Value Refinement Network (VRN)0
Vanishing Bias Heuristic-guided Reinforcement Learning Algorithm0
Variance-Reduced Cascade Q-learning: Algorithms and Sample Complexity0
Variance-reduced Q-learning is minimax optimal0
Variance Reduction for Deep Q-Learning using Stochastic Recursive Gradient0
Variance Reduction Methods for Sublinear Reinforcement Learning0
Variational Bayesian Reinforcement Learning with Regret Bounds0
Variational quantum compiling with double Q-learning0
Vehicle management in a modular production context using Deep Q-Learning0
Verification of Dissipativity and Evaluation of Storage Function in Economic Nonlinear MPC using Q-Learning0
Show:102550
← PrevPage 57 of 77Next →

No leaderboard results yet.