SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 121130 of 1918 papers

TitleStatusHype
FlapAI Bird: Training an Agent to Play Flappy Bird Using Reinforcement Learning TechniquesCode1
DisCor: Corrective Feedback in Reinforcement Learning via Distribution CorrectionCode1
FACMAC: Factored Multi-Agent Centralised Policy GradientsCode1
Optimistic Exploration even with a Pessimistic InitialisationCode1
Maxmin Q-learning: Controlling the Estimation Bias of Q-learningCode1
A Stochastic Game Framework for Efficient Energy Management in Microgrid NetworksCode1
Discriminator Soft Actor Critic without Extrinsic RewardsCode1
An Optimistic Perspective on Offline Deep Reinforcement LearningCode1
Benchmarking Batch Deep Reinforcement Learning AlgorithmsCode1
Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver?Code1
Show:102550
← PrevPage 13 of 192Next →

No leaderboard results yet.