SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 391400 of 1918 papers

TitleStatusHype
Bridging the Gap Between Value and Policy Based Reinforcement Learning0
APF+: Boosting adaptive-potential function reinforcement learning methods with a W-shaped network for high-dimensional games0
Continuous Deep Q-Learning in Optimal Control Problems: Normalized Advantage Functions Analysis0
A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation0
Application of Deep Q-Network in Portfolio Management0
Continuous-time q-Learning for Jump-Diffusion Models under Tsallis Entropy0
Continuous-time q-learning for mean-field control problems0
Continuous-time Risk-sensitive Reinforcement Learning via Quadratic Variation Penalty0
Breaking the Sample Complexity Barrier to Regret-Optimal Model-Free Reinforcement Learning0
Breaking the Deadly Triad with a Target Network0
Show:102550
← PrevPage 40 of 192Next →

No leaderboard results yet.