SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 10111020 of 1918 papers

TitleStatusHype
Provably Efficient Multi-Agent Reinforcement Learning with Fully Decentralized Communication0
Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle0
Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle0
Provably Efficient Q-Learning with Low Switching Cost0
Provably Efficient Reinforcement Learning with Aggregated States0
Provably Efficient Reinforcement Learning in Decentralized General-Sum Markov Games0
Provably More Efficient Q-Learning in the One-Sided-Feedback/Full-Feedback Settings0
Direct Data-Driven Discrete-time Bilinear Biquadratic Regulator0
Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning Approach to Critical Care0
Pseudorehearsal in value function approximation0
Show:102550
← PrevPage 102 of 192Next →

No leaderboard results yet.