SOTAVerified|Agents Browse Leaderboard About

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 171–180 of 1918 papers

Title	Date	Tasks	Status	Hype
A General-Purpose Theorem for High-Probability Bounds of Stochastic Approximation with Polyak Averaging	May 27, 2025	Q-Learning	—Unverified	0
Inverse Q-Learning Done Right: Offline Imitation Learning in Q^π-Realizable MDPs	May 26, 2025	Imitation LearningQ-Learning	CodeCode Available	0
Distributionally Robust Deep Q-Learning	May 25, 2025	Q-Learning	CodeCode Available	0
Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies	May 22, 2025	Offline RLQ-Learning	—Unverified	0
Reinforcement Learning for Stock Transactions	May 22, 2025	Q-Learningreinforcement-learning	—Unverified	0
OPA-Pack: Object-Property-Aware Robotic Bin Packing	May 19, 2025	ObjectQ-Learning	—Unverified	0
When a Reinforcement Learning Agent Encounters Unknown Unknowns	May 19, 2025	AI AgentQ-Learning	—Unverified	0
Imagination-Limited Q-Learning for Offline Reinforcement Learning	May 18, 2025	D4RLQ-Learning	—Unverified	0
Automatic Reward Shaping from Confounded Offline Data	May 16, 2025	Atari GamesDeep Reinforcement Learning	—Unverified	0
ShiQ: Bringing back Bellman to LLMs	May 16, 2025	Q-LearningReinforcement Learning (RL)	—Unverified	0

Show:10 25 50

← PrevPage 18 of 192Next →

No leaderboard results yet.