
Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.
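As a rough illustration of how such a policy can be learned, here is a minimal sketch of tabular Q-learning. The corridor environment, the function names (`q_learning`, `greedy_policy`), and the hyperparameter values are assumptions made up for this example and are not taken from any paper listed on this page.

```python
import random

LEFT, RIGHT = 0, 1  # the two actions in this hypothetical corridor


def q_learning(n_states=5, episodes=500, alpha=0.1, gamma=0.9,
               epsilon=0.1, seed=0):
    """Tabular Q-learning on a toy 1-D corridor (illustrative only).

    The agent starts in state 0; reaching the last state ends the
    episode with reward 1. Every other transition gives reward 0.
    """
    rng = random.Random(seed)
    goal = n_states - 1
    q = [[0.0, 0.0] for _ in range(n_states)]  # q[state][action]

    for _ in range(episodes):
        state = 0
        while state != goal:
            # Epsilon-greedy action selection with random tie-breaking.
            if rng.random() < epsilon:
                action = rng.randrange(2)
            else:
                best = max(q[state])
                action = rng.choice(
                    [a for a in (LEFT, RIGHT) if q[state][a] == best])

            # Deterministic transition: left is clamped at state 0.
            next_state = max(0, state - 1) if action == LEFT else state + 1
            reward = 1.0 if next_state == goal else 0.0

            # Q-learning update: bootstrap from the best next action.
            # The terminal state contributes no future value.
            best_next = 0.0 if next_state == goal else max(q[next_state])
            td_target = reward + gamma * best_next
            q[state][action] += alpha * (td_target - q[state][action])

            state = next_state
    return q


def greedy_policy(q):
    """Pick the highest-valued action in each state (first index on ties)."""
    return [max(range(len(row)), key=lambda a: row[a]) for row in q]


if __name__ == "__main__":
    q_table = q_learning()
    print(greedy_policy(q_table)[:-1])  # policy for the non-terminal states
```

In this toy setting the learned greedy policy is expected to choose `RIGHT` in every non-terminal state, since that is the shortest route to the reward. The table `q` plays the role of the action-value function, and the policy is read off from it by taking the best-valued action per state.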

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers


Showing 1181–1190 of 1918 papers

Title	Date	Tasks	Status	Hype	Score
Representation Learning for Context-Dependent Decision-Making	May 12, 2022	Decision Making, Q-Learning	Unverified	0	0
Representing Entropy : A short proof of the equivalence between soft Q-learning and policy gradients	Jan 1, 2018	Q-Learning, reinforcement-learning	Unverified	0	0
Reputation Bootstrapping for Composite Services using CP-nets	May 27, 2021	Q-Learning	Unverified	0	0
Residual Policy Gradient: A Reward View of KL-regularized Objective	Mar 14, 2025	Imitation Learning, MuJoCo	Unverified	0	0
Residual Q-Learning: Offline and Online Policy Customization without Value	Jun 15, 2023	Imitation Learning, Q-Learning	Unverified	0	0
Resilient UAV Trajectory Planning via Few-Shot Meta-Offline Reinforcement Learning	Feb 3, 2025	Meta-Learning, Offline RL	Unverified	0	0
The state-of-the-art review on resource allocation problem using artificial intelligence methods on various computing paradigms	Mar 23, 2022	Cloud Computing, Deep Reinforcement Learning	Unverified	0	0
REValueD: Regularised Ensemble Value-Decomposition for Factorisable Markov Decision Processes	Jan 16, 2024	Multi-agent Reinforcement Learning, Q-Learning	Unverified	0	0
Reverse Experience Replay	Oct 19, 2019	Q-Learning	Unverified	0	0
Reversible Action Design for Combinatorial Optimization with Reinforcement Learning	Feb 14, 2021	Combinatorial Optimization, Q-Learning	Unverified	0	0


No leaderboard results yet.