SOTAVerified|Agents Browse Leaderboard About Blog

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–25 of 1918 papers

Title	Date	Tasks	Status
Evaluating Reinforcement Learning Algorithms for Navigation in Simulated Robotic Quadrupeds: A Comparative Study Inspired by Guide Dog Behaviour	Jul 17, 2025	Autonomous NavigationQ-Learning	—Unverified
Personalized Exercise Recommendation with Semantically-Grounded Knowledge Tracing	Jul 15, 2025	Knowledge TracingMath	CodeCode Available
A Data-Ensemble-Based Approach for Sample-Efficient LQ Control of Linear Time-Varying Systems	Jun 30, 2025	Q-Learning	—Unverified
ADDQ: Adaptive Distributional Double Q-Learning	Jun 24, 2025	Distributional Reinforcement LearningMuJoCo	CodeCode Available
Reinforcement Learning-Based Policy Optimisation For Heterogeneous Radio Access	Jun 18, 2025	Q-Learningreinforcement-learning	—Unverified
Implicit Constraint-Aware Off-Policy Correction for Offline Reinforcement Learning	Jun 16, 2025	Q-Learning	—Unverified
ReinDSplit: Reinforced Dynamic Split Learning for Pest Recognition in Precision Agriculture	Jun 16, 2025	Q-LearningReinforcement Learning (RL)	—Unverified
"What are my options?": Explaining RL Agents with Diverse Near-Optimal Alternatives (Extended)	Jun 11, 2025	DiversityQ-Learning	—Unverified
Q-learning-based Hierarchical Cooperative Local Search for Steelmaking-continuous Casting Scheduling Problem	Jun 10, 2025	Q-LearningScheduling	—Unverified
Regret-Optimal Q-Learning with Low Cost for Single-Agent and Federated Reinforcement Learning	Jun 5, 2025	Q-LearningReinforcement Learning (RL)	—Unverified
Bridging the Performance Gap Between Target-Free and Target-Based Reinforcement Learning With Iterated Q-Learning	Jun 4, 2025	Q-Learning	—Unverified
Improving Performance of Spike-based Deep Q-Learning using Ternary Neurons	Jun 3, 2025	Atari GamesDecision Making	—Unverified
Reinforcement Learning for Hanabi	May 31, 2025	Card GamesDeep Reinforcement Learning	—Unverified
Entropic Risk Optimization in Discounted MDPs: Sample Complexity Bounds with a Generative Model	May 30, 2025	Q-Learning	—Unverified
On Global Convergence Rates for Federated Policy Gradient under Heterogeneous Environment	May 29, 2025	Federated LearningPolicy Gradient Methods	—Unverified
Learning to Charge More: A Theoretical Study of Collusion by Q-Learning Agents	May 28, 2025	Q-Learning	—Unverified
BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL	May 28, 2025	Bayesian OptimizationHyperparameter Optimization	—Unverified
A General-Purpose Theorem for High-Probability Bounds of Stochastic Approximation with Polyak Averaging	May 27, 2025	Q-Learning	—Unverified
Inverse Q-Learning Done Right: Offline Imitation Learning in Q^π-Realizable MDPs	May 26, 2025	Imitation LearningQ-Learning	CodeCode Available
Distributionally Robust Deep Q-Learning	May 25, 2025	Q-Learning	CodeCode Available
Reinforcement Learning for Stock Transactions	May 22, 2025	Q-Learningreinforcement-learning	—Unverified
Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies	May 22, 2025	Offline RLQ-Learning	—Unverified
OPA-Pack: Object-Property-Aware Robotic Bin Packing	May 19, 2025	ObjectQ-Learning	—Unverified
When a Reinforcement Learning Agent Encounters Unknown Unknowns	May 19, 2025	AI AgentQ-Learning	—Unverified
Imagination-Limited Q-Learning for Offline Reinforcement Learning	May 18, 2025	D4RLQ-Learning	—Unverified

Show:10 25 50

← PrevPage 1 of 77Next →

No leaderboard results yet.