SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 125 of 1918 papers

TitleStatusHype
Evaluating Reinforcement Learning Algorithms for Navigation in Simulated Robotic Quadrupeds: A Comparative Study Inspired by Guide Dog Behaviour0
Personalized Exercise Recommendation with Semantically-Grounded Knowledge TracingCode0
A Data-Ensemble-Based Approach for Sample-Efficient LQ Control of Linear Time-Varying Systems0
ADDQ: Adaptive Distributional Double Q-LearningCode0
Reinforcement Learning-Based Policy Optimisation For Heterogeneous Radio Access0
Implicit Constraint-Aware Off-Policy Correction for Offline Reinforcement Learning0
ReinDSplit: Reinforced Dynamic Split Learning for Pest Recognition in Precision Agriculture0
"What are my options?": Explaining RL Agents with Diverse Near-Optimal Alternatives (Extended)0
Q-learning-based Hierarchical Cooperative Local Search for Steelmaking-continuous Casting Scheduling Problem0
Regret-Optimal Q-Learning with Low Cost for Single-Agent and Federated Reinforcement Learning0
Bridging the Performance Gap Between Target-Free and Target-Based Reinforcement Learning With Iterated Q-Learning0
Improving Performance of Spike-based Deep Q-Learning using Ternary Neurons0
Reinforcement Learning for Hanabi0
Entropic Risk Optimization in Discounted MDPs: Sample Complexity Bounds with a Generative Model0
On Global Convergence Rates for Federated Policy Gradient under Heterogeneous Environment0
Learning to Charge More: A Theoretical Study of Collusion by Q-Learning Agents0
BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL0
A General-Purpose Theorem for High-Probability Bounds of Stochastic Approximation with Polyak Averaging0
Inverse Q-Learning Done Right: Offline Imitation Learning in Q^π-Realizable MDPsCode0
Distributionally Robust Deep Q-LearningCode0
Reinforcement Learning for Stock Transactions0
Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies0
OPA-Pack: Object-Property-Aware Robotic Bin Packing0
When a Reinforcement Learning Agent Encounters Unknown Unknowns0
Imagination-Limited Q-Learning for Offline Reinforcement Learning0
Show:102550
← PrevPage 1 of 77Next →

No leaderboard results yet.