Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1651–1700 of 1918 papers

Title	Date	Tasks	Status
Double Successive Over-Relaxation Q-Learning with an Extension to Deep Reinforcement Learning	Sep 10, 2024	Deep Reinforcement LearningOpenAI Gym	CodeCode Available
On Solving the 2-Dimensional Greedy Shooter Problem for UAVs	Nov 2, 2019	Q-Learningreinforcement-learning	CodeCode Available
Q-Learning Lagrange Policies for Multi-Action Restless Bandits	Jun 22, 2021	Multi-Armed BanditsQ-Learning	CodeCode Available
Balancing Rational and Other-Regarding Preferences in Cooperative-Competitive Environments	Feb 24, 2021	Multi-agent Reinforcement LearningQ-Learning	CodeCode Available
ADDQ: Adaptive Distributional Double Q-Learning	Jun 24, 2025	Distributional Reinforcement LearningMuJoCo	CodeCode Available
Learning To Play Atari Games Using Dueling Q-Learning and Hebbian Plasticity	May 22, 2024	Atari GamesDeep Reinforcement Learning	CodeCode Available
Learning to Play in a Day: Faster Deep Reinforcement Learning by Optimality Tightening	Nov 5, 2016	Atari GamesDeep Reinforcement Learning	CodeCode Available
Using deep Q-learning to understand the tax evasion behavior of risk-averse firms	Jan 29, 2018	Deep Reinforcement LearningQ-Learning	CodeCode Available
On the Estimation Bias in Double Q-Learning	Sep 29, 2021	Q-LearningValue prediction	CodeCode Available
Self-Learning Cloud Controllers: Fuzzy Q-Learning for Knowledge Evolution	Jul 2, 2015	Q-LearningSelf-Learning	CodeCode Available
Contextualized Hybrid Ensemble Q-learning: Learning Fast with Control Priors	Jun 28, 2024	Car RacingQ-Learning	CodeCode Available
Self Punishment and Reward Backfill for Deep Q-Learning	Apr 10, 2020	Atari GamesDeep Reinforcement Learning	CodeCode Available
Enhancing Robot Assistive Behaviour with Reinforcement Learning and Theory of Mind	Nov 11, 2024	Q-Learning	CodeCode Available
Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement Learning	Jul 5, 2021	Atari GamesDeep Reinforcement Learning	CodeCode Available
Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation	Sep 29, 2017	Deep Reinforcement LearningNavigate	CodeCode Available
Double Q-PID algorithm for mobile robot control	Nov 1, 2018	Active LearningQ-Learning	CodeCode Available
Adaptive Symmetric Reward Noising for Reinforcement Learning	May 24, 2019	Autonomous DrivingQ-Learning	CodeCode Available
Learning Visual Tracking and Reaching with Deep Reinforcement Learning on a UR10e Robotic Arm	Aug 28, 2023	Deep Reinforcement LearningQ-Learning	CodeCode Available
Stochastic approximation with cone-contractive operators: Sharp _-bounds for Q-learning	May 15, 2019	Q-LearningReinforcement Learning	CodeCode Available
Distributionally Robust Deep Q-Learning	May 25, 2025	Q-Learning	CodeCode Available
Least-Squares Policy Iteration	Dec 4, 2003	Q-Learningreinforcement-learning	CodeCode Available
Leveraging Digital Cousins for Ensemble Q-Learning in Large-Scale Wireless Networks	Feb 12, 2024	Ensemble LearningManagement	CodeCode Available
Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation	Sep 1, 2021	Deep Reinforcement LearningGeneral Reinforcement Learning	CodeCode Available
Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning	Oct 5, 2022	Deep Reinforcement LearningQ-Learning	CodeCode Available
Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods	Sep 22, 2021	continuous-controlContinuous Control	CodeCode Available
A critical assessment of reinforcement learning methods for microswimmer navigation in complex flows	May 8, 2025	Autonomous NavigationHyperparameter Optimization	CodeCode Available
Approximating two value functions instead of one: towards characterizing a new family of Deep Reinforcement Learning algorithms	Sep 1, 2019	Deep Reinforcement LearningQ-Learning	CodeCode Available
Multi-intention Inverse Q-learning for Interpretable Behavior Representation	Nov 23, 2023	Decision MakingQ-Learning	CodeCode Available
Reinforcement Learning with A* and a Deep Heuristic	Nov 19, 2018	Q-Learningreinforcement-learning	CodeCode Available
Evolution of cooperation in a bimodal mixture of conditional cooperators	Feb 11, 2025	Q-Learning	CodeCode Available
Route Planning for Last-Mile Deliveries Using Mobile Parcel Lockers: A Hybrid Q-Learning Network Approach	Sep 9, 2022	Q-Learning	CodeCode Available
Logical Specifications-guided Dynamic Task Sampling for Reinforcement Learning Agents	Feb 6, 2024	continuous-controlContinuous Control	CodeCode Available
Distributed-Training-and-Execution Multi-Agent Reinforcement Learning for Power Control in HetNet	Dec 15, 2022	Deep Reinforcement LearningMulti-agent Reinforcement Learning	CodeCode Available
Examining Policy Entropy of Reinforcement Learning Agents for Personalization Tasks	Nov 21, 2022	Q-Learningreinforcement-learning	CodeCode Available
Reinforcement Learning with Deep Energy-Based Policies	Feb 27, 2017	Q-Learningreinforcement-learning	CodeCode Available
Scaling All-Goals Updates in Reinforcement Learning Using Convolutional Neural Networks	Oct 6, 2018	AllMontezuma's Revenge	CodeCode Available
Reinforcement Learning with Dynamic Boltzmann Softmax Updates	Mar 14, 2019	Atari GamesQ-Learning	CodeCode Available
Conservative and Risk-Aware Offline Multi-Agent Reinforcement Learning	Feb 13, 2024	Multi-agent Reinforcement LearningQ-Learning	CodeCode Available
Explainable and Safe Reinforcement Learning for Autonomous Air Mobility	Nov 24, 2022	Adversarial AttackDeep Reinforcement Learning	CodeCode Available
QMR:Q-learning based Multi-objective optimization Routing protocol for Flying Ad Hoc Networks	Nov 27, 2019	Q-Learning	CodeCode Available
Lookahead-Bounded Q-Learning	Jun 28, 2020	Q-Learning	CodeCode Available
Introspective Experience Replay: Look Back When Surprised	Jun 7, 2022	Q-Learningreinforcement-learning	CodeCode Available
Welfare and Fairness in Multi-objective Reinforcement Learning	Nov 30, 2022	FairnessMulti-Objective Reinforcement Learning	CodeCode Available
Low-rank State-action Value-function Approximation	Apr 18, 2021	Q-Learning	CodeCode Available
Deep Quality-Value (DQV) Learning	Sep 30, 2018	Atari GamesDeep Reinforcement Learning	CodeCode Available
M^2DQN: A Robust Method for Accelerating Deep Q-learning Network	Sep 16, 2022	Q-Learningreinforcement-learning	CodeCode Available
Adaptive Discretization for Episodic Reinforcement Learning in Metric Spaces	Oct 17, 2019	Q-Learningreinforcement-learning	CodeCode Available
Bootstrapped Q-learning with Context Relevant Observation Pruning to Generalize in Text-based Games	Sep 24, 2020	Q-LearningReinforcement Learning (RL)	CodeCode Available
Exploring reinforcement learning techniques for discrete and continuous control tasks in the MuJoCo environment	Jul 20, 2023	continuous-controlContinuous Control	CodeCode Available
Diagnosing Bottlenecks in Deep Q-learning Algorithms	Feb 26, 2019	continuous-controlContinuous Control	CodeCode Available

Show:10 25 50

← PrevPage 34 of 39Next →

No leaderboard results yet.