Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1701–1750 of 1918 papers

Title	Date	Tasks	Status
Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic	Nov 7, 2016	continuous-controlContinuous Control	CodeCode Available
Factors of Influence of the Overestimation Bias of Q-Learning	Oct 11, 2022	Q-Learning	CodeCode Available
MacLight: Multi-scene Aggregation Convolutional Learning for Traffic Signal Control	Dec 20, 2024	Graph AttentionQ-Learning	CodeCode Available
ConQUR: Mitigating Delusional Bias in Deep Q-learning	Feb 27, 2020	Atari GamesQ-Learning	CodeCode Available
DeepQTest: Testing Autonomous Driving Systems with Reinforcement Learning and Real-world Weather Data	Oct 8, 2023	Autonomous DrivingQ-Learning	CodeCode Available
Making Deep Q-learning methods robust to time discretization	Jan 28, 2019	Deep Reinforcement LearningQ-Learning	CodeCode Available
Bootstrapped Meta-Learning	Sep 9, 2021	Efficient ExplorationFew-Shot Learning	CodeCode Available
Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning	Jul 8, 2021	Hierarchical Reinforcement LearningQ-Learning	CodeCode Available
Deep Q-learning from Demonstrations	Apr 12, 2017	Decision MakingDeep Reinforcement Learning	CodeCode Available
Reinforcement Learning with Low-Complexity Liquid State Machines	Jun 4, 2019	Atari GamesDeep Reinforcement Learning	CodeCode Available
The Effects of Memory Replay in Reinforcement Learning	Oct 18, 2017	Deep Reinforcement LearningQ-Learning	CodeCode Available
Compressed Federated Reinforcement Learning with a Generative Model	Mar 26, 2024	modelQ-Learning	CodeCode Available
Deep Q-Learning for Nash Equilibria: Nash-DQN	Apr 23, 2019	Q-LearningReinforcement Learning	CodeCode Available
Mastering Percolation-like Games with Deep Learning	May 12, 2023	Deep LearningQ-Learning	CodeCode Available
Towards Symbolic Reinforcement Learning with Common Sense	Apr 23, 2018	Common Sense ReasoningDeep Reinforcement Learning	CodeCode Available
A Novel Update Mechanism for Q-Networks Based On Extreme Learning Machines	Jun 4, 2020	Q-Learningreinforcement-learning	CodeCode Available
Deep Q learning for fooling neural networks	Nov 13, 2018	Q-LearningReinforcement Learning	CodeCode Available
Deep Q-Learning based Reinforcement Learning Approach for Network Intrusion Detection	Nov 27, 2021	Intrusion DetectionNetwork Intrusion Detection	CodeCode Available
An intelligent financial portfolio trading strategy using deep Q-learning	Jul 8, 2019	Q-Learning	CodeCode Available
Efficient Parallel Reinforcement Learning Framework using the Reactor Model	Dec 7, 2023	OpenAI GymQ-Learning	CodeCode Available
A Fairness-Oriented Reinforcement Learning Approach for the Operation and Control of Shared Micromobility Services	Mar 23, 2024	FairnessQ-Learning	CodeCode Available
Remember and Forget for Experience Replay	Jul 16, 2018	Deep Reinforcement LearningPolicy Gradient Methods	CodeCode Available
Meta-Black-Box-Optimization through Offline Q-function Learning	May 4, 2025	BenchmarkingMamba	CodeCode Available
UNIQ: Offline Inverse Q-learning for Avoiding Undesirable Demonstrations	Oct 10, 2024	Imitation LearningQ-Learning	CodeCode Available
Meta-Q-Learning	Sep 30, 2019	continuous-controlContinuous Control	CodeCode Available
Meta-Value Learning: a General Framework for Learning with Learning Awareness	Jul 17, 2023	Q-Learning	CodeCode Available
Adversarial Learning of a Sampler Based on an Unnormalized Distribution	Jan 3, 2019	FormQ-Learning	CodeCode Available
Deep Q-learning: a robust control approach	Jan 21, 2022	OpenAI GymQ-Learning	CodeCode Available
Deep Ordinal Reinforcement Learning	May 6, 2019	Deep Reinforcement LearningOpenAI Gym	CodeCode Available
Imitating from auxiliary imperfect demonstrations via Adversarial Density Weighted Regression	May 28, 2024	Imitation LearningMuJoCo	CodeCode Available
Orchestrated Value Mapping for Reinforcement Learning	Mar 14, 2022	Ensemble LearningQ-Learning	CodeCode Available
Angrier Birds: Bayesian reinforcement learning	Jan 6, 2016	Efficient ExplorationQ-Learning	CodeCode Available
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning	Oct 27, 2019	Deep Reinforcement LearningImitation Learning	CodeCode Available
Offline Contextual Bandits with Overparameterized Models	Jun 27, 2020	Multi-Armed BanditsQ-Learning	CodeCode Available
An Empirical Study of Deep Reinforcement Learning in Continuing Tasks	Jan 12, 2025	Deep Reinforcement LearningMuJoCo	CodeCode Available
Simulation of Nanorobots with Artificial Intelligence and Reinforcement Learning for Advanced Cancer Cell Detection and Tracking	Nov 4, 2024	Cell DetectionNavigate	CodeCode Available
PairVDN - Pair-wise Decomposed Value Functions	Mar 12, 2025	Q-Learning	CodeCode Available
Finite-Sample Analysis of Nonlinear Stochastic Approximation with Applications in Reinforcement Learning	May 27, 2019	Q-Learningreinforcement-learning	CodeCode Available
Simultaneous Double Q-learning with Conservative Advantage Learning for Actor-Critic Methods	May 8, 2022	continuous-controlContinuous Control	CodeCode Available
Mixed-Integer Optimal Control via Reinforcement Learning: A Case Study on Hybrid Electric Vehicle Energy Management	May 2, 2023	continuous-controlContinuous Control	CodeCode Available
Variation-resistant Q-learning: Controlling and Utilizing Estimation Bias in Reinforcement Learning for Better Performance	Feb 1, 2021	Q-Learningreinforcement-learning	CodeCode Available
Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation	Jul 24, 2023	GPUQ-Learning	CodeCode Available
Parameter-free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients	Sep 24, 2021	continuous-controlContinuous Control	CodeCode Available
RadDQN: a Deep Q Learning-based Architecture for Finding Time-efficient Minimum Radiation Exposure Pathway	Feb 1, 2024	Decision MakingDeep Reinforcement Learning	CodeCode Available
Boosting Soft Q-Learning by Bounding	Jun 26, 2024	Q-Learning	CodeCode Available
Automaton-Guided Curriculum Generation for Reinforcement Learning Agents	Apr 11, 2023	Decision MakingQ-Learning	CodeCode Available
Variations on the Reinforcement Learning performance of Blackjack	Aug 9, 2023	Q-Learningreinforcement-learning	CodeCode Available
Model-Free Adaptive Optimal Control of Episodic Fixed-Horizon Manufacturing Processes using Reinforcement Learning	Sep 18, 2018	Model Predictive ControlQ-Learning	CodeCode Available
Deterministic Implementations for Reproducibility in Deep Reinforcement Learning	Sep 15, 2018	Deep Reinforcement LearningQ-Learning	CodeCode Available
Designing Neural Network Architectures using Reinforcement Learning	Nov 7, 2016	General Classificationimage-classification	CodeCode Available

Show:10 25 50

← PrevPage 35 of 39Next →

No leaderboard results yet.