Q-Learning

The goal of Q-learning is to learn a policy that tells an agent which action to take in each state. It does so by estimating the expected return (the Q-value) of every state-action pair.
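
To make the definition above concrete, here is a minimal sketch of tabular Q-learning. Everything in it is an illustrative assumption rather than something taken from this page or from any listed paper: the five-state corridor environment, the `step`, `choose_action`, `train_q_learning`, and `greedy_policy` helpers, and the hyperparameter values are all hypothetical.

```python
import random

N_STATES = 5      # hypothetical corridor: states 0..4, state 4 is the terminal goal
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def step(state, action):
    """Toy environment (an assumption for illustration): reward 1 on reaching the goal."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def choose_action(q, state, epsilon, rng):
    """Epsilon-greedy action selection with random tie-breaking."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    best = max(q[state])
    return rng.choice([a for a in ACTIONS if q[state][a] == best])


def train_q_learning(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Learn a Q-table with the standard one-step Q-learning update."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            action = choose_action(q, state, epsilon, rng)
            next_state, reward, done = step(state, action)
            # Bootstrap from the best next action unless the episode has ended.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """The learned policy: the highest-valued action in each non-terminal state."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


q_table = train_q_learning()
print(greedy_policy(q_table))
```

In this toy setting the greedy policy should select "move right" (action 1) in every non-terminal state, since that is the shortest path to the reward. The table-based representation only works for small discrete state spaces; deep variants replace the table with a neural network.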

(Image credit: Playing Atari with Deep Reinforcement Learning)

Papers

Showing 401–450 of 1918 papers

Title	Date	Tasks	Status	Score
A Statistical Analysis of Polyak-Ruppert Averaged Q-learning	Dec 29, 2021	Q-Learning	Code Available	5
Deep Reinforcement Learning for Multi-class Imbalanced Training	May 24, 2022	Deep Reinforcement Learning, imbalanced classification	Code Available	5
Control with adaptive Q-learning	Nov 3, 2020	OpenAI Gym, Q-Learning	Code Available	5
Adversarial Learning of a Sampler Based on an Unnormalized Distribution	Jan 3, 2019	Form, Q-Learning	Code Available	5
Deep reinforcement learning for time series: playing idealized trading games	Mar 11, 2018	Deep Reinforcement Learning, Q-Learning	Code Available	5
Deep Reinforcement Learning for Control of Probabilistic Boolean Networks	Sep 7, 2019	Deep Reinforcement Learning, Q-Learning	Code Available	5
A Fairness-Oriented Reinforcement Learning Approach for the Operation and Control of Shared Micromobility Services	Mar 23, 2024	Fairness, Q-Learning	Code Available	5
Probing Implicit Bias in Semi-gradient Q-learning: Visualizing the Effective Loss Landscapes via the Fokker-Planck Equation	Jun 12, 2024	Q-Learning	Code Available	5
Deep Reinforcement Learning Based Parameter Control in Differential Evolution	May 20, 2019	Deep Reinforcement Learning, Q-Learning	Code Available	5
Provably efficient RL with Rich Observations via Latent State Decoding	Jan 25, 2019	Clustering, Q-Learning	Code Available	5
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model	Oct 27, 2024	D4RL, Q-Learning	Code Available	5
QLBS: Q-Learner in the Black-Scholes(-Merton) Worlds	Dec 13, 2017	Benchmarking, Model-based Reinforcement Learning	Code Available	5
Deep Reinforcement Learning for Imbalanced Classification	Jan 5, 2019	Classification, Decision Making	Code Available	5
Q-Learning Lagrange Policies for Multi-Action Restless Bandits	Jun 22, 2021	Multi-Armed Bandits, Q-Learning	Code Available	5
Deep Reinforcement Learning for Traffic Light Control in Vehicular Networks	Mar 29, 2018	Deep Reinforcement Learning, Q-Learning	Code Available	5
Deep Quality-Value (DQV) Learning	Sep 30, 2018	Atari Games, Deep Reinforcement Learning	Code Available	5
DeepQTest: Testing Autonomous Driving Systems with Reinforcement Learning and Real-world Weather Data	Oct 8, 2023	Autonomous Driving, Q-Learning	Code Available	5
Automaton-Guided Curriculum Generation for Reinforcement Learning Agents	Apr 11, 2023	Decision Making, Q-Learning	Code Available	5
ADDQ: Adaptive Distributional Double Q-Learning	Jun 24, 2025	Distributional Reinforcement Learning, MuJoCo	Code Available	5
Deep Q-learning from Demonstrations	Apr 12, 2017	Decision Making, Deep Reinforcement Learning	Code Available	5
Deep Recurrent Q-Learning vs Deep Q-Learning on a simple Partially Observable Markov Decision Process with Minecraft	Mar 11, 2019	Minecraft, Q-Learning	Code Available	5
Reinforcement Learning for Sampling on Temporal Medical Imaging Sequences	Aug 28, 2023	Image Reconstruction, Q-Learning	Code Available	5
Deep Q-Learning based Reinforcement Learning Approach for Network Intrusion Detection	Nov 27, 2021	Intrusion Detection, Network Intrusion Detection	Code Available	5
Deep Q-learning: a robust control approach	Jan 21, 2022	OpenAI Gym, Q-Learning	Code Available	5
Deep Q learning for fooling neural networks	Nov 13, 2018	Q-Learning, Reinforcement Learning	Code Available	5
Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning	Dec 18, 2017	Deep Reinforcement Learning, Evolutionary Algorithms	Code Available	5
A Comparison of Reward Functions in Q-Learning Applied to a Cart Position Problem	May 25, 2021	Position, Q-Learning	Code Available	5
Deep Ordinal Reinforcement Learning	May 6, 2019	Deep Reinforcement Learning, OpenAI Gym	Code Available	5
Revisiting Prioritized Experience Replay: A Value Perspective	Feb 5, 2021	Atari Games, Q-Learning	Code Available	5
Revisiting the Softmax Bellman Operator: New Benefits and New Perspective	Dec 2, 2018	Atari Games, Q-Learning	Code Available	5
Deep Q-Learning for Nash Equilibria: Nash-DQN	Apr 23, 2019	Q-Learning, Reinforcement Learning	Code Available	5
Deep Reinforcement Learning Algorithms for Option Hedging	Apr 7, 2025	Deep Reinforcement Learning, Q-Learning	Code Available	5
Automatic Data Augmentation by Learning the Deterministic Policy	Oct 18, 2019	Data Augmentation, Deep Reinforcement Learning	Code Available	5
Crowd Intelligence for Early Misinformation Prediction on Social Media	Aug 8, 2024	Fact Checking, Misinformation	Code Available	5
A Kernel Loss for Solving the Bellman Equation	May 25, 2019	Q-Learning, Reinforcement Learning	Code Available	5
SABER: Data-Driven Motion Planner for Autonomously Navigating Heterogeneous Robots	Aug 3, 2021	Model Predictive Control, Motion Planning	Code Available	5
CytonRL: an Efficient Reinforcement Learning Open-source Toolkit Implemented in C++	Apr 14, 2018	GPU, Q-Learning	Code Available	5
Automata Learning meets Shielding	Dec 4, 2022	Q-Learning, Reinforcement Learning (RL)	Code Available	5
Adaptive Symmetric Reward Noising for Reinforcement Learning	May 24, 2019	Autonomous Driving, Q-Learning	Code Available	5
Schrödinger's Camera: First Steps Towards a Quantum-Based Privacy Preserving Camera	Mar 13, 2023	Privacy Preserving, Q-Learning	Code Available	5
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference	Oct 15, 2019	Q-Learning, Reinforcement Learning	Code Available	5
Self-Learning Cloud Controllers: Fuzzy Q-Learning for Knowledge Evolution	Jul 2, 2015	Q-Learning, Self-Learning	Code Available	5
Deep Jump Learning for Off-Policy Evaluation in Continuous Treatment Settings	Oct 29, 2020	Change Point Detection, Off-policy evaluation	Code Available	5
Decoding fairness: a reinforcement learning perspective	Dec 20, 2024	Fairness, Imitation Learning	Code Available	5
Deep Active Inference for Pixel-Based Discrete Control: Evaluation on the Car Racing Problem	Sep 9, 2021	Car Racing, Q-Learning	Code Available	5
Dynamic-Weighted Simplex Strategy for Learning Enabled Cyber Physical Systems	Feb 6, 2019	Autonomous Driving, Q-Learning	Code Available	5
Augmented Q Imitation Learning (AQIL)	Mar 31, 2020	Deep Reinforcement Learning, Imitation Learning	Code Available	5
Decision Making in Non-Stationary Environments with Policy-Augmented Search	Jan 6, 2024	Decision Making, Decision Making Under Uncertainty	Code Available	5
Solving reward-collecting problems with UAVs: a comparison of online optimization and Q-learning	Nov 30, 2021	Autonomous Vehicles, Q-Learning	Code Available	5
Deep Coordination Graphs	Sep 27, 2019	Multi-agent Reinforcement Learning, Q-Learning	Code Available	5

