Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1751–1800 of 1918 papers

Title	Date	Tasks	Status
UCB Momentum Q-learning: Correcting the bias without forgetting	Mar 1, 2021	Q-Learning	CodeCode Available
PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning	Jul 16, 2020	Policy Gradient MethodsQ-Learning	CodeCode Available
Performing Deep Recurrent Double Q-Learning for Atari Games	Aug 16, 2019	Atari GamesDeep Reinforcement Learning	CodeCode Available
Model-free and Bayesian Ensembling Model-based Deep Reinforcement Learning for Particle Accelerator Control Demonstrated on the FERMI FEL	Dec 17, 2020	Deep Reinforcement Learningmodel	CodeCode Available
Single-partition adaptive Q-learning	Jul 14, 2020	Q-LearningReinforcement Learning (RL)	CodeCode Available
Active inference: demystified and compared	Sep 24, 2019	Atari GamesOpenAI Gym	CodeCode Available
BlockQNN: Efficient Block-wise Neural Network Architecture Generation	Aug 16, 2018	GPUimage-classification	CodeCode Available
From Two-Dimensional to Three-Dimensional Environment with Q-Learning: Modeling Autonomous Navigation with Reinforcement Learning and no Libraries	Mar 27, 2024	Autonomous NavigationDecision Making	CodeCode Available
Using Reward Machines for High-Level Task Specification and Decomposition in Reinforcement Learning	Jul 1, 2018	Hierarchical Reinforcement LearningQ-Learning	CodeCode Available
Model-free Motion Planning of Autonomous Agents for Complex Tasks in Partially Observable Environments	Apr 30, 2023	Motion PlanningQ-Learning	CodeCode Available
Automatic Data Augmentation by Learning the Deterministic Policy	Oct 18, 2019	Data AugmentationDeep Reinforcement Learning	CodeCode Available
DeepTraffic: Crowdsourced Hyperparameter Tuning of Deep Reinforcement Learning Systems for Multi-Agent Dense Traffic Navigation	Jan 9, 2018	Autonomous DrivingAutonomous Navigation	CodeCode Available
GAN Q-learning	May 13, 2018	Distributional Reinforcement LearningOpenAI Gym	CodeCode Available
Personalized Exercise Recommendation with Semantically-Grounded Knowledge Tracing	Jul 15, 2025	Knowledge TracingMath	CodeCode Available
Revisiting Fundamentals of Experience Replay	Jul 13, 2020	Deep Reinforcement LearningDQN Replay Dataset	CodeCode Available
Bridging the Gap Between Target Networks and Functional Regularization	Jun 4, 2021	Deep Reinforcement LearningQ-Learning	CodeCode Available
Comprehensible Context-driven Text Game Playing	May 6, 2019	Q-Learning	CodeCode Available
Generalized Speedy Q-learning	Nov 1, 2019	Q-LearningReinforcement Learning	CodeCode Available
Generalized Value Iteration Networks: Life Beyond Lattices	Jun 8, 2017	Q-Learning	CodeCode Available
Generating a Graph Colouring Heuristic with Deep Q-Learning and Graph Neural Networks	Apr 8, 2023	Deep Reinforcement LearningGraph Neural Network	CodeCode Available
Revisiting Prioritized Experience Replay: A Value Perspective	Feb 5, 2021	Atari GamesQ-Learning	CodeCode Available
Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning	Dec 18, 2017	Deep Reinforcement LearningEvolutionary Algorithms	CodeCode Available
GHQ: Grouped Hybrid Q Learning for Heterogeneous Cooperative Multi-agent Reinforcement Learning	Mar 2, 2023	Multi-agent Reinforcement LearningQ-Learning	CodeCode Available
Revisiting the Softmax Bellman Operator: New Benefits and New Perspective	Dec 2, 2018	Atari GamesQ-Learning	CodeCode Available
Deep Jump Learning for Off-Policy Evaluation in Continuous Treatment Settings	Oct 29, 2020	Change Point DetectionOff-policy evaluation	CodeCode Available
Goal-Conditioned Q-Learning as Knowledge Distillation	Aug 28, 2022	Knowledge DistillationQ-Learning	CodeCode Available
Reward Delay Attacks on Deep Reinforcement Learning	Sep 8, 2022	Deep Reinforcement LearningQ-Learning	CodeCode Available
Goal Recognition as Reinforcement Learning	Feb 13, 2022	Q-Learningreinforcement-learning	CodeCode Available
Traceable Group-Wise Self-Optimizing Feature Transformation Learning: A Dual Optimization Perspective	Jun 29, 2023	Feature EngineeringQ-Learning	CodeCode Available
DeepTPI: Test Point Insertion with Deep Reinforcement Learning	Jun 7, 2022	Deep Reinforcement LearningGraph Neural Network	CodeCode Available
Composable Deep Reinforcement Learning for Robotic Manipulation	Mar 19, 2018	Deep Reinforcement LearningQ-Learning	CodeCode Available
Graph Backup: Data Efficient Backup Exploiting Markovian Transitions	May 31, 2022	Atari Gamescounterfactual	CodeCode Available
Automata Learning meets Shielding	Dec 4, 2022	Q-LearningReinforcement Learning (RL)	CodeCode Available
Dynamic-Weighted Simplex Strategy for Learning Enabled Cyber Physical Systems	Feb 6, 2019	Autonomous DrivingQ-Learning	CodeCode Available
Momentum-based Accelerated Q-learning	Oct 23, 2019	Atari GamesQ-Learning	CodeCode Available
SPRINQL: Sub-optimal Demonstrations driven Offline Imitation Learning	Feb 20, 2024	Imitation LearningQ-Learning	CodeCode Available
Monte Carlo Q-learning for General Game Playing	Feb 16, 2018	Board GamesQ-Learning	CodeCode Available
Deep Coordination Graphs	Sep 27, 2019	Multi-agent Reinforcement LearningQ-Learning	CodeCode Available
Group Equivariant Deep Reinforcement Learning	Jul 1, 2020	Deep Reinforcement LearningInductive Bias	CodeCode Available
Autoequivariant Network Search via Group Decomposition	Apr 10, 2021	Inductive BiasNeural Architecture Search	CodeCode Available
Multi-Agent Advisor Q-Learning	Oct 26, 2021	Decision MakingMulti-agent Reinforcement Learning	CodeCode Available
Action Candidate Driven Clipped Double Q-learning for Discrete and Continuous Action Tasks	Mar 22, 2022	Q-Learning	CodeCode Available
Playing 2048 With Reinforcement Learning	Oct 20, 2021	Playing the Game of 2048Q-Learning	CodeCode Available
Deep Active Inference for Pixel-Based Discrete Control: Evaluation on the Car Racing Problem	Sep 9, 2021	Car RacingQ-Learning	CodeCode Available
Deep Reinforcement Learning with a Natural Language Action Space	Nov 14, 2015	Deep Reinforcement LearningQ-Learning	CodeCode Available
Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL	Jul 20, 2024	Few-Shot Text ClassificationQ-Learning	CodeCode Available
Decoding fairness: a reinforcement learning perspective	Dec 20, 2024	FairnessImitation Learning	CodeCode Available
Deep-Q Learning with Hybrid Quantum Neural Network on Solving Maze Problems	Apr 20, 2023	Q-Learningreinforcement-learning	CodeCode Available
Multi-Agent Deep Reinforcement Learning for Dynamic Power Allocation in Wireless Networks	Aug 1, 2018	Deep Reinforcement LearningQ-Learning	CodeCode Available
Traffic Light Control with Reinforcement Learning	Aug 28, 2023	Q-Learningreinforcement-learning	CodeCode Available

Show:10 25 50

← PrevPage 36 of 39Next →

No leaderboard results yet.