SOTAVerified|Agents Browse Leaderboard About

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 376–400 of 1918 papers

Title	Date	Tasks	Status
Concept and the implementation of a tool to convert industry 4.0 environments modeled as FSM to an OpenAI Gym wrapper	Jun 29, 2020	OpenAI GymQ-Learning	—Unverified
Configuring Transmission Thresholds in IIoT Alarm Scenarios for Energy-Efficient Event Reporting	Jul 4, 2024	Q-LearningScheduling	—Unverified
A Novel Resource Allocation for Anti-jamming in Cognitive-UAVs: an Active Inference Approach	Aug 10, 2022	Bayesian InferenceQ-Learning	—Unverified
An Efficient and Uncertainty-aware Reinforcement Learning Framework for Quality Assurance in Extrusion Additive Manufacturing	Mar 2, 2025	Q-LearningUncertainty Quantification	—Unverified
Consecutive Task-oriented Dialog Policy Learning	Nov 16, 2021	Continual LearningManagement	—Unverified
An Overview of Machine Learning-Enabled Optimization for Reconfigurable Intelligent Surfaces-Aided 6G Networks: From Reinforcement Learning to Large Language Models	May 9, 2024	Hierarchical Reinforcement LearningManagement	—Unverified
Bridging the Performance Gap Between Target-Free and Target-Based Reinforcement Learning With Iterated Q-Learning	Jun 4, 2025	Q-Learning	—Unverified
CoNSoLe: Convex Neural Symbolic Learning	Jun 1, 2022	Q-Learning	—Unverified
Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation	Jan 25, 2024	Q-LearningReinforcement Learning (RL)	—Unverified
Constrained Model-Free Reinforcement Learning for Process Optimization	Nov 16, 2020	modelModel Predictive Control	—Unverified
Constraints Penalized Q-learning for Safe Offline Reinforcement Learning	Jul 19, 2021	Offline RLQ-Learning	—Unverified
Constructing narrative using a generative model and continuous action policies	Sep 1, 2017	Paraphrase IdentificationQ-Learning	—Unverified
Contextual Conservative Q-Learning for Offline Reinforcement Learning	Jan 3, 2023	MuJoCoQ-Learning	—Unverified
A Penalized Shared-parameter Algorithm for Estimating Optimal Dynamic Treatment Regimens	Jul 13, 2021	Q-Learning	—Unverified
Contextual Policy Transfer in Reinforcement Learning Domains via Deep Mixtures-of-Experts	Feb 29, 2020	Mixture-of-ExpertsOpenAI Gym	—Unverified
Bridging the Gap Between Value and Policy Based Reinforcement Learning	Feb 28, 2017	Q-Learningreinforcement-learning	—Unverified
APF+: Boosting adaptive-potential function reinforcement learning methods with a W-shaped network for high-dimensional games	Mar 17, 2025	Atari GamesQ-Learning	—Unverified
Continuous Deep Q-Learning in Optimal Control Problems: Normalized Advantage Functions Analysis	Sep 29, 2021	Deep Reinforcement LearningQ-Learning	—Unverified
A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation	Nov 26, 2023	Q-LearningReinforcement Learning (RL)	—Unverified
Application of Deep Q-Network in Portfolio Management	Mar 13, 2020	Deep Reinforcement LearningFace Recognition	—Unverified
Continuous-time q-Learning for Jump-Diffusion Models under Tsallis Entropy	Jul 4, 2024	Q-Learning	—Unverified
Continuous-time q-learning for mean-field control problems	Jun 28, 2023	Q-Learning	—Unverified
Continuous-time Risk-sensitive Reinforcement Learning via Quadratic Variation Penalty	Apr 19, 2024	Q-Learningreinforcement-learning	—Unverified
Breaking the Sample Complexity Barrier to Regret-Optimal Model-Free Reinforcement Learning	Oct 9, 2021	Q-Learningreinforcement-learning	—Unverified
Breaking the Deadly Triad with a Target Network	Jan 21, 2021	Q-Learning	—Unverified

Show:10 25 50

← PrevPage 16 of 77Next →

No leaderboard results yet.