SOTAVerified|Agents Browse Leaderboard About

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 751–775 of 1918 papers

Title	Date	Tasks	Status
Equivalence Between Policy Gradients and Soft Q-Learning	Apr 21, 2017	Policy Gradient MethodsQ-Learning	—Unverified
Fidelity-based Probabilistic Q-learning for Control of Quantum Systems	Jun 8, 2018	Q-LearningReinforcement Learning	—Unverified
Final Adaptation Reinforcement Learning for N-Player Games	Nov 29, 2021	Board GamesQ-Learning	—Unverified
Finding the best design parameters for optical nanostructures using reinforcement learning	Oct 18, 2018	BIG-bench Machine LearningQ-Learning	—Unverified
Finite Horizon Q-learning: Stability, Convergence, Simulations and an application on Smart Grids	Oct 27, 2021	Q-Learningreinforcement-learning	—Unverified
Finite-Sample Analysis for SARSA with Linear Function Approximation	Feb 6, 2019	Q-LearningReinforcement Learning	—Unverified
Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks	Sep 10, 2016	Q-Learningreinforcement-learning	—Unverified
Finite-Sample Analysis of Decentralized Q-Learning for Stochastic Games	Dec 15, 2021	Multi-agent Reinforcement LearningQ-Learning	—Unverified
Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning	Mar 7, 2023	Continuous ControlOffline RL	—Unverified
Finite-sample Guarantees for Nash Q-learning with Linear Function Approximation	Mar 1, 2023	Multi-agent Reinforcement LearningQ-Learning	—Unverified
Chrome Dino Run using Reinforcement Learning	Aug 15, 2020	Q-Learningreinforcement-learning	—Unverified
Finite-Time Analysis of Asynchronous Stochastic Approximation and Q-Learning	Feb 1, 2020	Q-Learning	—Unverified
Entropy-Augmented Entropy-Regularized Reinforcement Learning and a Continuous Path from Policy Gradient to Q-Learning	May 18, 2020	Q-Learning	—Unverified
Finite-Time Analysis of Asynchronous Q-learning under Diminishing Step-Size from Control-Theoretic View	Jul 25, 2022	Q-Learning	—Unverified
Entropic Risk Optimization in Discounted MDPs: Sample Complexity Bounds with a Generative Model	May 30, 2025	Q-Learning	—Unverified
Finite-Time Convergence Rates of Decentralized Stochastic Approximation with Applications in Multi-Agent and Multi-Task Learning	Oct 28, 2020	Multi-Task LearningQ-Learning	—Unverified
Chemoreception and chemotaxis of a three-sphere swimmer	May 5, 2022	Q-Learning	—Unverified
Ensemble Bootstrapping for Q-Learning	Feb 28, 2021	Atari GamesQ-Learning	—Unverified
Finite-Time Analysis of Simultaneous Double Q-learning	Jun 14, 2024	Q-LearningReinforcement Learning (RL)	—Unverified
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation	Oct 3, 2023	Multi-Armed BanditsQ-Learning	—Unverified
Finite-Time Bounds for Two-Time-Scale Stochastic Approximation with Arbitrary Norm Contractions and Markovian Noise	Mar 24, 2025	Q-Learningreinforcement-learning	—Unverified
Finite-Time Error Analysis of Online Model-Based Q-Learning with a Relaxed Sampling Model	Feb 19, 2024	modelQ-Learning	—Unverified
Finite-Time Error Analysis of Soft Q-Learning: Switching System Approach	Mar 11, 2024	Q-Learning	—Unverified
FIRE: A Failure-Adaptive Reinforcement Learning Framework for Edge Computing Migrations	Sep 28, 2022	Autonomous DrivingEdge-computing	—Unverified
Characterizing the Action-Generalization Gap in Deep Q-Learning	May 11, 2022	Q-LearningReinforcement Learning (RL)	—Unverified

Show:10 25 50

← PrevPage 31 of 77Next →

No leaderboard results yet.