SOTAVerified|Agents Browse Leaderboard About

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1051–1075 of 1918 papers

Title	Date	Tasks	Status
A Discrete-Time Switching System Analysis of Q-learning	Feb 17, 2021	Q-Learning	—Unverified
Finite-Time Analysis of Asynchronous Q-learning under Diminishing Step-Size from Control-Theoretic View	Jul 25, 2022	Q-Learning	—Unverified
Final Iteration Convergence Bound of Q-Learning: Switching System Approach	May 11, 2022	Q-Learningreinforcement-learning	—Unverified
Finite-Time Convergence Rates of Decentralized Stochastic Approximation with Applications in Multi-Agent and Multi-Task Learning	Oct 28, 2020	Multi-Task LearningQ-Learning	—Unverified
Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum Markov Games: Switching System Approach	Jun 9, 2023	Q-Learning	—Unverified
Finite-Time Analysis of Simultaneous Double Q-learning	Jun 14, 2024	Q-LearningReinforcement Learning (RL)	—Unverified
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation	Oct 3, 2023	Multi-Armed BanditsQ-Learning	—Unverified
Finite-Time Bounds for Two-Time-Scale Stochastic Approximation with Arbitrary Norm Contractions and Markovian Noise	Mar 24, 2025	Q-Learningreinforcement-learning	—Unverified
Finite-Time Error Analysis of Online Model-Based Q-Learning with a Relaxed Sampling Model	Feb 19, 2024	modelQ-Learning	—Unverified
Finite-Time Error Analysis of Soft Q-Learning: Switching System Approach	Mar 11, 2024	Q-Learning	—Unverified
FIRE: A Failure-Adaptive Reinforcement Learning Framework for Edge Computing Migrations	Sep 28, 2022	Autonomous DrivingEdge-computing	—Unverified
Fire Threat Detection From Videos with Q-Rough Sets	Jan 21, 2021	Q-LearningSegmentation	—Unverified
Fitted Q-Learning for Relational Domains	Jun 10, 2020	Q-Learning	—Unverified
Learning in Discounted-cost and Average-cost Mean-field Games	Dec 31, 2019	Q-Learning	—Unverified
Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning	Sep 9, 2019	Q-Learningreinforcement-learning	—Unverified
Floyd-Warshall Reinforcement Learning: Learning from Past Experiences to Reach New Goals	Sep 25, 2018	Q-Learningreinforcement-learning	—Unverified
FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game	Feb 1, 2024	Multi-agent Reinforcement LearningQ-Learning	—Unverified
Forecasting and stabilizing chaotic regimes in two macroeconomic models via artificial intelligence technologies and control methods	Feb 20, 2023	Decision MakingEvolutionary Algorithms	—Unverified
FPGA Architecture for Deep Learning and its application to Planetary Robotics	Jan 26, 2017	CPUQ-Learning	—Unverified
FRAC-Q-Learning: A Reinforcement Learning with Boredom Avoidance Processes for Social Robots	Nov 26, 2023	Q-Learningreinforcement-learning	—Unverified
From r to Q^*: Your Language Model is Secretly a Q-Function	Apr 18, 2024	Language ModelingLanguage Modelling	—Unverified
Frugal Reinforcement-based Active Learning	Dec 9, 2022	Active LearningDiversity	—Unverified
Full Gradient Deep Reinforcement Learning for Average-Reward Criterion	Apr 7, 2023	Deep Reinforcement LearningMulti-Armed Bandits	—Unverified
Functional Stability of Discounted Markov Decision Processes Using Economic MPC Dissipativity Theory	Mar 31, 2022	Model Predictive ControlQ-Learning	—Unverified
Gap-Dependent Bounds for Federated Q-learning	Feb 5, 2025	Q-Learning	—Unverified

Show:10 25 50

← PrevPage 43 of 77Next →

No leaderboard results yet.