Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.
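
As a minimal sketch of how such a policy can be learned, the example below runs tabular Q-learning on a hypothetical five-state corridor. The environment, the reward of 1 for reaching the last state, and all hyperparameters (alpha, gamma, epsilon, the episode count) are assumptions made for illustration and are not taken from this page or from any listed paper.

```python
import random


def q_learning(n_states=5, episodes=500, alpha=0.1, gamma=0.9,
               epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D corridor (illustrative only).

    States are 0..n_states-1. Action 0 moves left, action 1 moves right.
    Reaching the last state gives reward 1 and ends the episode.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]  # q[state][action]
    goal = n_states - 1

    for _ in range(episodes):
        s = 0
        while s != goal:
            # Epsilon-greedy action selection; ties are broken at random.
            if rng.random() < epsilon or q[s][0] == q[s][1]:
                a = rng.randrange(2)
            else:
                a = 0 if q[s][0] > q[s][1] else 1

            # Deterministic transition and reward of the toy environment.
            s_next = max(0, s - 1) if a == 0 else s + 1
            r = 1.0 if s_next == goal else 0.0

            # Q-learning update: bootstrap from the best next-state value,
            # with no bootstrap term once the episode has terminated.
            target = r if s_next == goal else r + gamma * max(q[s_next])
            q[s][a] += alpha * (target - q[s][a])
            s = s_next
    return q


def greedy_policy(q):
    """Return the highest-valued action index for each state."""
    return [max(range(len(row)), key=row.__getitem__) for row in q]


if __name__ == "__main__":
    q_table = q_learning()
    print(greedy_policy(q_table))
```

The learned table maps each state to action values, and the policy is read off by taking the highest-valued action in each state. In this toy setting the greedy action in every non-terminal state should be "right" (action 1); the entry for the terminal state is never updated and carries no meaning.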

(Image credit: Playing Atari with Deep Reinforcement Learning)

Papers

Showing 151–200 of 1918 papers

Title	Date	Tasks	Status	Hype
Gradient Temporal-Difference Learning with Regularized Corrections	Jul 1, 2020	Q-Learning	Code Available	1
Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time Systems with Lipschitz Continuous Controls	Oct 27, 2020	continuous-control, Continuous Control	Code Available	1
Hybrid RL: Using Both Offline and Online Data Can Make RL Efficient	Oct 13, 2022	Montezuma's Revenge, Q-Learning	Code Available	1
A Hysteretic Q-learning Coordination Framework for Emerging Mobility Systems in Smart Cities	Nov 5, 2020	Q-Learning, reinforcement-learning	Unverified	0
A Hybrid Q-Learning Sine-Cosine-based Strategy for Addressing the Combinatorial Test Suite Minimization Problem	Apr 27, 2018	Q-Learning	Unverified	0
Adaptive Stochastic Resource Control: A Machine Learning Approach	Jan 15, 2014	BIG-bench Machine Learning, Clustering	Unverified	0
A Hybrid PAC Reinforcement Learning Algorithm	Sep 5, 2020	Q-Learning, reinforcement-learning	Unverified	0
A Graph Attention Learning Approach to Antenna Tilt Optimization	Dec 27, 2021	Graph Attention, Q-Learning	Unverified	0
Adaptive Services Function Chain Orchestration For Digital Health Twin Use Cases: Heuristic-boosted Q-Learning Approach	Apr 25, 2023	Q-Learning, Scheduling	Unverified	0
A Comparison of Classical and Deep Reinforcement Learning Methods for HVAC Control	Aug 10, 2023	Deep Reinforcement Learning, Q-Learning	Unverified	0
Agnostic Q-learning with Function Approximation in Deterministic Systems: Near-Optimal Bounds on Approximation Error and Sample Complexity	Dec 1, 2020	Q-Learning	Unverified	0
Agnostic Q-learning with Function Approximation in Deterministic Systems: Tight Bounds on Approximation Error and Sample Complexity	Feb 17, 2020	Q-Learning	Unverified	0
Adaptive Q-learning for Interaction-Limited Reinforcement Learning	Sep 29, 2021	Offline RL, Q-Learning	Unverified	0
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance	Nov 17, 2021	continuous-control, Continuous Control	Unverified	0
A Geometric Nash Approach in Tuning the Learning Rate in Q-Learning Algorithm	Aug 9, 2024	Q-Learning	Unverified	0
Adaptive Modulation and Coding based on Reinforcement Learning for 5G Networks	Nov 25, 2019	Q-Learning, reinforcement-learning	Unverified	0
A Comparative Study of AI-based Intrusion Detection Techniques in Critical Infrastructures	Jul 24, 2020	Intrusion Detection, Management	Unverified	0
QADQN: Quantum Attention Deep Q-Network for Financial Market Prediction	Aug 6, 2024	Decision Making, Q-Learning	Unverified	0
Age of Trust (AoT): A Continuous Verification Framework for Wireless Networks	Jun 4, 2024	Philosophy, Q-Learning	Unverified	0
Age-of-information minimization via opportunistic sampling by an energy harvesting source	Jan 8, 2022	Q-Learning	Unverified	0
Adaptive Knowledge-based Multi-Objective Evolutionary Algorithm for Hybrid Flow Shop Scheduling Problems with Multiple Parallel Batch Processing Stages	Sep 27, 2024	Q-Learning, Scheduling	Unverified	0
Age of Information Minimization using Multi-agent UAVs based on AI-Enhanced Mean Field Resource Allocation	Apr 24, 2024	Q-Learning, Scheduling	Unverified	0
Agent-state based policies in POMDPs: Beyond belief-state MDPs	Sep 24, 2024	Q-Learning	Unverified	0
Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback	Jun 20, 2023	MuJoCo, Q-Learning	Unverified	0
A Comparative Analysis of Portfolio Optimization Using Mean-Variance, Hierarchical Risk Parity, and Reinforcement Learning Approaches on the Indian Stock Market	May 27, 2023	Portfolio Optimization, Q-Learning	Unverified	0
A Comparative Analysis of Deep Reinforcement Learning-enabled Freeway Decision-making for Automated Vehicles	Aug 4, 2020	Autonomous Driving, Autonomous Vehicles	Unverified	0
A General-Purpose Theorem for High-Probability Bounds of Stochastic Approximation with Polyak Averaging	May 27, 2025	Q-Learning	Unverified	0
Reinforcement Learning for an Efficient and Effective Malware Investigation during Cyber Incident Response	Aug 4, 2024	Decision Making, Malware Analysis	Unverified	0
A General Framework for Learning Mean-Field Games	Mar 13, 2020	Decision Making, Multi-agent Reinforcement Learning	Unverified	0
A General Control-Theoretic Approach for Reinforcement Learning: Theory and Algorithms	Jun 20, 2024	Learning Theory, Q-Learning	Unverified	0
A Reinforcement Learning Perspective on the Optimal Control of Mutation Probabilities for the (1+1) Evolutionary Algorithm: First Results on the OneMax Problem	May 9, 2019	Evolutionary Algorithms, Q-Learning	Unverified	0
A storage expansion planning framework using reinforcement learning and simulation-based optimization	Jan 10, 2020	Decision Making, Q-Learning	Unverified	0
A short variational proof of equivalence between policy gradients and soft Q learning	Dec 22, 2017	Q-Learning, reinforcement-learning	Unverified	0
Adapting Double Q-Learning for Continuous Reinforcement Learning	Sep 25, 2023	MuJoCo, Q-Learning	Unverified	0
A Framework for Provably Stable and Consistent Training of Deep Feedforward Networks	May 20, 2023	Q-Learning, reinforcement-learning	Unverified	0
Toward Packet Routing with Fully-distributed Multi-agent Deep Reinforcement Learning	May 9, 2019	Decision Making, Deep Reinforcement Learning	Unverified	0
Actuator Trajectory Planning for UAVs with Overhead Manipulator using Reinforcement Learning	Aug 24, 2023	Motion Planning, Navigate	Unverified	0
A Flexible Framework for Incorporating Patient Preferences Into Q-Learning	Jul 22, 2023	Q-Learning	Unverified	0
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning	Dec 22, 2024	D4RL, Q-Learning	Unverified	0
Artificial Intelligence and Auction Design	Feb 12, 2022	Q-Learning	Unverified	0
A Finite Time Analysis of Temporal Difference Learning With Linear Function Approximation	Jun 6, 2018	Q-Learning, Reinforcement Learning	Unverified	0
A Finite-Time Analysis of Q-Learning with Neural Network Function Approximation	Dec 10, 2019	Deep Reinforcement Learning, Q-Learning	Unverified	0
Achieving Stable Training of Reinforcement Learning Agents in Bimodal Environments through Batch Learning	Jul 3, 2023	Q-Learning, reinforcement-learning	Unverified	0
A finite time analysis of distributed Q-learning	May 23, 2024	Decision Making, Multi-agent Reinforcement Learning	Unverified	0
A Finite Sample Complexity Bound for Distributionally Robust Q-learning	Feb 26, 2023	Q-Learning	Unverified	0
Active Perception and Representation for Robotic Manipulation	Mar 15, 2020	Q-Learning, Reinforcement Learning	Unverified	0
An Agile Adaptation Method for Multi-mode Vehicle Communication Networks	Jul 18, 2024	Q-Learning, reinforcement-learning	Unverified	0
Artificial Intelligence and Dual Contract	Mar 22, 2023	Q-Learning	Unverified	0
A Family of Cognitively Realistic Parsing Environments for Deep Reinforcement Learning	Jan 16, 2022	Deep Reinforcement Learning, Hierarchical Reinforcement Learning	Unverified	0
Active Measure Reinforcement Learning for Observation Cost Minimization	May 26, 2020	Decision Making, Q-Learning	Unverified	0
