Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1551–1600 of 1918 papers

Title	Date	Tasks	Status
Agnostic Q-learning with Function Approximation in Deterministic Systems: Tight Bounds on Approximation Error and Sample Complexity	Feb 17, 2020	Q-Learning	—Unverified
Agnostic Q-learning with Function Approximation in Deterministic Systems: Near-Optimal Bounds on Approximation Error and Sample Complexity	Dec 1, 2020	Q-Learning	—Unverified
A Graph Attention Learning Approach to Antenna Tilt Optimization	Dec 27, 2021	Graph AttentionQ-Learning	—Unverified
A Hybrid PAC Reinforcement Learning Algorithm	Sep 5, 2020	Q-Learningreinforcement-learning	—Unverified
A Hybrid Q-Learning Sine-Cosine-based Strategy for Addressing the Combinatorial Test Suite Minimization Problem	Apr 27, 2018	Q-Learning	—Unverified
A Hysteretic Q-learning Coordination Framework for Emerging Mobility Systems in Smart Cities	Nov 5, 2020	Q-Learningreinforcement-learning	—Unverified
Adaptive Multi-Agent Deep Reinforcement Learning for Timely Healthcare Interventions	Sep 20, 2023	Deep Reinforcement LearningHyperparameter Optimization	—Unverified
AI on the Water: Applying DRL to Autonomous Vessel Navigation	Oct 23, 2023	Collision AvoidanceDecision Making	—Unverified
A Jointly Optimal Design of Control and Scheduling in Networked Systems under Denial-of-Service Attacks	Mar 10, 2021	Q-LearningScheduling	—Unverified
A Large Language Model-Enhanced Q-learning for Capacitated Vehicle Routing Problem with Time Windows	May 9, 2025	Combinatorial OptimizationLanguage Modeling	—Unverified
A Learning Based Framework for Handling Uncertain Lead Times in Multi-Product Inventory Management	Mar 2, 2022	ManagementQ-Learning	—Unverified
Algorithmic Collusion and Price Discrimination: The Over-Usage of Data	Mar 10, 2024	Q-Learning	—Unverified
Algorithmic Collusion in Dynamic Pricing with Deep Reinforcement Learning	Jun 4, 2024	Deep Reinforcement LearningQ-Learning	—Unverified
Algorithmic Collusion under Observed Demand Shocks	Feb 20, 2025	Q-Learning	—Unverified
Algorithmic Trading with Fitted Q Iteration and Heston Model	May 18, 2018	Algorithmic TradingQ-Learning	—Unverified
A Lifetime Extended Energy Management Strategy for Fuel Cell Hybrid Electric Vehicles via Self-Learning Fuzzy Reinforcement Learning	Feb 13, 2023	energy managementManagement	—Unverified
Almost Sure Convergence Rates and Concentration of Stochastic Approximation and Reinforcement Learning with Markovian Noise	Nov 20, 2024	Q-Learning	—Unverified
A Lyapunov Theory for Finite-Sample Guarantees of Asynchronous Q-Learning and TD-Learning Variants	Feb 2, 2021	Q-LearningReinforcement Learning (RL)	—Unverified
A Machine Learning Approach for Prosumer Management in Intraday Electricity Markets	Mar 11, 2022	BIG-bench Machine LearningManagement	—Unverified
A Machine Learning Approach for Task and Resource Allocation in Mobile Edge Computing Based Networks	Jul 20, 2020	BIG-bench Machine LearningEdge-computing	—Unverified
A Maintenance Planning Framework using Online and Offline Deep Reinforcement Learning	Aug 1, 2022	Asset ManagementDeep Reinforcement Learning	—Unverified
A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret	Jun 8, 2020	Q-Learningreinforcement-learning	—Unverified
A Modified Q-Learning Algorithm for Rate-Profiling of Polarization Adjusted Convolutional (PAC) Codes	Oct 4, 2021	Q-Learningreinforcement-learning	—Unverified
Amortized Noisy Channel Neural Machine Translation	Dec 16, 2021	Imitation LearningKnowledge Distillation	—Unverified
Amortized Q-learning with Model-based Action Proposals for Autonomous Driving on Highways	Dec 6, 2020	Autonomous DrivingDecision Making	—Unverified
A Multi-step and Resilient Predictive Q-learning Algorithm for IoT with Human Operators in the Loop: A Case Study in Water Supply Networks	Jun 6, 2020	Q-LearningScheduling	—Unverified
A Multistep Lyapunov Approach for Finite-Time Analysis of Biased Stochastic Approximation	Sep 10, 2019	Q-LearningReinforcement Learning	—Unverified
An Adiabatic Theorem for Policy Tracking with TD-learning	Oct 24, 2020	Q-Learning	—Unverified
Analysis of Multiscale Reinforcement Q-Learning Algorithms for Mean Field Control Games	May 27, 2024	Q-Learning	—Unverified
Analysis of Q-learning with Adaptation and Momentum Restart for Gradient Descent	Jul 15, 2020	Atari GamesQ-Learning	—Unverified
Analysis of Reinforcement Learning Schemes for Trajectory Optimization of an Aerial Radio Unit	Nov 18, 2022	Q-Learningreinforcement-learning	—Unverified
Analytically Tractable Bayesian Deep Q-Learning	Jun 21, 2021	Q-Learningreinforcement-learning	—Unverified
Analytics of Business Time Series Using Machine Learning and Bayesian Inference	May 25, 2022	Bayesian InferenceBIG-bench Machine Learning	—Unverified
Analyzing Robustness of the Deep Reinforcement Learning Algorithm in Ramp Metering Applications Considering False Data Injection Attack and Defense	Jan 28, 2023	Adversarial AttackDeep Reinforcement Learning	—Unverified
An Attempt to Model Human Trust with Reinforcement Learning	Sep 29, 2021	Decision MakingQ-Learning	—Unverified
A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation	Nov 26, 2023	Q-LearningReinforcement Learning (RL)	—Unverified
An Efficient and Uncertainty-aware Reinforcement Learning Framework for Quality Assurance in Extrusion Additive Manufacturing	Mar 2, 2025	Q-LearningUncertainty Quantification	—Unverified
An efficient data-based off-policy Q-learning algorithm for optimal output feedback control of linear systems	Dec 6, 2023	Q-Learning	—Unverified
An Elementary Proof that Q-learning Converges Almost Surely	Aug 5, 2021	Q-Learningreinforcement-learning	—Unverified
An Empirical Investigation of Value-Based Multi-objective Reinforcement Learning for Stochastic Environments	Jan 6, 2024	Multi-Objective Reinforcement LearningQ-Learning	—Unverified
A Nesterov's Accelerated quasi-Newton method for Global Routing using Deep Reinforcement Learning	Oct 15, 2020	Deep Reinforcement LearningQ-Learning	—Unverified
A Network Simulation of OTC Markets with Multiple Agents	May 3, 2024	Q-Learning	—Unverified
An Evolutionary Framework for Connect-4 as Test-Bed for Comparison of Advanced Minimax, Q-Learning and MCTS	May 26, 2024	Decision MakingQ-Learning	—Unverified
A New Approach for Tactical Decision Making in Lane Changing: Sample Efficient Deep Q Learning with a Safety Feedback Reward	Sep 24, 2020	Decision MakingQ-Learning	—Unverified
A new convergent variant of Q-learning with linear function approximation	Dec 1, 2020	Q-LearningReinforcement Learning (RL)	—Unverified
A new multilayer optical film optimal method based on deep q-learning	Dec 7, 2018	Q-Learning	—Unverified
An Experimental Comparison Between Temporal Difference and Residual Gradient with Neural Network Approximation	May 25, 2022	Q-Learningreinforcement-learning	—Unverified
An FPGA-Based On-Device Reinforcement Learning Approach using Online Sequential Learning	May 10, 2020	L2 RegularizationOpenAI Gym	—Unverified
An Independent Study of Reinforcement Learning and Autonomous Driving	Aug 20, 2021	Autonomous DrivingOpenAI Gym	—Unverified
An Index Policy Based on Sarsa and Q-learning for Heterogeneous Smart Target Tracking	Feb 19, 2024	Q-LearningScheduling	—Unverified

Show:10 25 50

← PrevPage 32 of 39Next →

No leaderboard results yet.