SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy that tells an agent which action to take in each state.

(Image credit: Playing Atari with Deep Reinforcement Learning)
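
To make the definition above concrete, here is a minimal sketch of tabular Q-learning. The environment is a hypothetical five-state corridor invented for illustration (it is not taken from any paper listed on this page), and the function names `q_learning` and `greedy_policy`, along with the hyperparameter values, are assumptions. The agent keeps a table of action values Q(s, a), nudges each entry toward `reward + gamma * max(Q(next_state))`, and the learned policy is simply the highest-valued action in each state.

```python
import random


def q_learning(n_states=5, episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D corridor (illustrative only).

    States are 0..n_states-1; action 0 moves left, action 1 moves right.
    Reaching the rightmost state ends the episode with reward 1.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]  # q[state][action]
    goal = n_states - 1
    for _ in range(episodes):
        state = 0
        while state != goal:
            # Epsilon-greedy selection; ties are broken at random so the
            # untrained agent still wanders toward the goal.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            next_state = max(0, state - 1) if action == 0 else state + 1
            reward = 1.0 if next_state == goal else 0.0
            # A terminal state contributes no future value.
            future = 0.0 if next_state == goal else max(q[next_state])
            # Q-learning update: move Q(s, a) toward r + gamma * max_a' Q(s', a').
            q[state][action] += alpha * (reward + gamma * future - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Return the index of the highest-valued action for every state."""
    return [max(range(len(row)), key=row.__getitem__) for row in q]


q_table = q_learning()
policy = greedy_policy(q_table)
```

In this sketch, `policy[s]` is the action the agent would take in state `s`. For the non-terminal states the learned policy should be "move right", since that is the only way to reach the reward. Deep Q-learning, which several of the listed papers build on, replaces the table with a neural network but keeps the same update target.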

Papers

Showing 1576–1600 of 1918 papers

| Title | Status | Hype |
|---|---|---|
| A Multi-step and Resilient Predictive Q-learning Algorithm for IoT with Human Operators in the Loop: A Case Study in Water Supply Networks | | 0 |
| A Multistep Lyapunov Approach for Finite-Time Analysis of Biased Stochastic Approximation | | 0 |
| An Adiabatic Theorem for Policy Tracking with TD-learning | | 0 |
| Analysis of Multiscale Reinforcement Q-Learning Algorithms for Mean Field Control Games | | 0 |
| Analysis of Q-learning with Adaptation and Momentum Restart for Gradient Descent | | 0 |
| Analysis of Reinforcement Learning Schemes for Trajectory Optimization of an Aerial Radio Unit | | 0 |
| Analytically Tractable Bayesian Deep Q-Learning | | 0 |
| Analytics of Business Time Series Using Machine Learning and Bayesian Inference | | 0 |
| Analyzing Robustness of the Deep Reinforcement Learning Algorithm in Ramp Metering Applications Considering False Data Injection Attack and Defense | | 0 |
| An Attempt to Model Human Trust with Reinforcement Learning | | 0 |
| A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation | | 0 |
| An Efficient and Uncertainty-aware Reinforcement Learning Framework for Quality Assurance in Extrusion Additive Manufacturing | | 0 |
| An efficient data-based off-policy Q-learning algorithm for optimal output feedback control of linear systems | | 0 |
| An Elementary Proof that Q-learning Converges Almost Surely | | 0 |
| An Empirical Investigation of Value-Based Multi-objective Reinforcement Learning for Stochastic Environments | | 0 |
| A Nesterov's Accelerated quasi-Newton method for Global Routing using Deep Reinforcement Learning | | 0 |
| A Network Simulation of OTC Markets with Multiple Agents | | 0 |
| An Evolutionary Framework for Connect-4 as Test-Bed for Comparison of Advanced Minimax, Q-Learning and MCTS | | 0 |
| A New Approach for Tactical Decision Making in Lane Changing: Sample Efficient Deep Q Learning with a Safety Feedback Reward | | 0 |
| A new convergent variant of Q-learning with linear function approximation | | 0 |
| A new multilayer optical film optimal method based on deep q-learning | | 0 |
| An Experimental Comparison Between Temporal Difference and Residual Gradient with Neural Network Approximation | | 0 |
| An FPGA-Based On-Device Reinforcement Learning Approach using Online Sequential Learning | | 0 |
| An Independent Study of Reinforcement Learning and Autonomous Driving | | 0 |
| An Index Policy Based on Sarsa and Q-learning for Heterogeneous Smart Target Tracking | | 0 |

No leaderboard results yet.