SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 16511675 of 1918 papers

TitleStatusHype
Double Successive Over-Relaxation Q-Learning with an Extension to Deep Reinforcement LearningCode0
On Solving the 2-Dimensional Greedy Shooter Problem for UAVsCode0
Q-Learning Lagrange Policies for Multi-Action Restless BanditsCode0
Balancing Rational and Other-Regarding Preferences in Cooperative-Competitive EnvironmentsCode0
ADDQ: Adaptive Distributional Double Q-LearningCode0
Learning To Play Atari Games Using Dueling Q-Learning and Hebbian PlasticityCode0
Learning to Play in a Day: Faster Deep Reinforcement Learning by Optimality TighteningCode0
Using deep Q-learning to understand the tax evasion behavior of risk-averse firmsCode0
On the Estimation Bias in Double Q-LearningCode0
Self-Learning Cloud Controllers: Fuzzy Q-Learning for Knowledge EvolutionCode0
Contextualized Hybrid Ensemble Q-learning: Learning Fast with Control PriorsCode0
Self Punishment and Reward Backfill for Deep Q-LearningCode0
Enhancing Robot Assistive Behaviour with Reinforcement Learning and Theory of MindCode0
Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement LearningCode0
Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot NavigationCode0
Double Q-PID algorithm for mobile robot controlCode0
Adaptive Symmetric Reward Noising for Reinforcement LearningCode0
Learning Visual Tracking and Reaching with Deep Reinforcement Learning on a UR10e Robotic ArmCode0
Stochastic approximation with cone-contractive operators: Sharp _-bounds for Q-learningCode0
Distributionally Robust Deep Q-LearningCode0
Least-Squares Policy IterationCode0
Leveraging Digital Cousins for Ensemble Q-Learning in Large-Scale Wireless NetworksCode0
Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge DistillationCode0
Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement LearningCode0
Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic MethodsCode0
Show:102550
← PrevPage 67 of 77Next →

No leaderboard results yet.