SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 14511500 of 1918 papers

TitleStatusHype
Quantum Observables for continuous control of the Quantum Approximate Optimization Algorithm via Reinforcement Learning0
Efficient Drone Mobility Support Using Reinforcement Learning0
Asymptotics of Reinforcement Learning with Neural Networks0
Modelling Bahdanau Attention using Election methods aided by Q-Learning0
Two-stage WECC Composite Load Modeling: A Double Deep Q-Learning Networks Approach0
Challenging On Car Racing Problem from OpenAI gym0
On Solving the 2-Dimensional Greedy Shooter Problem for UAVsCode0
Generalized Speedy Q-learningCode0
Model-Free Mean-Field Reinforcement Learning: Mean-Field MDP and Mean-Field Q-Learning0
Biomimetic Ultra-Broadband Perfect Absorbers Optimised with Reinforcement Learning0
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement LearningCode0
ZPD Teaching Strategies for Deep Reinforcement Learning from DemonstrationsCode0
D-Point Trigonometric Path Planning based on Q-Learning in Uncertain Environments0
Deep Q-Learning for Same-Day Delivery with Vehicles and Drones0
Momentum-based Accelerated Q-learningCode0
Partially Detected Intelligent Traffic Signal Control: Environmental Adaptation0
Policy Learning for Malaria ControlCode0
Reverse Experience Replay0
Automatic Data Augmentation by Learning the Deterministic PolicyCode0
Adaptive Discretization for Episodic Reinforcement Learning in Metric SpacesCode0
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central InferenceCode0
On the Reduction of Variance and Overestimation of Deep Q-Learning0
Zap Q-Learning With Nonlinear Function Approximation0
Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments0
A Dual-Hormone Closed-Loop Delivery System for Type 1 Diabetes Using Deep Reinforcement Learning0
Tactical Reward Shaping: Bypassing Reinforcement Learning with Strategy-Based Goals0
Toward Synergic Learning for Autonomous Manipulation of Deformable Tissues via Surgical Robots: An Approximate Q-Learning Approach0
Combining No-regret and Q-learningCode0
Reinforcement Learning with Structured Hierarchical Grammar Representations of Actions0
I'm sorry Dave, I'm afraid I can't do that, Deep Q-learning from forbidden action0
Benchmarking Batch Deep Reinforcement Learning AlgorithmsCode1
Fair Loss: Margin-Aware Reinforcement Learning for Deep Face Recognition0
Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping0
Q-learning for POMDP: An application to learning locomotion gaits0
Composite Q-learning: Multi-scale Q-function Decomposition and Separable Optimization0
Meta-Q-LearningCode0
Deep Coordination GraphsCode0
CAQL: Continuous Action Q-Learning0
Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver?Code1
Visual Exploration and Energy-aware Path Planning via Reinforcement LearningCode0
QXplore: Q-Learning Exploration by Maximizing Temporal Difference Error0
Off-policy Multi-step Q-learning0
Modeling Fake News in Social Networks with Deep Multi-Agent Reinforcement Learning0
Long-term planning, short-term adjustments0
Striving for Simplicity in Off-Policy Deep Reinforcement Learning0
CAN ALTQ LEARN FASTER: EXPERIMENTS AND THEORY0
Policy Tree Network0
Active inference: demystified and comparedCode0
On the Convergence of Approximate and Regularized Policy Iteration Schemes0
Dependency-Aware Computation Offloading in Mobile Edge Computing: A Reinforcement Learning Approach0
Show:102550
← PrevPage 30 of 39Next →

No leaderboard results yet.