SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 15011525 of 1918 papers

TitleStatusHype
A Dual-Hormone Closed-Loop Delivery System for Type 1 Diabetes Using Deep Reinforcement Learning0
Toward Synergic Learning for Autonomous Manipulation of Deformable Tissues via Surgical Robots: An Approximate Q-Learning Approach0
Tactical Reward Shaping: Bypassing Reinforcement Learning with Strategy-Based Goals0
Reinforcement Learning with Structured Hierarchical Grammar Representations of Actions0
Combining No-regret and Q-learningCode0
I'm sorry Dave, I'm afraid I can't do that, Deep Q-learning from forbidden action0
Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping0
Fair Loss: Margin-Aware Reinforcement Learning for Deep Face Recognition0
Composite Q-learning: Multi-scale Q-function Decomposition and Separable Optimization0
Meta-Q-LearningCode0
Q-learning for POMDP: An application to learning locomotion gaits0
Deep Coordination GraphsCode0
Visual Exploration and Energy-aware Path Planning via Reinforcement LearningCode0
CAQL: Continuous Action Q-Learning0
Long-term planning, short-term adjustments0
CAN ALTQ LEARN FASTER: EXPERIMENTS AND THEORY0
Policy Tree Network0
Striving for Simplicity in Off-Policy Deep Reinforcement Learning0
QXplore: Q-Learning Exploration by Maximizing Temporal Difference Error0
Off-policy Multi-step Q-learning0
Modeling Fake News in Social Networks with Deep Multi-Agent Reinforcement Learning0
Active inference: demystified and comparedCode0
On the Convergence of Approximate and Regularized Policy Iteration Schemes0
Dependency-Aware Computation Offloading in Mobile Edge Computing: A Reinforcement Learning Approach0
Split Deep Q-Learning for Robust Object Singulation0
Show:102550
← PrevPage 61 of 77Next →

No leaderboard results yet.