SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 15011550 of 1918 papers

TitleStatusHype
A Dual-Hormone Closed-Loop Delivery System for Type 1 Diabetes Using Deep Reinforcement Learning0
Toward Synergic Learning for Autonomous Manipulation of Deformable Tissues via Surgical Robots: An Approximate Q-Learning Approach0
Tactical Reward Shaping: Bypassing Reinforcement Learning with Strategy-Based Goals0
Reinforcement Learning with Structured Hierarchical Grammar Representations of Actions0
Combining No-regret and Q-learningCode0
I'm sorry Dave, I'm afraid I can't do that, Deep Q-learning from forbidden action0
Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping0
Fair Loss: Margin-Aware Reinforcement Learning for Deep Face Recognition0
Composite Q-learning: Multi-scale Q-function Decomposition and Separable Optimization0
Meta-Q-LearningCode0
Q-learning for POMDP: An application to learning locomotion gaits0
Deep Coordination GraphsCode0
Visual Exploration and Energy-aware Path Planning via Reinforcement LearningCode0
CAQL: Continuous Action Q-Learning0
Long-term planning, short-term adjustments0
CAN ALTQ LEARN FASTER: EXPERIMENTS AND THEORY0
Policy Tree Network0
Striving for Simplicity in Off-Policy Deep Reinforcement Learning0
QXplore: Q-Learning Exploration by Maximizing Temporal Difference Error0
Off-policy Multi-step Q-learning0
Modeling Fake News in Social Networks with Deep Multi-Agent Reinforcement Learning0
Active inference: demystified and comparedCode0
On the Convergence of Approximate and Regularized Policy Iteration Schemes0
Dependency-Aware Computation Offloading in Mobile Edge Computing: A Reinforcement Learning Approach0
Split Deep Q-Learning for Robust Object Singulation0
ISL: A novel approach for deep explorationCode0
Joint Inference of Reward Machines and Policies for Reinforcement Learning0
SQLR: Short-Term Memory Q-Learning for Elastic Provisioning0
Reinforcement Learning Models of Human Behavior: Reward Processing in Mental Disorders0
Q-learning Assisted Energy-Aware Traffic Offloading and Cell Switching in Heterogeneous Networks0
A Deep Learning Approach to Grasping the InvisibleCode0
Mutual-Information Regularization in Markov Decision Processes and Actor-Critic Learning0
A Multistep Lyapunov Approach for Finite-Time Analysis of Biased Stochastic Approximation0
Q-Learning Based Aerial Base Station Placement for Fairness Enhancement in Mobile Networks0
Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning0
Self-driving scale car trained by Deep reinforcement learning0
Multi Pseudo Q-learning Based Deterministic Policy Gradient for Tracking Control of Autonomous Underwater Vehicles0
Deep Reinforcement Learning for Control of Probabilistic Boolean NetworksCode0
Encoders and Decoders for Quantum Expander Codes Using Machine Learning0
Gradient Q(σ, λ): A Unified Algorithm with Function Approximation for Reinforcement Learning0
Q-DATA: Enhanced Traffic Flow Monitoring in Software-Defined Networks applying Q-learning0
Approximating two value functions instead of one: towards characterizing a new family of Deep Reinforcement Learning algorithmsCode0
Solving Discounted Stochastic Two-Player Games with Near-Optimal Time and Sample Complexity0
Networked Control of Nonlinear Systems under Partial Observation Using Continuous Deep Q-Learning0
STMARL: A Spatio-Temporal Multi-Agent Reinforcement Learning Approach for Cooperative Traffic Light Control0
Deep Reinforcement Learning for Foreign Exchange Trading0
Performing Deep Recurrent Double Q-Learning for Atari GamesCode0
Learn How to Cook a New Recipe in a New House: Using Map Familiarization, Curriculum Learning, and Bandit Feedback to Learn Families of Text-Based Adventure GamesCode0
Large-Scale Traffic Signal Control Using a Novel Multi-Agent Reinforcement Learning0
Batch Recurrent Q-Learning for Backchannel Generation Towards Engaging Agents0
Show:102550
← PrevPage 31 of 39Next →

No leaderboard results yet.