Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 101–150 of 1918 papers

Title	Date	Tasks	Status	Hype
Deep Active Inference for Partially Observable MDPs	Sep 8, 2020	Deep Reinforcement LearningQ-Learning	CodeCode Available	1
Pre-Training for Robots: Offline RL Enables Learning New Tasks from a Handful of Trials	Oct 11, 2022	Offline RLQ-Learning	CodeCode Available	1
Towards Universal and Black-Box Query-Response Only Attack on LLMs with QROA	Jun 4, 2024	Q-Learning	CodeCode Available	1
Randomized Ensembled Double Q-Learning: Learning Fast Without a Model	Jan 15, 2021	MuJoCoQ-Learning	CodeCode Available	1
Reasoning with Latent Diffusion in Offline Reinforcement Learning	Sep 12, 2023	D4RLOffline RL	CodeCode Available	1
Regularized Softmax Deep Multi-Agent Q-Learning	Dec 1, 2021	Multi-agent Reinforcement LearningQ-Learning	CodeCode Available	1
Research on Robot Path Planning Based on Reinforcement Learning	Apr 22, 2024	Q-Learningreinforcement-learning	CodeCode Available	1
CCLF: A Contrastive-Curiosity-Driven Learning Framework for Sample-Efficient Reinforcement Learning	May 2, 2022	Data AugmentationQ-Learning	CodeCode Available	1
Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past	Jun 10, 2019	Deep Reinforcement LearningMuJoCo	CodeCode Available	1
Reward Machines for Cooperative Multi-Agent Reinforcement Learning	Jul 3, 2020	Multi-agent Reinforcement LearningQ-Learning	CodeCode Available	1
Adaptive Contention Window Design using Deep Q-learning	Nov 18, 2020	Q-LearningReinforcement Learning (RL)	CodeCode Available	1
Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic Manipulation via Discretisation	Jun 23, 2021	Continuous ControlQ-Learning	CodeCode Available	1
Sampling Efficient Deep Reinforcement Learning through Preference-Guided Stochastic Exploration	Jun 20, 2022	Atari GamesDeep Reinforcement Learning	CodeCode Available	1
A Search-Based Testing Approach for Deep Reinforcement Learning Agents	Jun 15, 2022	Autonomous DrivingDecision Making	CodeCode Available	1
ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives	Dec 8, 2021	Q-LearningReinforcement Learning (RL)	CodeCode Available	1
Backprop-Free Reinforcement Learning with Active Neural Generative Coding	Jul 10, 2021	Q-Learningreinforcement-learning	CodeCode Available	1
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor	Jan 4, 2018	Continuous ControlDecision Making	CodeCode Available	1
Solving Continuous Control via Q-learning	Oct 22, 2022	continuous-controlContinuous Control	CodeCode Available	1
Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning	Feb 28, 2017	Multi-agent Reinforcement LearningQ-Learning	CodeCode Available	1
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation	Jul 1, 2021	Data AugmentationQ-Learning	CodeCode Available	1
Automated Cloud Provisioning on AWS using Deep Reinforcement Learning	Sep 13, 2017	Cloud ComputingDeep Reinforcement Learning	CodeCode Available	1
An Optimistic Perspective on Offline Reinforcement Learning	Jul 10, 2019	Atari GamesDiversity	CodeCode Available	1
Table2Charts: Recommending Charts by Learning Shared Table Representations	Aug 24, 2020	Q-LearningRecommendation Systems	CodeCode Available	1
TempoRL: Learning When to Act	Jun 9, 2021	Continuous ControlQ-Learning	CodeCode Available	1
Towards Optimal Adversarial Robust Q-learning with Bellman Infinity-error	Feb 3, 2024	Adversarial RobustnessDeep Reinforcement Learning	CodeCode Available	1
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption	Oct 19, 2023	Offline RLQ-Learning	CodeCode Available	1
Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning	Jun 7, 2021	Multi-agent Reinforcement LearningOffline RL	CodeCode Available	1
A Recipe for Unbounded Data Augmentation in Visual Reinforcement Learning	May 27, 2024	Data AugmentationQ-Learning	CodeCode Available	1
An Optimistic Perspective on Offline Deep Reinforcement Learning	Jan 1, 2020	Atari GamesDeep Reinforcement Learning	CodeCode Available	1
A Stochastic Game Framework for Efficient Energy Management in Microgrid Networks	Feb 6, 2020	energy managementenergy trading	CodeCode Available	1
Benchmarking Deep Graph Generative Models for Optimizing New Drug Molecules for COVID-19	Feb 9, 2021	BenchmarkingQ-Learning	CodeCode Available	1
Boosting Continuous Control with Consistency Policy	Oct 10, 2023	continuous-controlContinuous Control	CodeCode Available	1
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning	Mar 9, 2023	Offline RLQ-Learning	CodeCode Available	1
Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver?	Dec 1, 2020	Feature EngineeringQ-Learning	CodeCode Available	1
Benchmarking Batch Deep Reinforcement Learning Algorithms	Oct 3, 2019	BenchmarkingDeep Reinforcement Learning	CodeCode Available	1
Combining Reinforcement Learning with Lin-Kernighan-Helsgaun Algorithm for the Traveling Salesman Problem	Dec 8, 2020	Combinatorial OptimizationQ-Learning	CodeCode Available	1
Addressing Function Approximation Error in Actor-Critic Methods	Feb 26, 2018	Continuous ControlOpenAI Gym	CodeCode Available	1
Continuous Deep Q-Learning with Model-based Acceleration	Mar 2, 2016	continuous-controlContinuous Control	CodeCode Available	1
Deep Inverse Q-learning with Constraints	Aug 4, 2020	Q-Learning	CodeCode Available	1
FACMAC: Factored Multi-Agent Centralised Policy Gradients	Mar 14, 2020	MuJoCoMulti-agent Reinforcement Learning	CodeCode Available	1
Deep Reinforcement Learning-based Intelligent Traffic Signal Controls with Optimized CO2 emissions	Oct 19, 2023	Deep Reinforcement LearningQ-Learning	CodeCode Available	1
Deep Reinforcement Learning with Double Q-learning	Sep 22, 2015	Atari GamesDeep Reinforcement Learning	CodeCode Available	1
When should we prefer Decision Transformers for Offline Reinforcement Learning?	May 23, 2023	D4RLImitation Learning	CodeCode Available	1
Diffusion Policies creating a Trust Region for Offline Reinforcement Learning	May 30, 2024	D4RLDenoising	CodeCode Available	1
Discriminator Soft Actor Critic without Extrinsic Rewards	Jan 19, 2020	Imitation LearningQ-Learning	CodeCode Available	1
Distilling Reinforcement Learning Tricks for Video Games	Jul 1, 2021	Q-Learningreinforcement-learning	CodeCode Available	1
IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies	Apr 20, 2023	Offline RLQ-Learning	CodeCode Available	1
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization	Mar 28, 2023	D4RLOffline RL	CodeCode Available	1
EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models	Oct 9, 2020	Deep Reinforcement LearningEpidemiology	CodeCode Available	1
Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning	May 17, 2021	Offline RLQ-Learning	CodeCode Available	1

Show:10 25 50

← PrevPage 3 of 39Next →

No leaderboard results yet.