Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1151–1200 of 1918 papers

Title	Date	Tasks	Status
Interpretable performance analysis towards offline reinforcement learning: A dataset perspective	May 12, 2021	Offline RLQ-Learning	—Unverified
Inverse Factorized Q-Learning for Cooperative Multi-agent Imitation Learning	Oct 10, 2023	Imitation LearningQ-Learning	—Unverified
Inverse Policy Evaluation for Value-based Sequential Decision-making	Aug 26, 2020	Decision MakingQ-Learning	—Unverified
Inverse RL Scene Dynamics Learning for Nonlinear Predictive Control in Autonomous Vehicles	Apr 2, 2025	Autonomous NavigationAutonomous Vehicles	—Unverified
Investigating Reinforcement Learning Agents for Continuous State Space Environments	Aug 8, 2017	OpenAI GymQ-Learning	—Unverified
Investigating the Edge of Stability Phenomenon in Reinforcement Learning	Jul 9, 2023	Q-Learningreinforcement-learning	—Unverified
Investigating the Properties of Neural Network Representations in Reinforcement Learning	Mar 30, 2022	Deep Reinforcement LearningQ-Learning	—Unverified
IoT-Aerial Base Station Task Offloading with Risk-Sensitive Reinforcement Learning for Smart Agriculture	Sep 15, 2022	Q-LearningReinforcement Learning (RL)	—Unverified
IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control	Jun 1, 2023	D4RLModel-based Reinforcement Learning	—Unverified
Is Q-learning an Ill-posed Problem?	Feb 20, 2025	Q-Learningreinforcement-learning	—Unverified
Is Q-Learning Provably Efficient? An Extended Analysis	Sep 22, 2020	Q-Learningreinforcement-learning	—Unverified
Is Risk-Sensitive Reinforcement Learning Properly Resolved?	Jul 2, 2023	Distributional Reinforcement LearningManagement	—Unverified
"Jam Me If You Can'': Defeating Jammer with Deep Dueling Neural Network Architecture and Ambient Backscattering Augmented Communications	Apr 8, 2019	Deep Reinforcement LearningQ-Learning	—Unverified
Joint Inference of Reward Machines and Policies for Reinforcement Learning	Sep 12, 2019	Q-Learningreinforcement-learning	—Unverified
Joint Learning of Interactive Spoken Content Retrieval and Trainable User Simulator	Apr 1, 2018	Information RetrievalQ-Learning	—Unverified
Joint Learning of Reward Machines and Policies in Environments with Partially Known Semantics	Apr 20, 2022	Q-Learningreinforcement-learning	—Unverified
Joint User Association, Interference Cancellation and Power Control for Multi-IRS Assisted UAV Communications	Dec 8, 2023	Q-LearningScheduling	—Unverified
KAN v.s. MLP for Offline Reinforcement Learning	Sep 15, 2024	D4RLKolmogorov-Arnold Networks	—Unverified
Kernel-Based Distributed Q-Learning: A Scalable Reinforcement Learning Approach for Dynamic Treatment Regimes	Feb 21, 2023	Learning TheoryMedical Diagnosis	—Unverified
Knowledge-Informed Auto-Penetration Testing Based on Reinforcement Learning with Reward Machine	May 24, 2024	Q-LearningReinforcement Learning (RL)	—Unverified
Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics	Nov 2, 2021	D4RLData Augmentation	—Unverified
K-spin Hamiltonian for quantum-resolvable Markov decision processes	Apr 13, 2020	Q-LearningReinforcement Learning	—Unverified
Language Inference with Multi-head Automata through Reinforcement Learning	Oct 20, 2020	Q-Learningreinforcement-learning	—Unverified
Large-Scale Traffic Signal Control Using a Novel Multi-Agent Reinforcement Learning	Aug 10, 2019	Multi-agent Reinforcement LearningQ-Learning	—Unverified
Late Breaking Results: Breaking Symmetry- Unconventional Placement of Analog Circuits using Multi-Level Multi-Agent Reinforcement Learning	Mar 29, 2025	Multi-agent Reinforcement LearningQ-Learning	—Unverified
Learning agents with prioritization and parameter noise in continuous state and action space	May 1, 2019	Autonomous VehiclesQ-Learning	—Unverified
Learning Agents With Prioritization and Parameter Noise in Continuous State and Action Space	Oct 15, 2024	Autonomous VehiclesQ-Learning	—Unverified
Learning Augmented Index Policy for Optimal Service Placement at the Network Edge	Jan 10, 2021	Q-Learning	—Unverified
Learning Automata Based Q-learning for Content Placement in Cooperative Caching	Mar 30, 2019	Q-Learning	—Unverified
Learning-Based Joint User-AP Association and Resource Allocation in Ultra Dense Network	May 28, 2020	Q-Learning	—Unverified
Learning-Based Strategy Design for Robot-Assisted Reminiscence Therapy Based on a Developed Model for People with Dementia	Sep 6, 2021	Q-Learning	—Unverified
Learning Best Response Strategies for Agents in Ad Exchanges	Feb 10, 2019	Q-Learning	—Unverified
Learning Control for Air Hockey Striking using Deep Reinforcement Learning	Feb 26, 2017	Deep Reinforcement LearningQ-Learning	—Unverified
Learning Dexterous Manipulation from Suboptimal Experts	Oct 16, 2020	Offline RLQ-Learning	—Unverified
Learning Dialog Policies from Weak Demonstrations	Apr 23, 2020	Atari GamesDeep Reinforcement Learning	—Unverified
Learning Efficient Parameter Server Synchronization Policies for Distributed SGD	May 1, 2020	Q-LearningReinforcement Learning (RL)	—Unverified
Learning Explicit Credit Assignment for Multi-agent Joint Q-learning	Sep 29, 2021	Q-Learning	—Unverified
Learning from Peers: Deep Transfer Reinforcement Learning for Joint Radio and Cache Resource Allocation in 5G RAN Slicing	Sep 16, 2021	FairnessManagement	—Unverified
Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network	Feb 1, 2025	continuous-controlContinuous Control	—Unverified
Learning Gaussian Policies from Smoothed Action Value Functions	Jan 1, 2018	continuous-controlContinuous Control	—Unverified
Learning Hard Alignments with Variational Inference	May 16, 2017	Hard AttentionImage Captioning	—Unverified
Learning in complex action spaces without policy gradients	Oct 8, 2024	Policy Gradient MethodsQ-Learning	—Unverified
Learning medical triage from clinicians using Deep Q-Learning	Mar 28, 2020	Decision MakingDeep Reinforcement Learning	—Unverified
Learning Movement Strategies for Moving Target Defense	Jan 1, 2021	Q-Learning	—Unverified
Learning Nash Equilibria in Zero-Sum Stochastic Games via Entropy-Regularized Policy Approximation	Sep 1, 2020	Multi-agent Reinforcement LearningQ-Learning	—Unverified
Learning Negotiating Behavior Between Cars in Intersections using Deep Q-Learning	Oct 24, 2018	Q-Learning	—Unverified
Learning Neural Control Barrier Functions from Offline Data with Conservatism	May 1, 2025	Q-Learning	—Unverified
Learning Sampling Policies for Domain Adaptation	May 19, 2018	ClassificationDomain Adaptation	—Unverified
Learning Self-Awareness Models for Physical Layer Security in Cognitive and AI-enabled Radios	Nov 23, 2022	Q-Learning	—Unverified
Learning Self-Imitating Diverse Policies	May 25, 2018	continuous-controlContinuous Control	—Unverified

Show:10 25 50

← PrevPage 24 of 39Next →

No leaderboard results yet.