Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1201–1250 of 1918 papers

Title	Date	Tasks	Status
Learning Sharing Behaviors with Arbitrary Numbers of Agents	Dec 10, 2018	Q-Learning	—Unverified
Learning Strategic Value and Cooperation in Multi-Player Stochastic Games through Side Payments	Mar 9, 2023	FormQ-Learning	—Unverified
Learning through Probing: a decentralized reinforcement learning architecture for social dilemmas	Sep 26, 2018	Deep Reinforcement LearningMulti-agent Reinforcement Learning	—Unverified
Learning Time Reduction Using Warm Start Methods for a Reinforcement Learning Based Supervisory Control in Hybrid Electric Vehicle Applications	Oct 27, 2020	Q-LearningReinforcement Learning (RL)	—Unverified
Learning to Charge More: A Theoretical Study of Collusion by Q-Learning Agents	May 28, 2025	Q-Learning	—Unverified
Learning to Communicate with Reinforcement Learning for an Adaptive Traffic Control System	Oct 29, 2021	Multi-agent Reinforcement LearningQ-Learning	—Unverified
Learning to Cooperate and Communicate Over Imperfect Channels	Nov 24, 2023	Q-Learning	—Unverified
Learning to Cooperate via Policy Search	Aug 7, 2014	Q-Learningreinforcement-learning	—Unverified
Learning to Coordinate with Coordination Graphs in Repeated Single-Stage Multi-Agent Decision Problems	Jul 1, 2018	Multi-Armed BanditsQ-Learning	—Unverified
Learning to Dynamically Coordinate Multi-Robot Teams in Graph Attention Networks	Dec 4, 2019	Combinatorial OptimizationGraph Attention	—Unverified
Learning to Explore via Meta-Policy Gradient	Jul 1, 2018	continuous-controlContinuous Control	—Unverified
Learning to Explore with Meta-Policy Gradient	Mar 13, 2018	Q-LearningReinforcement Learning	—Unverified
Learning to Factor Policies and Action-Value Functions: Factored Action Space Representations for Deep Reinforcement learning	May 20, 2017	Decision MakingDeep Reinforcement Learning	—Unverified
Learning to Learn from Noisy Web Videos	Jun 9, 2017	Action RecognitionQ-Learning	—Unverified
Maximizing Influence with Graph Neural Networks	Aug 10, 2021	Combinatorial OptimizationComputational Efficiency	—Unverified
Learning to Play Video Games with Intuitive Physics Priors	Sep 20, 2024	Decision MakingObject	—Unverified
Learning to predict where to look in interactive environments using deep recurrent q-learning	Dec 17, 2016	Atari GamesQ-Learning	—Unverified
Learning to Reason	Oct 12, 2018	Automated Theorem ProvingQ-Learning	—Unverified
Learning to Represent Haptic Feedback for Partially-Observable Tasks	May 17, 2017	Q-Learning	—Unverified
Learning to Select Goals in Automated Planning with Deep-Q Learning	Jun 20, 2024	Q-Learning	—Unverified
Learning to Sketch with Deep Q Networks and Demonstrated Strokes	Oct 14, 2018	Q-Learning	—Unverified
Learning Value Functions from Undirected State-only Experience	Apr 26, 2022	Future predictionImitation Learning	—Unverified
Learn to Intervene: An Adaptive Learning Policy for Restless Bandits in Application to Preventive Healthcare	May 17, 2021	Q-Learning	—Unverified
Lifting the Veil: Unlocking the Power of Depth in Q-learning	Oct 27, 2023	Learning TheoryManagement	—Unverified
Linear Q-Learning Does Not Diverge: Convergence Rates to a Bounded Set	Jan 31, 2025	Q-Learning	—Unverified
Listwise Learning to Rank with Deep Q-Networks	Feb 13, 2020	Decision MakingLearning-To-Rank	—Unverified
LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning	Jul 5, 2023	Offline RLQ-Learning	—Unverified
Location-routing Optimisation for Urban Logistics Using Mobile Parcel Locker Based on Hybrid Q-Learning Algorithm	Oct 29, 2021	Q-Learning	—Unverified
Logical Team Q-learning: An approach towards factored policies in cooperative MARL	Jun 5, 2020	Q-Learning	—Unverified
Logistic Q-Learning	Oct 21, 2020	Q-LearningReinforcement Learning (RL)	—Unverified
Long and Short Memory Balancing in Visual Co-Tracking using Q-Learning	Feb 14, 2019	Q-LearningReinforcement Learning	—Unverified
Long-term Fairness in Ride-Hailing Platform	Jul 25, 2024	FairnessQ-Learning	—Unverified
Long-term planning, short-term adjustments	Sep 25, 2019	Deep Reinforcement LearningPrediction	—Unverified
LOQA: Learning with Opponent Q-Learning Awareness	May 2, 2024	Q-Learning	—Unverified
MA2QL: A Minimalist Approach to Fully Decentralized Multi-Agent Reinforcement Learning	Sep 17, 2022	Multi-agent Reinforcement LearningQ-Learning	—Unverified
Machine learning-based decentralized TDMA for VLC IoT networks	Nov 23, 2023	Collision AvoidanceQ-Learning	—Unverified
Machine Learning Empowered Trajectory and Passive Beamforming Design in UAV-RIS Wireless Networks	Oct 6, 2020	BIG-bench Machine LearningQ-Learning	—Unverified
MACOptions: Multi-Agent Learning with Centralized Controller and Options Framework	Feb 7, 2023	Q-Learning	—Unverified
Managing App Install Ad Campaigns in RTB: A Q-Learning Approach	Nov 11, 2018	Q-Learning	—Unverified
Manipulating Reinforcement Learning: Poisoning Attacks on Cost Signals	Feb 7, 2020	Q-Learningreinforcement-learning	—Unverified
Many-Goals Reinforcement Learning	Jun 22, 2018	AllQ-Learning	—Unverified
Markov Decision Process modeled with Bandits for Sequential Decision Making in Linear-flow	Jul 1, 2021	Decision MakingMarketing	—Unverified
MARL-FWC: Optimal Coordination of Freeway Traffic Control Measures	Aug 27, 2018	Multi-agent Reinforcement LearningQ-Learning	—Unverified
Maximizing User Connectivity in AI-Enabled Multi-UAV Networks: A Distributed Strategy Generalized to Arbitrary User Distributions	Nov 7, 2024	Deep Reinforcement LearningQ-Learning	—Unverified
Maximum entropy GFlowNets with soft Q-learning	Dec 21, 2023	Q-LearningReinforcement Learning (RL)	—Unverified
Mean-Field Sampling for Cooperative Multi-Agent Reinforcement Learning	Dec 1, 2024	Decision MakingMulti-agent Reinforcement Learning	—Unverified
MEPG: A Minimalist Ensemble Policy Gradient Framework for Deep Reinforcement Learning	Sep 22, 2021	Deep Reinforcement LearningGaussian Processes	—Unverified
MEReQ: Max-Ent Residual-Q Inverse RL for Sample-Efficient Alignment from Intervention	Jun 24, 2024	Imitation LearningQ-Learning	—Unverified
Merging and Disentangling Views in Visual Reinforcement Learning for Robotic Manipulation	May 7, 2025	DisentanglementLightweight Deployment	—Unverified
Meta-Gradient Reinforcement Learning with an Objective Discovered Online	Jul 16, 2020	Deep Reinforcement LearningQ-Learning	—Unverified

Show:10 25 50

← PrevPage 25 of 39Next →

No leaderboard results yet.