Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 251–300 of 1918 papers

Title	Date	Tasks	Status	Hype
Continuous-time q-Learning for Jump-Diffusion Models under Tsallis Entropy	Jul 4, 2024	Q-Learning	—Unverified	0
Artificial Intelligence and Algorithmic Price Collusion in Two-sided Markets	Jul 4, 2024	Q-Learning	—Unverified	0
Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather	Jul 2, 2024	Data AugmentationLIDAR Semantic Segmentation	CodeCode Available	2
Two-Step Q-Learning	Jul 2, 2024	Q-Learning	—Unverified	0
A Deep Reinforcement Learning Approach to Battery Management in Dairy Farming via Proximal Policy Optimization	Jul 1, 2024	Deep Reinforcement Learningenergy management	—Unverified	0
Model-based Offline Reinforcement Learning with Lower Expectile Q-Learning	Jun 30, 2024	D4RLOffline RL	—Unverified	0
Towards Secure and Efficient Data Scheduling for Vehicular Social Networks	Jun 28, 2024	Q-LearningScheduling	—Unverified	0
Contextualized Hybrid Ensemble Q-learning: Learning Fast with Control Priors	Jun 28, 2024	Car RacingQ-Learning	CodeCode Available	0
Decentralized Semantic Traffic Control in AVs Using RL and DQN for Dynamic Roadblocks	Jun 26, 2024	Autonomous VehiclesDecision Making	—Unverified	0
Boosting Soft Q-Learning by Bounding	Jun 26, 2024	Q-Learning	CodeCode Available	0
MEReQ: Max-Ent Residual-Q Inverse RL for Sample-Efficient Alignment from Intervention	Jun 24, 2024	Imitation LearningQ-Learning	—Unverified	0
A General Control-Theoretic Approach for Reinforcement Learning: Theory and Algorithms	Jun 20, 2024	Learning TheoryQ-Learning	—Unverified	0
Learning to Select Goals in Automated Planning with Deep-Q Learning	Jun 20, 2024	Q-Learning	—Unverified	0
Equivariant Offline Reinforcement Learning	Jun 20, 2024	Offline RLQ-Learning	—Unverified	0
EduQate: Generating Adaptive Curricula through RMABs in Education Settings	Jun 20, 2024	Multi-Armed BanditsQ-Learning	—Unverified	0
Reinforcement-Learning based routing for packet-optical networks with hybrid telemetry	Jun 18, 2024	Q-Learning	CodeCode Available	0
Catalytic evolution of cooperation in a population with behavioural bimodality	Jun 17, 2024	Q-Learning	—Unverified	0
Optimal Transport-Assisted Risk-Sensitive Q-Learning	Jun 17, 2024	Decision MakingQ-Learning	—Unverified	0
Mix Q-learning for Lane Changing: A Collaborative Decision-Making Method in Multi-Agent Deep Reinforcement Learning	Jun 14, 2024	Decision MakingDeep Reinforcement Learning	—Unverified	0
Finite-Time Analysis of Simultaneous Double Q-learning	Jun 14, 2024	Q-LearningReinforcement Learning (RL)	—Unverified	0
Multi-agent Reinforcement Learning with Deep Networks for Diverse Q-Vectors	Jun 12, 2024	Multi-agent Reinforcement LearningQ-Learning	—Unverified	0
Probing Implicit Bias in Semi-gradient Q-learning: Visualizing the Effective Loss Landscapes via the Fokker--Planck Equation	Jun 12, 2024	Q-Learning	CodeCode Available	0
PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer	Jun 10, 2024	continuous-controlContinuous Control	CodeCode Available	1
Fast-Fading Channel and Power Optimization of the Magnetic Inductive Cellular Network	Jun 7, 2024	Q-Learning	—Unverified	0
Online Frequency Scheduling by Learning Parallel Actions	Jun 7, 2024	Graph Neural NetworkQ-Learning	—Unverified	0
Stabilizing Extreme Q-learning by Maclaurin Expansion	Jun 7, 2024	D4RLOffline RL	CodeCode Available	0
Strategically Conservative Q-Learning	Jun 6, 2024	D4RLOffline RL	CodeCode Available	1
Bootstrapping Expectiles in Reinforcement Learning	Jun 6, 2024	Q-Learningreinforcement-learning	—Unverified	0
Age of Trust (AoT): A Continuous Verification Framework for Wireless Networks	Jun 4, 2024	PhilosophyQ-Learning	—Unverified	0
Algorithmic Collusion in Dynamic Pricing with Deep Reinforcement Learning	Jun 4, 2024	Deep Reinforcement LearningQ-Learning	—Unverified	0
Towards Universal and Black-Box Query-Response Only Attack on LLMs with QROA	Jun 4, 2024	Q-Learning	CodeCode Available	1
Tabular and Deep Learning for the Whittle Index	Jun 4, 2024	Deep LearningQ-Learning	—Unverified	0
How to discretize continuous state-action spaces in Q-learning: A symbolic control approach	Jun 3, 2024	Q-Learning	—Unverified	0
Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation	May 31, 2024	Q-Learning	CodeCode Available	0
Approximate Global Convergence of Independent Learning in Multi-Agent Systems	May 30, 2024	Q-Learning	—Unverified	0
Q-learning as a monotone scheme	May 30, 2024	Deep Reinforcement LearningQ-Learning	—Unverified	0
Diffusion Policies creating a Trust Region for Offline Reinforcement Learning	May 30, 2024	D4RLDenoising	CodeCode Available	1
Federated Q-Learning with Reference-Advantage Decomposition: Almost Optimal Regret and Logarithmic Communication Cost	May 29, 2024	Q-Learning	—Unverified	0
Imitating from auxiliary imperfect demonstrations via Adversarial Density Weighted Regression	May 28, 2024	Imitation LearningMuJoCo	CodeCode Available	0
Safe Multi-Agent Reinforcement Learning with Bilevel Optimization in Autonomous Driving	May 28, 2024	Autonomous DrivingBilevel Optimization	CodeCode Available	2
AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization	May 28, 2024	D4RLOffline RL	CodeCode Available	0
Highway Reinforcement Learning	May 28, 2024	Q-Learningreinforcement-learning	—Unverified	0
Mutation-Bias Learning in Games	May 28, 2024	Multi-agent Reinforcement LearningQ-Learning	—Unverified	0
A Recipe for Unbounded Data Augmentation in Visual Reinforcement Learning	May 27, 2024	Data AugmentationQ-Learning	CodeCode Available	1
Analysis of Multiscale Reinforcement Q-Learning Algorithms for Mean Field Control Games	May 27, 2024	Q-Learning	—Unverified	0
Reinforcement Learning for Jump-Diffusions, with Financial Applications	May 26, 2024	Q-Learningreinforcement-learning	—Unverified	0
An Evolutionary Framework for Connect-4 as Test-Bed for Comparison of Advanced Minimax, Q-Learning and MCTS	May 26, 2024	Decision MakingQ-Learning	—Unverified	0
SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning	May 24, 2024	Deep Reinforcement LearningQ-Learning	—Unverified	0
Knowledge-Informed Auto-Penetration Testing Based on Reinforcement Learning with Reward Machine	May 24, 2024	Q-LearningReinforcement Learning (RL)	—Unverified	0
Extracting Heuristics from Large Language Models for Reward Shaping in Reinforcement Learning	May 24, 2024	Language ModellingLarge Language Model	—Unverified	0

Show:10 25 50

← PrevPage 6 of 39Next →

No leaderboard results yet.