Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 401–450 of 1918 papers

Title	Date	Tasks	Status
Decentralized Semantic Traffic Control in AVs Using RL and DQN for Dynamic Roadblocks	Jun 26, 2024	Autonomous VehiclesDecision Making	—Unverified
MEReQ: Max-Ent Residual-Q Inverse RL for Sample-Efficient Alignment from Intervention	Jun 24, 2024	Imitation LearningQ-Learning	—Unverified
EduQate: Generating Adaptive Curricula through RMABs in Education Settings	Jun 20, 2024	Multi-Armed BanditsQ-Learning	—Unverified
Equivariant Offline Reinforcement Learning	Jun 20, 2024	Offline RLQ-Learning	—Unverified
Learning to Select Goals in Automated Planning with Deep-Q Learning	Jun 20, 2024	Q-Learning	—Unverified
A General Control-Theoretic Approach for Reinforcement Learning: Theory and Algorithms	Jun 20, 2024	Learning TheoryQ-Learning	—Unverified
Reinforcement-Learning based routing for packet-optical networks with hybrid telemetry	Jun 18, 2024	Q-Learning	CodeCode Available
Optimal Transport-Assisted Risk-Sensitive Q-Learning	Jun 17, 2024	Decision MakingQ-Learning	—Unverified
Catalytic evolution of cooperation in a population with behavioural bimodality	Jun 17, 2024	Q-Learning	—Unverified
Finite-Time Analysis of Simultaneous Double Q-learning	Jun 14, 2024	Q-LearningReinforcement Learning (RL)	—Unverified
Mix Q-learning for Lane Changing: A Collaborative Decision-Making Method in Multi-Agent Deep Reinforcement Learning	Jun 14, 2024	Decision MakingDeep Reinforcement Learning	—Unverified
Multi-agent Reinforcement Learning with Deep Networks for Diverse Q-Vectors	Jun 12, 2024	Multi-agent Reinforcement LearningQ-Learning	—Unverified
Probing Implicit Bias in Semi-gradient Q-learning: Visualizing the Effective Loss Landscapes via the Fokker--Planck Equation	Jun 12, 2024	Q-Learning	CodeCode Available
Online Frequency Scheduling by Learning Parallel Actions	Jun 7, 2024	Graph Neural NetworkQ-Learning	—Unverified
Fast-Fading Channel and Power Optimization of the Magnetic Inductive Cellular Network	Jun 7, 2024	Q-Learning	—Unverified
Stabilizing Extreme Q-learning by Maclaurin Expansion	Jun 7, 2024	D4RLOffline RL	CodeCode Available
Bootstrapping Expectiles in Reinforcement Learning	Jun 6, 2024	Q-Learningreinforcement-learning	—Unverified
Age of Trust (AoT): A Continuous Verification Framework for Wireless Networks	Jun 4, 2024	PhilosophyQ-Learning	—Unverified
Tabular and Deep Learning for the Whittle Index	Jun 4, 2024	Deep LearningQ-Learning	—Unverified
Algorithmic Collusion in Dynamic Pricing with Deep Reinforcement Learning	Jun 4, 2024	Deep Reinforcement LearningQ-Learning	—Unverified
How to discretize continuous state-action spaces in Q-learning: A symbolic control approach	Jun 3, 2024	Q-Learning	—Unverified
Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation	May 31, 2024	Q-Learning	CodeCode Available
Q-learning as a monotone scheme	May 30, 2024	Deep Reinforcement LearningQ-Learning	—Unverified
Approximate Global Convergence of Independent Learning in Multi-Agent Systems	May 30, 2024	Q-Learning	—Unverified
Federated Q-Learning with Reference-Advantage Decomposition: Almost Optimal Regret and Logarithmic Communication Cost	May 29, 2024	Q-Learning	—Unverified
Imitating from auxiliary imperfect demonstrations via Adversarial Density Weighted Regression	May 28, 2024	Imitation LearningMuJoCo	CodeCode Available
Mutation-Bias Learning in Games	May 28, 2024	Multi-agent Reinforcement LearningQ-Learning	—Unverified
Highway Reinforcement Learning	May 28, 2024	Q-Learningreinforcement-learning	—Unverified
AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization	May 28, 2024	D4RLOffline RL	CodeCode Available
Analysis of Multiscale Reinforcement Q-Learning Algorithms for Mean Field Control Games	May 27, 2024	Q-Learning	—Unverified
Reinforcement Learning for Jump-Diffusions, with Financial Applications	May 26, 2024	Q-Learningreinforcement-learning	—Unverified
An Evolutionary Framework for Connect-4 as Test-Bed for Comparison of Advanced Minimax, Q-Learning and MCTS	May 26, 2024	Decision MakingQ-Learning	—Unverified
Knowledge-Informed Auto-Penetration Testing Based on Reinforcement Learning with Reward Machine	May 24, 2024	Q-LearningReinforcement Learning (RL)	—Unverified
Extracting Heuristics from Large Language Models for Reward Shaping in Reinforcement Learning	May 24, 2024	Language ModellingLarge Language Model	—Unverified
SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning	May 24, 2024	Deep Reinforcement LearningQ-Learning	—Unverified
A finite time analysis of distributed Q-learning	May 23, 2024	Decision MakingMulti-agent Reinforcement Learning	—Unverified
Exclusively Penalized Q-learning for Offline Reinforcement Learning	May 23, 2024	Offline RLQ-Learning	—Unverified
Learning To Play Atari Games Using Dueling Q-Learning and Hebbian Plasticity	May 22, 2024	Atari GamesDeep Reinforcement Learning	CodeCode Available
Stochastic Q-learning for Large Discrete Action Spaces	May 16, 2024	Decision MakingQ-Learning	—Unverified
Deep Reinforcement Learning for Real-Time Ground Delay Program Revision and Corresponding Flight Delay Assignments	May 14, 2024	Decision MakingDeep Reinforcement Learning	—Unverified
Smart Sampling: Self-Attention and Bootstrapping for Improved Ensembled Q-Learning	May 14, 2024	Q-Learning	—Unverified
An Initial Introduction to Cooperative Multi-Agent Reinforcement Learning	May 10, 2024	MisconceptionsMulti-agent Reinforcement Learning	—Unverified
An Overview of Machine Learning-Enabled Optimization for Reconfigurable Intelligent Surfaces-Aided 6G Networks: From Reinforcement Learning to Large Language Models	May 9, 2024	Hierarchical Reinforcement LearningManagement	—Unverified
SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems	May 7, 2024	CPUGPU	CodeCode Available
Enhancing Q-Learning with Large Language Model Heuristics	May 6, 2024	Decision MakingLanguage Modeling	—Unverified
Zero-Sum Positional Differential Games as a Framework for Robust Reinforcement Learning: Deep Q-Learning Approach	May 3, 2024	Q-Learningreinforcement-learning	—Unverified
A Network Simulation of OTC Markets with Multiple Agents	May 3, 2024	Q-Learning	—Unverified
Regularized Q-learning through Robust Averaging	May 3, 2024	Q-Learning	CodeCode Available
LOQA: Learning with Opponent Q-Learning Awareness	May 2, 2024	Q-Learning	—Unverified
Cell Switching in HAPS-Aided Networking: How the Obscurity of Traffic Loads Affects the Decision	May 1, 2024	Q-Learning	—Unverified

Show:10 25 50

← PrevPage 9 of 39Next →

No leaderboard results yet.