SOTAVerified|Agents Browse Leaderboard About Blog

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 226–250 of 1918 papers

Title	Date	Tasks	Status
Cycles and collusion in congestion games under Q-learning	Feb 26, 2025	Q-Learning	—Unverified
Policy Learning with a Natural Language Action Space: A Causal Approach	Feb 24, 2025	Decision MakingQ-Learning	—Unverified
Yes, Q-learning Helps Offline In-Context RL	Feb 24, 2025	In-Context Reinforcement LearningMuJoCo	—Unverified
Causal Mean Field Multi-Agent Reinforcement Learning	Feb 20, 2025	Multi-agent Reinforcement LearningQ-Learning	—Unverified
A Non-Asymptotic Theory of Seminorm Lyapunov Stability: From Deterministic to Stochastic Iterative Algorithms	Feb 20, 2025	Q-Learning	—Unverified
Is Q-learning an Ill-posed Problem?	Feb 20, 2025	Q-Learningreinforcement-learning	—Unverified
Algorithmic Collusion under Observed Demand Shocks	Feb 20, 2025	Q-Learning	—Unverified
Multi-Objective Reinforcement Learning for Critical Scenario Generation of Autonomous Vehicles	Feb 18, 2025	Autonomous VehiclesMulti-Objective Reinforcement Learning	—Unverified
Few is More: Task-Efficient Skill-Discovery for Multi-Task Offline Multi-Agent Reinforcement Learning	Feb 13, 2025	Learning to ExecuteMulti-agent Reinforcement Learning	—Unverified
Evolution of cooperation in a bimodal mixture of conditional cooperators	Feb 11, 2025	Q-Learning	CodeCode Available
Seasonal Station-Keeping of Short Duration High Altitude Balloons using Deep Reinforcement Learning	Feb 7, 2025	Deep Reinforcement LearningDiversity	—Unverified
Optimizing Wireless Resource Management and Synchronization in Digital Twin Networks	Feb 7, 2025	ManagementQ-Learning	—Unverified
Fast Adaptive Anti-Jamming Channel Access via Deep Q Learning and Coarse-Grained Spectrum Prediction	Feb 7, 2025	Deep Reinforcement LearningQ-Learning	—Unverified
CleanSurvival: Automated data preprocessing for time-to-event models using reinforcement learning	Feb 6, 2025	ImputationOutlier Detection	CodeCode Available
DECAF: Learning to be Fair in Multi-agent Resource Allocation	Feb 6, 2025	FairnessQ-Learning	—Unverified
Efficient Triangular Arbitrage Detection via Graph Neural Networks	Feb 5, 2025	Q-Learning	—Unverified
VistaFlow: Photorealistic Volumetric Reconstruction with Dynamic Resolution Management via Q-Learning	Feb 5, 2025	CPUManagement	—Unverified
Gap-Dependent Bounds for Federated Q-learning	Feb 5, 2025	Q-Learning	—Unverified
Dual Ensembled Multiagent Q-Learning with Hypernet Regularizer	Feb 4, 2025	Q-LearningSMAC	CodeCode Available
Resilient UAV Trajectory Planning via Few-Shot Meta-Offline Reinforcement Learning	Feb 3, 2025	Meta-LearningOffline RL	—Unverified
Computing and Learning Stationary Mean Field Equilibria with Scalar Interactions: Algorithms and Applications	Feb 2, 2025	counterfactualPolicy Gradient Methods	—Unverified
An MDP Model for Censoring in Harvesting Sensors: Optimal and Approximated Solutions	Feb 2, 2025	Q-Learning	—Unverified
Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network	Feb 1, 2025	continuous-controlContinuous Control	—Unverified
Linear Q-Learning Does Not Diverge: Convergence Rates to a Bounded Set	Jan 31, 2025	Q-Learning	—Unverified
Reinforcement Learning for Quantum Circuit Design: Using Matrix Representations	Jan 27, 2025	Deep Reinforcement LearningQ-Learning	—Unverified

Show:10 25 50

← PrevPage 10 of 77Next →

No leaderboard results yet.