
Q-Learning

Q-learning is a model-free reinforcement learning algorithm that learns an action-value function Q(s, a). The goal is to derive a policy from it, which tells an agent which action to take in each state.

(Image credit: Playing Atari with Deep Reinforcement Learning)
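
To make the description above concrete, here is a minimal sketch of tabular Q-learning. It is not taken from any paper listed on this page. The corridor environment, the function names `train_q_table` and `greedy_policy`, and all hyperparameter values are assumptions chosen for illustration. The update applied at each step is the standard rule Q(s, a) <- Q(s, a) + alpha * (r + gamma * max_a' Q(s', a') - Q(s, a)).

```python
import random


def train_q_table(n_states=5, episodes=500, alpha=0.1, gamma=0.9,
                  epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D corridor.

    States are 0..n_states-1. Action 0 moves left, action 1 moves right.
    Reaching the rightmost state yields reward 1 and ends the episode.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    goal = n_states - 1

    for _ in range(episodes):
        state = 0
        while state != goal:
            # Epsilon-greedy action selection; ties are broken at random.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1

            # Deterministic transition; the left wall keeps the agent at 0.
            next_state = max(0, state - 1) if action == 0 else state + 1
            reward = 1.0 if next_state == goal else 0.0

            # The terminal state has no future value.
            best_next = 0.0 if next_state == goal else max(q[next_state])

            # Q-learning update toward the bootstrapped target.
            target = reward + gamma * best_next
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Return the index of the highest-valued action for each state."""
    return [max(range(len(row)), key=row.__getitem__) for row in q]


if __name__ == "__main__":
    q_table = train_q_table()
    print(greedy_policy(q_table))
```

The learned table plays the role of the policy: in every non-terminal state the agent picks the action with the larger Q-value. The entry for the terminal state is never updated, so its greedy action carries no meaning. Deep Q-learning methods replace the table with a neural network but keep the same update target.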

Papers


Showing 76–100 of 1918 papers

Title	Date	Tasks	Status	Hype
Yes, Q-learning Helps Offline In-Context RL	Feb 24, 2025	In-Context Reinforcement Learning, MuJoCo	Unverified	0
Algorithmic Collusion under Observed Demand Shocks	Feb 20, 2025	Q-Learning	Unverified	0
Is Q-learning an Ill-posed Problem?	Feb 20, 2025	Q-Learning, reinforcement-learning	Unverified	0
Causal Mean Field Multi-Agent Reinforcement Learning	Feb 20, 2025	Multi-agent Reinforcement Learning, Q-Learning	Unverified	0
A Non-Asymptotic Theory of Seminorm Lyapunov Stability: From Deterministic to Stochastic Iterative Algorithms	Feb 20, 2025	Q-Learning	Unverified	0
Multi-Objective Reinforcement Learning for Critical Scenario Generation of Autonomous Vehicles	Feb 18, 2025	Autonomous Vehicles, Multi-Objective Reinforcement Learning	Unverified	0
Digi-Q: Learning Q-Value Functions for Training Device-Control Agents	Feb 13, 2025	Q-Learning, Reinforcement Learning (RL)	Code Available	2
Few is More: Task-Efficient Skill-Discovery for Multi-Task Offline Multi-Agent Reinforcement Learning	Feb 13, 2025	Learning to Execute, Multi-agent Reinforcement Learning	Unverified	0
Evolution of cooperation in a bimodal mixture of conditional cooperators	Feb 11, 2025	Q-Learning	Code Available	0
ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy	Feb 8, 2025	Q-Learning, Safe Exploration	Code Available	3
Optimizing Wireless Resource Management and Synchronization in Digital Twin Networks	Feb 7, 2025	Management, Q-Learning	Unverified	0
Seasonal Station-Keeping of Short Duration High Altitude Balloons using Deep Reinforcement Learning	Feb 7, 2025	Deep Reinforcement Learning, Diversity	Unverified	0
Fast Adaptive Anti-Jamming Channel Access via Deep Q Learning and Coarse-Grained Spectrum Prediction	Feb 7, 2025	Deep Reinforcement Learning, Q-Learning	Unverified	0
CleanSurvival: Automated data preprocessing for time-to-event models using reinforcement learning	Feb 6, 2025	Imputation, Outlier Detection	Code Available	0
DECAF: Learning to be Fair in Multi-agent Resource Allocation	Feb 6, 2025	Fairness, Q-Learning	Unverified	0
VistaFlow: Photorealistic Volumetric Reconstruction with Dynamic Resolution Management via Q-Learning	Feb 5, 2025	CPU, Management	Unverified	0
Gap-Dependent Bounds for Federated Q-learning	Feb 5, 2025	Q-Learning	Unverified	0
Efficient Triangular Arbitrage Detection via Graph Neural Networks	Feb 5, 2025	Q-Learning	Unverified	0
Flow Q-Learning	Feb 4, 2025	Action Generation, D4RL	Code Available	3
Dual Ensembled Multiagent Q-Learning with Hypernet Regularizer	Feb 4, 2025	Q-Learning, SMAC	Code Available	0
Resilient UAV Trajectory Planning via Few-Shot Meta-Offline Reinforcement Learning	Feb 3, 2025	Meta-Learning, Offline RL	Unverified	0
Computing and Learning Stationary Mean Field Equilibria with Scalar Interactions: Algorithms and Applications	Feb 2, 2025	counterfactual, Policy Gradient Methods	Unverified	0
An MDP Model for Censoring in Harvesting Sensors: Optimal and Approximated Solutions	Feb 2, 2025	Q-Learning	Unverified	0
Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network	Feb 1, 2025	continuous-control, Continuous Control	Unverified	0
Linear Q-Learning Does Not Diverge: Convergence Rates to a Bounded Set	Jan 31, 2025	Q-Learning	Unverified	0

