SOTAVerified|Agents Browse Leaderboard About

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 826–850 of 1918 papers

Title	Date	Tasks	Status
Convert Language Model into a Value-based Strategic Planner	May 11, 2025	Language ModelingLanguage Modelling	—Unverified
Harnessing Deep Q-Learning for Enhanced Statistical Arbitrage in High-Frequency Trading: A Comprehensive Exploration	Sep 13, 2023	Decision MakingQ-Learning	—Unverified
Deep Robot Sketching: An application of Deep Q-Learning Networks for human-like sketching	Feb 1, 2024	Q-Learningreinforcement-learning	—Unverified
HAVER: Instance-Dependent Error Bounds for Maximum Mean Estimation and Applications to Q-Learning and Monte Carlo Tree Search	Nov 1, 2024	Q-Learning	—Unverified
Hedging of Financial Derivative Contracts via Monte Carlo Tree Search	Feb 11, 2021	Q-Learningreinforcement-learning	—Unverified
Hedging using reinforcement learning: Contextual k-Armed Bandit versus Q-learning	Jul 3, 2020	FrictionQ-Learning	—Unverified
Cooperation and Reputation Dynamics with Reinforcement Learning	Feb 15, 2021	Q-Learningreinforcement-learning	—Unverified
Hidden Incentives for Auto-Induced Distributional Shift	Sep 19, 2020	BIG-bench Machine LearningMeta-Learning	—Unverified
Hidden Markov Model Estimation-Based Q-learning for Partially Observable Markov Decision Process	Sep 17, 2018	Q-Learning	—Unverified
Hierarchical clustering with deep Q-learning	May 28, 2018	ClusteringQ-Learning	—Unverified
Cooperative Control of Mobile Robots with Stackelberg Learning	Aug 3, 2020	Deep Reinforcement LearningQ-Learning	—Unverified
Hierarchical Deep Q-Learning Based Handover in Wireless Networks with Dual Connectivity	Jan 13, 2023	Q-Learningreinforcement-learning	—Unverified
Hierarchical Modular Reinforcement Learning Method and Knowledge Acquisition of State-Action Rule for Multi-target Problem	Apr 8, 2018	PositionQ-Learning	—Unverified
Cooperative Optimal Output Tracking for Discrete-Time Multiagent Systems: Stabilizing Policy Iteration Frameworks and Analysis	Jan 11, 2025	Q-Learning	—Unverified
High dimensional precision medicine from patient-derived xenografts	Dec 13, 2019	Q-LearningVocal Bursts Intensity Prediction	—Unverified
High-Dimensional Stock Portfolio Trading with Deep Reinforcement Learning	Dec 9, 2021	Deep Reinforcement LearningQ-Learning	—Unverified
Highway Reinforcement Learning	May 28, 2024	Q-Learningreinforcement-learning	—Unverified
Hippocampal representations emerge when training recurrent neural networks on a memory dependent maze navigation task	Dec 2, 2020	HippocampusQ-Learning	—Unverified
How to discretize continuous state-action spaces in Q-learning: A symbolic control approach	Jun 3, 2024	Q-Learning	—Unverified
Human and Multi-Agent collaboration in a human-MARL teaming framework	Jun 12, 2020	Multi-agent Reinforcement LearningQ-Learning	—Unverified
Hybridizing the 1/5-th Success Rule with Q-Learning for Controlling the Mutation Rate of an Evolutionary Algorithm	Jun 19, 2020	Evolutionary AlgorithmsQ-Learning	—Unverified
Hybrid LLM-DDQN based Joint Optimization of V2I Communication and Autonomous Driving	Oct 11, 2024	Autonomous DrivingDecision Making	—Unverified
Hybrid Policies Using Inverse Rewards for Reinforcement Learning	Sep 27, 2018	OpenAI GymQ-Learning	—Unverified
Hybrid Q-Learning Applied to Ubiquitous recommender system	Mar 10, 2013	Q-LearningRecommendation Systems	—Unverified
A Conflicts-free, Speed-lossless KAN-based Reinforcement Learning Decision System for Interactive Driving in Roundabouts	Aug 15, 2024	Autonomous DrivingAutonomous Vehicles	—Unverified

Show:10 25 50

← PrevPage 34 of 77Next →

No leaderboard results yet.