Q-Learning

Q-learning is a model-free reinforcement learning algorithm. Its goal is to learn a policy that tells an agent which action to take in each state, derived from a learned action-value function Q(s, a).
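
As a rough illustration of what learning such a policy can look like, the sketch below runs tabular Q-learning on a tiny chain environment. The environment, the hyperparameters (alpha, gamma, epsilon), and the function names q_learning_chain and greedy_policy are all assumptions made for this example; none of them come from the papers listed on this page.

```python
import random


def q_learning_chain(n_states=5, episodes=500, alpha=0.1, gamma=0.9,
                     epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D chain (illustrative only).

    States are 0..n_states-1. Action 0 moves left (staying put at state 0),
    action 1 moves right. Reaching the last state yields reward 1 and ends
    the episode; every other transition yields reward 0.
    """
    rng = random.Random(seed)
    # q[s][a] holds the current estimate of the action value Q(s, a).
    q = [[0.0, 0.0] for _ in range(n_states)]
    goal = n_states - 1

    for _ in range(episodes):
        s = 0
        while s != goal:
            # Epsilon-greedy action selection with random tie-breaking.
            if rng.random() < epsilon:
                a = rng.randrange(2)
            else:
                best = max(q[s])
                a = rng.choice([x for x in (0, 1) if q[s][x] == best])

            # Deterministic transition of the assumed chain environment.
            s_next = max(0, s - 1) if a == 0 else s + 1
            done = s_next == goal
            r = 1.0 if done else 0.0

            # Q-learning update: move Q(s, a) toward the bootstrapped target
            # r + gamma * max_a' Q(s', a'), with no bootstrap at the goal.
            target = r if done else r + gamma * max(q[s_next])
            q[s][a] += alpha * (target - q[s][a])
            s = s_next
    return q


def greedy_policy(q):
    """Return the greedy action (argmax over Q) for every state."""
    return [max(range(len(row)), key=lambda a: row[a]) for row in q]


if __name__ == "__main__":
    q_table = q_learning_chain()
    # The last state is terminal, so only the other entries are meaningful.
    print(greedy_policy(q_table)[:-1])
```

In this toy setting the learned greedy policy should prefer moving right in every non-terminal state, since that is the shortest route to the only reward. Deep variants replace the table q with a neural network, but the update target has the same form.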

(Image credit: Playing Atari with Deep Reinforcement Learning)

Papers

Showing 1851–1900 of 1918 papers

Title	Date	Tasks	Status
Instance Weighted Incremental Evolution Strategies for Reinforcement Learning in Dynamic Environments	Oct 9, 2020	Incremental Learning, Q-Learning	Code Available
Policy Iterations for Reinforcement Learning Problems in Continuous Time and Space -- Fundamental Theory and Methods	May 9, 2017	Decision Making, Q-Learning	Code Available
NARS vs. Reinforcement learning: ONA vs. Q-Learning	Dec 23, 2022	Q-Learning, reinforcement-learning	Code Available
Privacy-Preserving Q-Learning with Functional Noise in Continuous Spaces	Dec 1, 2019	Privacy Preserving, Q-Learning	Code Available
Privacy-preserving Q-Learning with Functional Noise in Continuous State Spaces	Jan 30, 2019	Privacy Preserving, Q-Learning	Code Available
A Multi-Step Minimax Q-learning Algorithm for Two-Player Zero-Sum Markov Games	Jul 5, 2024	Q-Learning	Code Available
Probing Implicit Bias in Semi-gradient Q-learning: Visualizing the Effective Loss Landscapes via the Fokker--Planck Equation	Jun 12, 2024	Q-Learning	Code Available
Switch-based Active Deep Dyna-Q: Efficient Adaptive Planning for Task-Completion Dialogue Policy Learning	Nov 19, 2018	Active Learning, Q-Learning	Code Available
A Machine with Short-Term, Episodic, and Semantic Memory Systems	Dec 5, 2022	Q-Learning, Reinforcement Learning (RL)	Code Available
Intelligent Masking: Deep Q-Learning for Context Encoding in Medical Image Analysis	Mar 25, 2022	Medical Image Analysis, Q-Learning	Code Available
Assumed Density Filtering Q-learning	Dec 9, 2017	Atari Games, Bayesian Inference	Code Available
Propagating Uncertainty in Reinforcement Learning via Wasserstein Barycenters	Dec 1, 2019	Atari Games, Q-Learning	Code Available
Robust Q-Learning for finite ambiguity sets	Jul 5, 2024	Q-Learning	Code Available
Cooperation between Independent Market Makers	Jun 11, 2022	Q-Learning	Code Available
Robust Q-Learning under Corrupted Rewards	Sep 5, 2024	Q-Learning	Code Available
Solving Deep Reinforcement Learning Tasks with Evolution Strategies and Linear Policy Networks	Feb 10, 2024	Atari Games, Deep Reinforcement Learning	Code Available
Active exploration in parameterized reinforcement learning	Oct 6, 2016	Meta-Learning, Q-Learning	Code Available
Solving NP-Hard Problems on Graphs with Extended AlphaGo Zero	May 28, 2019	Combinatorial Optimization, Graph Neural Network	Code Available
Control with adaptive Q-learning	Nov 3, 2020	OpenAI Gym, Q-Learning	Code Available
The Mean-Squared Error of Double Q-Learning	Jul 9, 2020	Q-Learning	Code Available
Synthesis of Temporally-Robust Policies for Signal Temporal Logic Tasks using Reinforcement Learning	Dec 10, 2023	Q-Learning	Code Available
Inverse Q-Learning Done Right: Offline Imitation Learning in Q^π-Realizable MDPs	May 26, 2025	Imitation Learning, Q-Learning	Code Available
SABER: Data-Driven Motion Planner for Autonomously Navigating Heterogeneous Robots	Aug 3, 2021	Model Predictive Control, Motion Planning	Code Available
Solving reward-collecting problems with UAVs: a comparison of online optimization and Q-learning	Nov 30, 2021	Autonomous Vehicles, Q-Learning	Code Available
Solving The Lunar Lander Problem under Uncertainty using Reinforcement Learning	Nov 24, 2020	Navigate, Q-Learning	Code Available
Investigating the Performance and Reliability, of the Q-Learning Algorithm in Various Unknown Environments	Dec 19, 2023	OpenAI Gym, Pathfinder	Code Available
Neural Temporal-Difference and Q-Learning Provably Converge to Global Optima	May 24, 2019	Deep Reinforcement Learning, Q-Learning	Code Available
I Open at the Close: A Deep Reinforcement Learning Evaluation of Open Streets Initiatives	Dec 12, 2023	Deep Reinforcement Learning, Graph Neural Network	Code Available
Assessing the Potential of Classical Q-learning in General Game Playing	Oct 14, 2018	Board Games, Deep Reinforcement Learning	Code Available
Deep Reinforcement Learning for Multi-class Imbalanced Training	May 24, 2022	Deep Reinforcement Learning, imbalanced classification	Code Available
A Deep Recurrent Q Network towards Self-adapting Distributed Microservices architecture	Jan 13, 2019	Decision Making, Q-Learning	Code Available
ISL: A novel approach for deep exploration	Sep 13, 2019	Q-Learning, Reinforcement Learning	Code Available
Reinforcement-Learning based routing for packet-optical networks with hybrid telemetry	Jun 18, 2024	Q-Learning	Code Available
Deep Reinforcement Learning for Imbalanced Classification	Jan 5, 2019	Classification, Decision Making	Code Available
Think Smart, Act SMARL! Analyzing Probabilistic Logic Shields for Multi-Agent Reinforcement Learning	Nov 7, 2024	Multi-agent Reinforcement Learning, Q-Learning	Code Available
Provably efficient RL with Rich Observations via Latent State Decoding	Jan 25, 2019	Clustering, Q-Learning	Code Available
Nonparametric Stochastic Compositional Gradient Descent for Q-Learning in Continuous Markov Decision Problems	Apr 19, 2018	Q-Learning, Stochastic Optimization	Code Available
Join Query Optimization with Deep Reinforcement Learning Algorithms	Nov 26, 2019	Attribute, Deep Reinforcement Learning	Code Available
Visual Exploration and Energy-aware Path Planning via Reinforcement Learning	Sep 26, 2019	Autonomous Vehicles, object-detection	Code Available
Understanding Multi-Step Deep Reinforcement Learning: A Systematic Study of the DQN Target	Jan 22, 2019	Deep Reinforcement Learning, Q-Learning	Code Available
AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization	May 28, 2024	D4RL, Offline RL	Code Available
Joint Path planning and Power Allocation of a Cellular-Connected UAV using Apprenticeship Learning via Deep Inverse Reinforcement Learning	Jun 15, 2023	Deep Reinforcement Learning, Q-Learning	Code Available
ViZDoom: A Doom-based AI Research Platform for Visual Reinforcement Learning	May 6, 2016	Atari Games, FPS Games	Code Available
Classification with Costly Features using Deep Reinforcement Learning	Nov 20, 2017	Classification, Classification with Costly Features	Code Available
Active Collection of Well-Being and Health Data in Mobile Devices	Jul 7, 2023	Q-Learning, Reinforcement Learning (RL)	Code Available
QBSO-FS: A Reinforcement Learning Based Bee Swarm Optimization Metaheuristic for Feature Selection	May 16, 2019	feature selection, Multi-agent Reinforcement Learning	Code Available
Taming the Noise in Reinforcement Learning via Soft Updates	Dec 28, 2015	Q-Learning, reinforcement-learning	Code Available
Topological Experience Replay	Mar 29, 2022	Q-Learning	Code Available
A Kernel Loss for Solving the Bellman Equation	May 25, 2019	Q-Learning, Reinforcement Learning	Code Available
Offline Reinforcement Learning for Learning to Dispatch for Job Shop Scheduling	Sep 16, 2024	Combinatorial Optimization, counterfactual	Code Available
