Q-Learning

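Q-learning is a model-free reinforcement learning algorithm. Its goal is to learn a policy that tells an agent which action to take in each state, by estimating the expected return Q(s, a) of taking action a in state s.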

(Image credit: Playing Atari with Deep Reinforcement Learning)
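
As a rough illustration of the description above, the following is a minimal sketch of tabular Q-learning with an epsilon-greedy policy. The corridor environment, the function names `q_learning` and `greedy_policy`, and all hyperparameter values are assumptions made for this example; they are not taken from any of the listed papers.

```python
import random


def q_learning(n_states=5, episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D corridor.

    States are 0..n_states-1. Action 0 moves left (staying put at state 0),
    action 1 moves right. Entering the last state gives reward 1 and ends
    the episode; every other transition gives reward 0.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]  # q[state][action]
    goal = n_states - 1
    for _ in range(episodes):
        state = 0
        while state != goal:
            # Epsilon-greedy action selection, breaking ties at random.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            next_state = max(0, state - 1) if action == 0 else state + 1
            reward = 1.0 if next_state == goal else 0.0
            # The terminal state has no future return.
            best_next = 0.0 if next_state == goal else max(q[next_state])
            # Q-learning update: move Q(s, a) toward r + gamma * max_a' Q(s', a').
            q[state][action] += alpha * (reward + gamma * best_next - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Return the action with the highest Q-value in each state."""
    return [max(range(len(row)), key=row.__getitem__) for row in q]


if __name__ == "__main__":
    table = q_learning()
    print(greedy_policy(table))
```

In this toy setting the learned policy is read off the table by taking the highest-valued action in each state. Deep Q-learning methods replace the table with a neural network that approximates Q(s, a).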

Papers

Showing 501–550 of 1918 papers

Title	Date	Tasks	Status
Deep Q-Network-Driven Catheter Segmentation in 3D US by Hybrid Constrained Semi-Supervised Learning and Dual-UNet	Jun 25, 2020	Q-Learning	Unverified
Deep Q-Network for Stochastic Process Environments	Aug 7, 2023	Q-Learning, reinforcement-learning	Unverified
Attitude Control of Highly Maneuverable Aircraft Using an Improved Q-learning	Oct 22, 2022	continuous-control, Continuous Control	Unverified
A Hybrid Q-Learning Sine-Cosine-based Strategy for Addressing the Combinatorial Test Suite Minimization Problem	Apr 27, 2018	Q-Learning	Unverified
Deep Recurrent Q-learning for Energy-constrained Coverage with a Mobile Robot	Oct 1, 2022	Q-Learning	Unverified
Analyzing Robustness of the Deep Reinforcement Learning Algorithm in Ramp Metering Applications Considering False Data Injection Attack and Defense	Jan 28, 2023	Adversarial Attack, Deep Reinforcement Learning	Unverified
A Hysteretic Q-learning Coordination Framework for Emerging Mobility Systems in Smart Cities	Nov 5, 2020	Q-Learning, reinforcement-learning	Unverified
Deep Reinforcement Fuzzing	Jan 14, 2018	Q-Learning, reinforcement-learning	Unverified
Adaptive Stochastic Resource Control: A Machine Learning Approach	Jan 15, 2014	BIG-bench Machine Learning, Clustering	Unverified
Deep reinforcement learning applied to an assembly sequence planning problem with user preferences	Apr 13, 2023	Decision Making, Deep Reinforcement Learning	Unverified
Diff-Transfer: Model-based Robotic Manipulation Skill Transfer via Differentiable Physics Simulation	Oct 7, 2023	Q-Learning	Unverified
Analytics of Business Time Series Using Machine Learning and Bayesian Inference	May 25, 2022	Bayesian Inference, BIG-bench Machine Learning	Unverified
Bootstrapped Hindsight Experience replay with Counterintuitive Prioritization	Sep 29, 2021	Q-Learning	Unverified
A deep Q-learning method for optimizing visual search strategies in backgrounds of dynamic noise	Jan 28, 2022	Q-Learning, reinforcement-learning	Unverified
Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task	Oct 15, 2024	ARC, Decision Making	Unverified
Analytically Tractable Bayesian Deep Q-Learning	Jun 21, 2021	Q-Learning, reinforcement-learning	Unverified
Analysis of Reinforcement Learning Schemes for Trajectory Optimization of an Aerial Radio Unit	Nov 18, 2022	Q-Learning, reinforcement-learning	Unverified
Boosting Offline Reinforcement Learning with Residual Generative Modeling	Jun 19, 2021	Offline RL, Q-Learning	Unverified
Automatic Reward Shaping from Confounded Offline Data	May 16, 2025	Atari Games, Deep Reinforcement Learning	Unverified
BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL	May 28, 2025	Bayesian Optimization, Hyperparameter Optimization	Unverified
BMG-Q: Localized Bipartite Match Graph Attention Q-Learning for Ride-Pooling Order Dispatch	Jan 23, 2025	Graph Attention, Graph Sampling	Unverified
Analysis of Q-learning with Adaptation and Momentum Restart for Gradient Descent	Jul 15, 2020	Atari Games, Q-Learning	Unverified
Analysis of Multiscale Reinforcement Q-Learning Algorithms for Mean Field Control Games	May 27, 2024	Q-Learning	Unverified
Blackwell Online Learning for Markov Decision Processes	Dec 28, 2020	Learning Theory, Q-Learning	Unverified
A Deep Q-Learning Method for Downlink Power Allocation in Multi-Cell Networks	Apr 30, 2019	Benchmarking, Deep Reinforcement Learning	Unverified
Differentiable Quantum Architecture Search for Quantum Reinforcement Learning	Sep 19, 2023	Q-Learning, Quantum Machine Learning	Unverified
Biomimetic Ultra-Broadband Perfect Absorbers Optimised with Reinforcement Learning	Oct 28, 2019	Q-Learning, reinforcement-learning	Unverified
BIBI System Description: Building with CNNs and Breaking with Deep Reinforcement Learning	Sep 1, 2017	Deep Reinforcement Learning, Q-Learning	Unverified
An Adiabatic Theorem for Policy Tracking with TD-learning	Oct 24, 2020	Q-Learning	Unverified
Bias or Optimality? Disentangling Bayesian Inference and Learning Biases in Human Decision-Making	May 12, 2025	Bayesian Inference, Decision Making	Unverified
A Multistep Lyapunov Approach for Finite-Time Analysis of Biased Stochastic Approximation	Sep 10, 2019	Q-Learning, Reinforcement Learning	Unverified
A Deep Q-learning/genetic Algorithms Based Novel Methodology For Optimizing Covid-19 Pandemic Government Actions	May 15, 2020	Q-Learning	Unverified
3D Simulation for Robot Arm Control with Deep Q-Learning	Sep 13, 2016	Deep Reinforcement Learning, Q-Learning	Unverified
Differentially Private Deep Q-Learning for Pattern Privacy Preservation in MEC Offloading	Feb 9, 2023	Edge-computing, Q-Learning	Unverified
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning	Aug 6, 1999	Q-Learning, reinforcement-learning	Unverified
Best Possible Q-Learning	Feb 2, 2023	Multi-agent Reinforcement Learning, Q-Learning	Unverified
A Multi-step and Resilient Predictive Q-learning Algorithm for IoT with Human Operators in the Loop: A Case Study in Water Supply Networks	Jun 6, 2020	Q-Learning, Scheduling	Unverified
Benchmarking projective simulation in navigation problems	Apr 23, 2018	Benchmarking, Q-Learning	Unverified
A Deep Q-Learning based Smart Scheduling of EVs for Demand Response in Smart Grids	Jan 5, 2024	Q-Learning, Scheduling	Unverified
A Convergent Variant of the Boltzmann Softmax Operator in Reinforcement Learning	Sep 27, 2018	Atari Games, Q-Learning	Unverified
Amortized Q-learning with Model-based Action Proposals for Autonomous Driving on Highways	Dec 6, 2020	Autonomous Driving, Decision Making	Unverified
Amortized Noisy Channel Neural Machine Translation	Dec 16, 2021	Imitation Learning, Knowledge Distillation	Unverified
A Multi-Agent Reinforcement Learning Approach For Safe and Efficient Behavior Planning Of Connected Autonomous Vehicles	Mar 9, 2020	Autonomous Vehicles, Multi-agent Reinforcement Learning	Unverified
A deep Q-Learning based Path Planning and Navigation System for Firefighting Environments	Nov 12, 2020	Q-Learning	Unverified
DGFN: Double Generative Flow Networks	Oct 30, 2023	Drug Discovery, Q-Learning	Unverified
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation	Oct 15, 2024	Decision Making, Offline RL	Unverified
β-DQN: Improving Deep Q-Learning By Evolving the Behavior	Jan 1, 2025	Deep Reinforcement Learning, Efficient Exploration	Unverified
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems	Aug 17, 2016	Deep Reinforcement Learning, Efficient Exploration	Unverified
A Modified Q-Learning Algorithm for Rate-Profiling of Polarization Adjusted Convolutional (PAC) Codes	Oct 4, 2021	Q-Learning, reinforcement-learning	Unverified
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems	Nov 15, 2017	Deep Reinforcement Learning, Efficient Exploration	Unverified

No leaderboard results yet.