Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 851–900 of 1918 papers

Title	Date	Tasks	Status
Hyperparameter Optimization for Tracking With Continuous Deep Q-Learning	Jun 1, 2018	Hyperparameter OptimizationObject Tracking	—Unverified
HyperQ-Opt: Q-learning for Hyperparameter Optimization	Dec 23, 2024	Bayesian OptimizationHyperparameter Optimization	—Unverified
Action Learning for 3D Point Cloud Based Organ Segmentation	Jun 14, 2018	Organ SegmentationQ-Learning	—Unverified
Enhanced Deep Q-Learning for 2D Self-Driving Cars: Implementation and Evaluation on a Custom Track Environment	Feb 13, 2024	Q-LearningSelf-Driving Cars	—Unverified
Energy Sharing for Multiple Sensor Nodes with Finite Buffers	Mar 17, 2015	Q-Learning	—Unverified
Cell Switching in HAPS-Aided Networking: How the Obscurity of Traffic Loads Affects the Decision	May 1, 2024	Q-Learning	—Unverified
Imagination-Limited Q-Learning for Offline Reinforcement Learning	May 18, 2025	D4RLQ-Learning	—Unverified
Imitating Language via Scalable Inverse Reinforcement Learning	Sep 2, 2024	DiversityImitation Learning	—Unverified
Implementing Inductive bias for different navigation tasks through diverse RNN attrractors	May 1, 2020	Inductive BiasQ-Learning	—Unverified
Energy Minimization in UAV-Aided Networks: Actor-Critic Learning for Constrained Scheduling Optimization	Jun 24, 2020	Combinatorial OptimizationDeep Reinforcement Learning	—Unverified
Implicit Constraint-Aware Off-Policy Correction for Offline Reinforcement Learning	Jun 16, 2025	Q-Learning	—Unverified
Improved Q-learning based Multi-hop Routing for UAV-Assisted Communication	Aug 17, 2024	Collision AvoidanceQ-Learning	—Unverified
Energy-Efficient Power Allocation and Q-Learning-Based Relay Selection for Relay-Aided D2D Communication	Apr 20, 2020	Q-Learning	—Unverified
Improve Value Estimation of Q Function and Reshape Reward with Monte Carlo Tree Search	Oct 15, 2024	Q-Learning	—Unverified
A new convergent variant of Q-learning with linear function approximation	Dec 1, 2020	Q-LearningReinforcement Learning (RL)	—Unverified
Improving Performance of Spike-based Deep Q-Learning using Ternary Neurons	Jun 3, 2025	Atari GamesDecision Making	—Unverified
Energy Consumption and Battery Aging Minimization Using a Q-learning Strategy for a Battery/Ultracapacitor Electric Vehicle	Oct 27, 2020	energy managementManagement	—Unverified
Improving Search through A3C Reinforcement Learning based Conversational Agent	Sep 17, 2017	Q-Learningreinforcement-learning	—Unverified
Causal Mean Field Multi-Agent Reinforcement Learning	Feb 20, 2025	Multi-agent Reinforcement LearningQ-Learning	—Unverified
I'm sorry Dave, I'm afraid I can't do that, Deep Q-learning from forbidden action	Oct 4, 2019	Industrial RobotsQ-Learning	—Unverified
Energy-aware optimization of UAV base stations placement via decentralized multi-agent Q-learning	Jun 1, 2021	Decision MakingQ-Learning	—Unverified
Energy and Service-priority aware Trajectory Design for UAV-BSs using Double Q-Learning	Oct 26, 2020	Q-Learning	—Unverified
Infinite-Horizon Reach-Avoid Zero-Sum Games via Deep Reinforcement Learning	Mar 18, 2022	Deep Reinforcement LearningQ-Learning	—Unverified
Causal Deep Reinforcement Learning Using Observational Data	Nov 28, 2022	Autonomous DrivingCausal Inference	—Unverified
A New Approach for Tactical Decision Making in Lane Changing: Sample Efficient Deep Q Learning with a Safety Feedback Reward	Sep 24, 2020	Decision MakingQ-Learning	—Unverified
Information Theoretic Model Predictive Q-Learning	Dec 31, 2019	Decision Makingmodel	—Unverified
A Deep Reinforcement Learning Approach to Battery Management in Dairy Farming via Proximal Policy Optimization	Jul 1, 2024	Deep Reinforcement Learningenergy management	—Unverified
In Hindsight: A Smooth Reward for Steady Exploration	Jun 24, 2019	Atari GamesQ-Learning	—Unverified
In Search for Architectures and Loss Functions in Multi-Objective Reinforcement Learning	Jul 23, 2024	Multi-Objective Reinforcement LearningQ-Learning	—Unverified
Instance-optimality in optimal value estimation: Adaptivity via variance-reduced Q-learning	Jun 28, 2021	Q-Learning	—Unverified
EnCoMP: Enhanced Covert Maneuver Planning with Adaptive Threat-Aware Visibility Estimation using Offline Reinforcement Learning	Mar 29, 2024	NavigateQ-Learning	—Unverified
Encoders and Decoders for Quantum Expander Codes Using Machine Learning	Sep 6, 2019	BIG-bench Machine LearningDecoder	—Unverified
Integrated Freeway Traffic Control Using Q-Learning with Adjacent Arterial Traffic Considerations	Oct 25, 2023	Q-Learning	—Unverified
Integrated Sensing and Communication Neighbor Discovery for MANET with Gossip Mechanism	Oct 11, 2023	Integrated sensing and communicationISAC	—Unverified
Integrated trucks assignment and scheduling problem with mixed service mode docks: A Q-learning based adaptive large neighborhood search algorithm	Dec 12, 2024	Q-LearningScheduling	—Unverified
Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments	Oct 9, 2019	Q-Learningreinforcement-learning	—Unverified
Catch Me If You Can: Improving Adversaries in Cyber-Security With Q-Learning Algorithms	Feb 7, 2023	Q-Learning	—Unverified
Intelligent Agricultural Management Considering N_2O Emission and Climate Variability with Uncertainties	Feb 13, 2024	Decision MakingManagement	—Unverified
Intelligent Autonomous Intersection Management	Feb 9, 2022	Autonomous VehiclesManagement	—Unverified
Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL	Apr 15, 2024	GPUOffline RL	—Unverified
Intelligent O-RAN Traffic Steering for URLLC Through Deep Reinforcement Learning	Mar 3, 2023	Deep Reinforcement LearningQ-Learning	—Unverified
Intelligent Querying for Target Tracking in Camera Networks using Deep Q-Learning with n-Step Bootstrapping	Apr 20, 2020	Q-LearningReinforcement Learning	—Unverified
Interactive Double Deep Q-network: Integrating Human Interventions and Evaluative Predictions in Reinforcement Learning of Autonomous Driving	Apr 28, 2025	Autonomous DrivingQ-Learning	—Unverified
Interactive Learning from Natural Language and Demonstrations using Signal Temporal Logic	Jul 1, 2022	Formal LogicQ-Learning	—Unverified
Empirical Q-Value Iteration	Nov 30, 2014	Q-Learning	—Unverified
Internet of Things Applications: Animal Monitoring with Unmanned Aerial Vehicle	Oct 17, 2016	Q-LearningTraveling Salesman Problem	—Unverified
Deep Constrained Q-learning	Mar 20, 2020	Autonomous DrivingDecision Making	—Unverified
Interpretable Option Discovery using Deep Q-Learning and Variational Autoencoders	Oct 3, 2022	Deep Reinforcement LearningQ-Learning	—Unverified
Interpretable performance analysis towards offline reinforcement learning: A dataset perspective	May 12, 2021	Offline RLQ-Learning	—Unverified
An Evolutionary Framework for Connect-4 as Test-Bed for Comparison of Advanced Minimax, Q-Learning and MCTS	May 26, 2024	Decision MakingQ-Learning	—Unverified

Show:10 25 50

← PrevPage 18 of 39Next →

No leaderboard results yet.