Q-Learning

The goal of Q-learning is to learn a policy that tells an agent which action to take in each state.

(Image credit: Playing Atari with Deep Reinforcement Learning)
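
As a rough illustration of the description above, the following is a minimal sketch of tabular Q-learning. Everything in it is an assumption made for the example and does not come from this page or from any listed paper: the five-state chain environment, the `step`, `train`, and `greedy_policy` helpers, and the hyperparameter values are all hypothetical.

```python
import random

N_STATES = 5          # hypothetical chain: states 0..4, state 4 is the goal
ACTIONS = (0, 1)      # 0 = move left, 1 = move right

def step(state, action):
    """Toy environment (assumed): reward 1.0 only when the goal is reached."""
    nxt = max(0, state - 1) if action == 0 else min(N_STATES - 1, state + 1)
    done = nxt == N_STATES - 1
    return nxt, (1.0 if done else 0.0), done

def train(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Learn a table q[state][action] with the one-step Q-learning update."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Epsilon-greedy: explore with probability epsilon (or on a tie),
            # otherwise take the action with the highest current estimate.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.choice(ACTIONS)
            else:
                action = max(ACTIONS, key=lambda a: q[state][a])
            nxt, reward, done = step(state, action)
            # Bootstrapped target: reward plus discounted best next-state value.
            target = reward if done else reward + gamma * max(q[nxt])
            q[state][action] += alpha * (target - q[state][action])
            state = nxt
    return q

def greedy_policy(q):
    """The learned policy: the best-valued action in each non-terminal state."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]

q_table = train()
policy = greedy_policy(q_table)
print(policy)
```

In this toy setting the learned policy is simply the greedy action under the Q-table, which is the sense in which Q-learning "learns a policy". The deep variants in the papers below replace the table with a function approximator.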

Papers

Showing 1301–1350 of 1918 papers

Title	Date	Tasks	Status	Hype
Runtime Adaptation in Wireless Sensor Nodes Using Structured Learning	Jun 15, 2020	Q-Learning, Reinforcement Learning (RL)	Unverified	0
Self-Imitation Learning via Generalized Lower Bound Q-learning	Jun 12, 2020	continuous-control, Continuous Control	Unverified	0
Safety-guaranteed Reinforcement Learning based on Multi-class Support Vector Machine	Jun 12, 2020	Q-Learning, reinforcement-learning	Unverified	0
Human and Multi-Agent collaboration in a human-MARL teaming framework	Jun 12, 2020	Multi-agent Reinforcement Learning, Q-Learning	Unverified	0
Deep Reinforcement Learning for Neural Control	Jun 12, 2020	Deep Reinforcement Learning, Q-Learning	Unverified	0
Decorrelated Double Q-learning	Jun 12, 2020	continuous-control, Continuous Control	Unverified	0
Exploration by Maximizing Rényi Entropy for Reward-Free RL Framework	Jun 11, 2020	Q-Learning, Reinforcement Learning (RL)	Unverified	0
Zeroth-Order Supervised Policy Improvement	Jun 11, 2020	continuous-control, Continuous Control	Unverified	0
Q-greedyUCB: a New Exploration Policy for Adaptive and Resource-efficient Scheduling	Jun 10, 2020	Decision Making, Q-Learning	Unverified	0
Privacy-Cost Management in Smart Meters with Mutual Information-Based Reinforcement Learning	Jun 10, 2020	Deep Reinforcement Learning, Management	Unverified	0
Multi-Agent Reinforcement Learning in a Realistic Limit Order Book Market Simulation	Jun 10, 2020	Multi-agent Reinforcement Learning, Q-Learning	Unverified	0
Model-Free Algorithm and Regret Analysis for MDPs with Long-Term Constraints	Jun 10, 2020	Q-Learning	Unverified	0
Fitted Q-Learning for Relational Domains	Jun 10, 2020	Q-Learning	Unverified	0
Self-Supervised Reinforcement Learning for Recommender Systems	Jun 10, 2020	Q-Learning, Recommendation Systems	Unverified	0
Reinforcement Learning-Based Joint Self-Optimisation Method for the Fuzzy Logic Handover Algorithm in 5G HetNets	Jun 9, 2020	Clustering, Management	Unverified	0
Balancing a CartPole System with Reinforcement Learning -- A Tutorial	Jun 8, 2020	OpenAI Gym, Q-Learning	Unverified	0
A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret	Jun 8, 2020	Q-Learning, reinforcement-learning	Unverified	0
Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory	Jun 8, 2020	Deep Reinforcement Learning, Q-Learning	Unverified	0
Conservative Q-Learning for Offline Reinforcement Learning	Jun 8, 2020	continuous-control, Continuous Control	Code Available	1
A Multi-step and Resilient Predictive Q-learning Algorithm for IoT with Human Operators in the Loop: A Case Study in Water Supply Networks	Jun 6, 2020	Q-Learning, Scheduling	Unverified	0
Logical Team Q-learning: An approach towards factored policies in cooperative MARL	Jun 5, 2020	Q-Learning	Unverified	0
Sample Complexity of Asynchronous Q-Learning: Sharper Analysis and Variance Reduction	Jun 4, 2020	Q-Learning	Unverified	0
A Novel Update Mechanism for Q-Networks Based On Extreme Learning Machines	Jun 4, 2020	Q-Learning, reinforcement-learning	Code Available	0
Multi-Agent Determinantal Q-Learning	Jun 2, 2020	Q-Learning	Code Available	1
Mitigating Bias in Face Recognition Using Skewness-Aware Reinforcement Learning	Jun 1, 2020	Face Recognition, Fairness	Unverified	0
Hyperparameter optimization with REINFORCE and Transformers	Jun 1, 2020	Benchmarking, Hyperparameter Optimization	Unverified	0
Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization	May 31, 2020	counterfactual, Multi-agent Reinforcement Learning	Unverified	0
Learning-Based Joint User-AP Association and Resource Allocation in Ultra Dense Network	May 28, 2020	Q-Learning	Unverified	0
Modeling Penetration Testing with Reinforcement Learning Using Capture-the-Flag Challenges: Trade-offs between Model-free Learning and A Priori Knowledge	May 26, 2020	Q-Learning, reinforcement-learning	Code Available	1
Active Measure Reinforcement Learning for Observation Cost Minimization	May 26, 2020	Decision Making, Q-Learning	Unverified	0
Deep Reinforcement Learning Based Power Allocation for D2D Network	May 25, 2020	Deep Reinforcement Learning, Q-Learning	Unverified	0
Should artificial agents ask for help in human-robot collaborative problem-solving?	May 25, 2020	Q-Learning	Unverified	0
A reinforcement learning based decision support system in textile manufacturing process	May 20, 2020	Decision Making, Q-Learning	Unverified	0
Safe Learning for Near Optimal Scheduling	May 19, 2020	Q-Learning, Scheduling	Unverified	0
Entropy-Augmented Entropy-Regularized Reinforcement Learning and a Continuous Path from Policy Gradient to Q-Learning	May 18, 2020	Q-Learning	Unverified	0
Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement Learning: An In Silico Validation	May 18, 2020	Deep Reinforcement Learning, Q-Learning	Unverified	0
A Deep Q-learning/genetic Algorithms Based Novel Methodology For Optimizing Covid-19 Pandemic Government Actions	May 15, 2020	Q-Learning	Unverified	0
A Deep Reinforcement Learning Approach to Efficient Drone Mobility Support	May 11, 2020	Deep Reinforcement Learning, Q-Learning	Unverified	0
An FPGA-Based On-Device Reinforcement Learning Approach using Online Sequential Learning	May 10, 2020	L2 Regularization, OpenAI Gym	Unverified	0
Reinforcement Learning for Thermostatically Controlled Loads Control using Modelica and Python	May 9, 2020	Q-Learning, reinforcement-learning	Unverified	0
Optimal Beam Association for High Mobility mmWave Vehicular Networks: Lightweight Parallel Reinforcement Learning Approach	May 2, 2020	Deep Reinforcement Learning, Q-Learning	Unverified	0
Learning Efficient Parameter Server Synchronization Policies for Distributed SGD	May 1, 2020	Q-Learning, Reinforcement Learning (RL)	Unverified	0
Implementing Inductive bias for different navigation tasks through diverse RNN attrractors	May 1, 2020	Inductive Bias, Q-Learning	Unverified	0
Whittle index based Q-learning for restless bandits with average reward	Apr 29, 2020	Q-Learning, reinforcement-learning	Unverified	0
Evolution of Q Values for Deep Q Learning in Stable Baselines	Apr 24, 2020	Q-Learning, Reinforcement Learning	Unverified	0
Learning Dialog Policies from Weak Demonstrations	Apr 23, 2020	Atari Games, Deep Reinforcement Learning	Unverified	0
Energy-Efficient Power Allocation and Q-Learning-Based Relay Selection for Relay-Aided D2D Communication	Apr 20, 2020	Q-Learning	Unverified	0
Intelligent Querying for Target Tracking in Camera Networks using Deep Q-Learning with n-Step Bootstrapping	Apr 20, 2020	Q-Learning, Reinforcement Learning	Unverified	0
Spatial Action Maps for Mobile Manipulation	Apr 20, 2020	Q-Learning, Value prediction	Code Available	1
Deep Reinforcement Learning for Adaptive Learning Systems	Apr 17, 2020	Deep Reinforcement Learning, Q-Learning	Unverified	0


No leaderboard results yet.