Q-Learning

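The goal of Q-learning is to learn a policy that tells an agent which action to take in each state. It does so by estimating the value Q(s, a) of taking action a in state s, then choosing the action with the highest estimated value.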
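As an illustration of how such a policy can be learned, here is a minimal sketch of tabular Q-learning in Python. The corridor environment, the function names (q_learning, greedy_policy), and all hyperparameter values are assumptions made up for this example and do not come from any paper listed on this page.

```python
import random

def q_learning(n_states=5, episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D corridor.

    States are 0..n_states-1; action 0 moves left, action 1 moves right.
    Reaching the right-most state yields reward 1 and ends the episode.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]  # q[state][action]
    goal = n_states - 1
    for _ in range(episodes):
        s = 0
        while s != goal:
            # Epsilon-greedy action selection, breaking ties at random
            if rng.random() < epsilon:
                a = rng.randrange(2)
            else:
                best = max(q[s])
                a = rng.choice([act for act in (0, 1) if q[s][act] == best])
            # Deterministic transition (assumed toy dynamics)
            s_next = max(0, s - 1) if a == 0 else s + 1
            r = 1.0 if s_next == goal else 0.0
            # Q-learning update: move Q(s, a) toward r + gamma * max_a' Q(s', a')
            target = r + gamma * max(q[s_next])
            q[s][a] += alpha * (target - q[s][a])
            s = s_next
    return q

def greedy_policy(q):
    """Return the highest-valued action for every state."""
    return [max(range(len(row)), key=lambda a: row[a]) for row in q]

if __name__ == "__main__":
    q_table = q_learning()
    print(greedy_policy(q_table))
```

In this toy setting the learned policy should pick action 1 (move right) in every non-terminal state; the entry for the terminal state is arbitrary because its values are never updated.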

(Image credit: Playing Atari with Deep Reinforcement Learning)

Papers


Showing 1551–1600 of 1918 papers

Title	Date	Tasks	Status
Control of nonlinear, complex and black-boxed greenhouse system with reinforcement learning	Jul 30, 2019	Q-Learning, reinforcement-learning	Code Available
Towards Model-based Reinforcement Learning for Industry-near Environments	Jul 27, 2019	Deep Reinforcement Learning, Model-based Reinforcement Learning	Code Available
Q-MIND: Defeating Stealthy DoS Attacks in SDN with a Machine-learning based Defense Framework	Jul 27, 2019	Anomaly Detection, BIG-bench Machine Learning	Unverified
Potential-Based Advice for Stochastic Policy Learning	Jul 20, 2019	Q-Learning, Reinforcement Learning	Unverified
Photonic architecture for reinforcement learning	Jul 17, 2019	Active Learning, Q-Learning	Unverified
Model-free Control of Chaos with Continuous Deep Q-learning	Jul 16, 2019	Q-Learning, reinforcement-learning	Unverified
An intelligent financial portfolio trading strategy using deep Q-learning	Jul 8, 2019	Q-Learning	Code Available
Q-learning pour la résolution des anaphores pronominales en langue arabe (Q-learning for pronominal anaphora resolution in Arabic texts)	Jul 1, 2019	Q-Learning	Unverified
Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog	Jun 30, 2019	Deep Reinforcement Learning, Open-Domain Dialog	Unverified
QFlip: An Adaptive Reinforcement Learning Strategy for the FlipIt Security Game	Jun 27, 2019	OpenAI Gym, Q-Learning	Code Available
Towards Empathic Deep Q-Learning	Jun 26, 2019	Ethics, Q-Learning	Code Available
Q-Learning Inspired Self-Tuning for Energy Efficiency in HPC	Jun 26, 2019	Q-Learning	Unverified
In Hindsight: A Smooth Reward for Steady Exploration	Jun 24, 2019	Atari Games, Q-Learning	Unverified
Deceptive Reinforcement Learning Under Adversarial Manipulations on Cost Signals	Jun 24, 2019	Q-Learning, reinforcement-learning	Unverified
Optimal Use of Experience in First Person Shooter Environments	Jun 24, 2019	Q-Learning, Reinforcement Learning	Unverified
Neural networks with motivation	Jun 23, 2019	Hierarchical Reinforcement Learning, Navigate	Unverified
Reinforcement Learning-Based Trajectory Design for the Aerial Base Stations	Jun 23, 2019	Q-Learning, reinforcement-learning	Unverified
Cache-Aided NOMA Mobile Edge Computing: A Reinforcement Learning Approach	Jun 20, 2019	Edge-computing, Q-Learning	Unverified
Reward Prediction Error as an Exploration Objective in Deep RL	Jun 19, 2019	Atari Games, Continuous Control	Unverified
A Generalized Minimax Q-learning Algorithm for Two-Player Zero-Sum Stochastic Games	Jun 16, 2019	Q-Learning	Unverified
Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle	Jun 14, 2019	Q-Learning, reinforcement-learning	Unverified
Variance-reduced Q-learning is minimax optimal	Jun 11, 2019	Q-Learning	Unverified
"Did You Hear That?" Learning to Play Video Games from Audio Cues	Jun 10, 2019	Game Design, Navigate	Unverified
Deep Reinforcement Learning with Discrete Normalized Advantage Functions for Resource Management in Network Slicing	Jun 10, 2019	Deep Reinforcement Learning, Management	Unverified
Deep Q-Learning for Directed Acyclic Graph Generation	Jun 5, 2019	Deep Reinforcement Learning, Graph Generation	Unverified
Risk-Sensitive Compact Decision Trees for Autonomous Execution in Presence of Simulated Market Response	Jun 5, 2019	Q-Learning, reinforcement-learning	Unverified
Exploration with Unreliable Intrinsic Reward in Multi-Agent Reinforcement Learning	Jun 5, 2019	Multi-agent Reinforcement Learning, Q-Learning	Unverified
Escaping the State of Nature: A Hobbesian Approach to Cooperation in Multi-agent Reinforcement Learning	Jun 5, 2019	Multi-agent Reinforcement Learning, Philosophy	Unverified
Reinforcement Learning with Low-Complexity Liquid State Machines	Jun 4, 2019	Atari Games, Deep Reinforcement Learning	Code Available
On-board Deep Q-Network for UAV-assisted Online Power Transfer and Data Collection	Jun 4, 2019	Deep Reinforcement Learning, Q-Learning	Unverified
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction	Jun 3, 2019	continuous-control, Continuous Control	Code Available
Feature-Based Q-Learning for Two-Player Stochastic Games	Jun 2, 2019	Q-Learning, Vocal Bursts Valence Prediction	Unverified
RSS-Based Q-Learning for Indoor UAV Navigation	May 31, 2019	Q-Learning	Unverified
Provably Efficient Q-Learning with Low Switching Cost	May 30, 2019	Q-Learning	Unverified
Reinforcement Learning for Slate-based Recommender Systems: A Tractable Decomposition and Practical Methodology	May 29, 2019	Q-Learning, Recommendation Systems	Unverified
Learning NP-Hard Multi-Agent Assignment Planning using GNN: Inference on a Random Graph and Provable Auction-Fitted Q-learning	May 29, 2019	Combinatorial Optimization, Decision Making	Unverified
Solving NP-Hard Problems on Graphs with Extended AlphaGo Zero	May 28, 2019	Combinatorial Optimization, Graph Neural Network	Code Available
A General Markov Decision Process Framework for Directly Learning Optimal Control Policies	May 28, 2019	Q-Learning, Reinforcement Learning	Unverified
Finite-Sample Analysis of Nonlinear Stochastic Approximation with Applications in Reinforcement Learning	May 27, 2019	Q-Learning, reinforcement-learning	Code Available
Prioritized Sequence Experience Replay	May 25, 2019	Deep Reinforcement Learning, Q-Learning	Unverified
A Kernel Loss for Solving the Bellman Equation	May 25, 2019	Q-Learning, Reinforcement Learning	Code Available
Adaptive Symmetric Reward Noising for Reinforcement Learning	May 24, 2019	Autonomous Driving, Q-Learning	Code Available
Neural Temporal-Difference and Q-Learning Provably Converge to Global Optima	May 24, 2019	Deep Reinforcement Learning, Q-Learning	Code Available
MQLV: Optimal Policy of Money Management in Retail Banking with Q-Learning	May 24, 2019	Decision Making, Management	Unverified
Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire Evacuation Environment	May 23, 2019	OpenAI Gym, Q-Learning	Unverified
Deep Reinforcement Learning Based Parameter Control in Differential Evolution	May 20, 2019	Deep Reinforcement Learning, Q-Learning	Code Available
Stochastic Variance Reduction for Deep Q-learning	May 20, 2019	Deep Reinforcement Learning, Q-Learning	Unverified
Reinforcement Learning for Learning of Dynamical Systems in Uncertain Environment: a Tutorial	May 19, 2019	Q-Learning, reinforcement-learning	Unverified
QBSO-FS: A Reinforcement Learning Based Bee Swarm Optimization Metaheuristic for Feature Selection	May 16, 2019	feature selection, Multi-agent Reinforcement Learning	Code Available
Autonomous Penetration Testing using Reinforcement Learning	May 15, 2019	Q-Learning, reinforcement-learning	Unverified

