SOTAVerified

Q-Learning

Q-learning is a model-free reinforcement learning algorithm. It learns an action-value function Q(s, a), from which the agent derives a policy that tells it which action to take in each state.

(Image credit: Playing Atari with Deep Reinforcement Learning)
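To make the description above concrete, here is a minimal sketch of tabular Q-learning. The environment is a hypothetical 1-D chain invented for illustration (it is not taken from any paper listed here), and the function names `q_learning` and `greedy_policy` as well as the hyperparameter values are assumptions rather than recommended settings.

```python
import random


def q_learning(n_states=5, episodes=500, alpha=0.1, gamma=0.9,
               epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D chain.

    States are 0..n_states-1. Action 0 moves left (staying put at
    state 0), action 1 moves right. Entering the rightmost state
    yields reward 1 and ends the episode.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]  # q[state][action]
    goal = n_states - 1

    for _ in range(episodes):
        state = 0
        while state != goal:
            # Epsilon-greedy behaviour policy; ties are broken at random.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1

            next_state = max(0, state - 1) if action == 0 else state + 1
            reward = 1.0 if next_state == goal else 0.0

            # Q-learning target: reward plus discounted best next value.
            # A terminal next state contributes no future value.
            if next_state == goal:
                target = reward
            else:
                target = reward + gamma * max(q[next_state])

            # Move the estimate a step of size alpha toward the target.
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Return the highest-valued action index for each state."""
    return [max(range(len(row)), key=row.__getitem__) for row in q]
```

Calling `greedy_policy(q_learning())` returns one action per state. In this toy chain the learned policy should prefer moving right in every non-terminal state, because that is the shortest path to the reward.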

Papers

Showing 1551-1600 of 1918 papers

Title | Status | Hype
Control of nonlinear, complex and black-boxed greenhouse system with reinforcement learning | Code | 0
Towards Model-based Reinforcement Learning for Industry-near Environments | Code | 0
Q-MIND: Defeating Stealthy DoS Attacks in SDN with a Machine-learning based Defense Framework | | 0
Potential-Based Advice for Stochastic Policy Learning | | 0
Photonic architecture for reinforcement learning | | 0
Model-free Control of Chaos with Continuous Deep Q-learning | | 0
An intelligent financial portfolio trading strategy using deep Q-learning | Code | 0
Q-learning pour la résolution des anaphores pronominales en langue arabe (Q-learning for pronominal anaphora resolution in Arabic texts) | | 0
Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog | | 0
QFlip: An Adaptive Reinforcement Learning Strategy for the FlipIt Security Game | Code | 0
Towards Empathic Deep Q-Learning | Code | 0
Q-Learning Inspired Self-Tuning for Energy Efficiency in HPC | | 0
In Hindsight: A Smooth Reward for Steady Exploration | | 0
Deceptive Reinforcement Learning Under Adversarial Manipulations on Cost Signals | | 0
Optimal Use of Experience in First Person Shooter Environments | | 0
Neural networks with motivation | | 0
Reinforcement Learning-Based Trajectory Design for the Aerial Base Stations | | 0
Cache-Aided NOMA Mobile Edge Computing: A Reinforcement Learning Approach | | 0
Reward Prediction Error as an Exploration Objective in Deep RL | | 0
A Generalized Minimax Q-learning Algorithm for Two-Player Zero-Sum Stochastic Games | | 0
Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle | | 0
Variance-reduced Q-learning is minimax optimal | | 0
"Did You Hear That?" Learning to Play Video Games from Audio Cues | | 0
Deep Reinforcement Learning with Discrete Normalized Advantage Functions for Resource Management in Network Slicing | | 0
Deep Q-Learning for Directed Acyclic Graph Generation | | 0
Risk-Sensitive Compact Decision Trees for Autonomous Execution in Presence of Simulated Market Response | | 0
Exploration with Unreliable Intrinsic Reward in Multi-Agent Reinforcement Learning | | 0
Escaping the State of Nature: A Hobbesian Approach to Cooperation in Multi-agent Reinforcement Learning | | 0
Reinforcement Learning with Low-Complexity Liquid State Machines | Code | 0
On-board Deep Q-Network for UAV-assisted Online Power Transfer and Data Collection | | 0
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction | Code | 0
Feature-Based Q-Learning for Two-Player Stochastic Games | | 0
RSS-Based Q-Learning for Indoor UAV Navigation | | 0
Provably Efficient Q-Learning with Low Switching Cost | | 0
Reinforcement Learning for Slate-based Recommender Systems: A Tractable Decomposition and Practical Methodology | | 0
Learning NP-Hard Multi-Agent Assignment Planning using GNN: Inference on a Random Graph and Provable Auction-Fitted Q-learning | | 0
Solving NP-Hard Problems on Graphs with Extended AlphaGo Zero | Code | 0
A General Markov Decision Process Framework for Directly Learning Optimal Control Policies | | 0
Finite-Sample Analysis of Nonlinear Stochastic Approximation with Applications in Reinforcement Learning | Code | 0
Prioritized Sequence Experience Replay | | 0
A Kernel Loss for Solving the Bellman Equation | Code | 0
Adaptive Symmetric Reward Noising for Reinforcement Learning | Code | 0
Neural Temporal-Difference and Q-Learning Provably Converge to Global Optima | Code | 0
MQLV: Optimal Policy of Money Management in Retail Banking with Q-Learning | | 0
Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire Evacuation Environment | | 0
Deep Reinforcement Learning Based Parameter Control in Differential Evolution | Code | 0
Stochastic Variance Reduction for Deep Q-learning | | 0
Reinforcement Learning for Learning of Dynamical Systems in Uncertain Environment: a Tutorial | | 0
QBSO-FS: A Reinforcement Learning Based Bee Swarm Optimization Metaheuristic for Feature Selection | Code | 0
Autonomous Penetration Testing using Reinforcement Learning | | 0
Page 32 of 39

No leaderboard results yet.