Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.
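
As a rough illustration of that goal, the following is a minimal sketch of tabular Q-learning, not taken from any paper listed here. The five-state corridor environment, the `step`, `q_learning`, and `greedy_policy` helpers, and the hyperparameter values are all assumptions chosen for the example. The learned table of action values is turned into a policy by picking the highest-valued action in each state.

```python
import random

# Hypothetical 5-state corridor: states 0..4, actions 0 (left) and 1 (right).
# Reaching state 4 ends the episode with reward 1; every other step gives 0.
N_STATES, N_ACTIONS, GOAL = 5, 2, 4


def step(state, action):
    """Deterministic toy dynamics (an assumption made for this sketch)."""
    next_state = max(0, state - 1) if action == 0 else min(GOAL, state + 1)
    reward = 1.0 if next_state == GOAL else 0.0
    return next_state, reward, next_state == GOAL


def q_learning(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Learn a table q[state][action] of estimated action values."""
    rng = random.Random(seed)
    q = [[0.0] * N_ACTIONS for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Epsilon-greedy behaviour policy; ties are broken at random.
            if rng.random() < epsilon:
                action = rng.randrange(N_ACTIONS)
            else:
                best = max(q[state])
                action = rng.choice(
                    [a for a in range(N_ACTIONS) if q[state][a] == best]
                )
            next_state, reward, done = step(state, action)
            # Q-learning update: bootstrap from the best next-state action,
            # with no bootstrap term once the episode has terminated.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """The learned policy: in each state, take the highest-valued action."""
    return [max(range(N_ACTIONS), key=lambda a: row[a]) for row in q]
```

Calling `greedy_policy(q_learning())` gives one action index per state. In this toy setting the non-terminal states should all prefer action 1 (move right, toward the goal).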

(Image credit: Playing Atari with Deep Reinforcement Learning)

Papers

Title	Date	Tasks	Status	Hype
Active Deep Q-learning with Demonstration	Dec 6, 2018	Q-Learning, reinforcement-learning	Unverified	0
Revisiting the Softmax Bellman Operator: New Benefits and New Perspective	Dec 2, 2018	Atari Games, Q-Learning	Code Available	0
Non-delusional Q-learning and value-iteration	Dec 1, 2018	Q-Learning	Unverified	0
Urban Driving with Multi-Objective Deep Reinforcement Learning	Nov 21, 2018	Autonomous Driving, Deep Reinforcement Learning	Code Available	0
Switch-based Active Deep Dyna-Q: Efficient Adaptive Planning for Task-Completion Dialogue Policy Learning	Nov 19, 2018	Active Learning, Q-Learning	Code Available	0
Reinforcement Learning with A* and a Deep Heuristic	Nov 19, 2018	Q-Learning, reinforcement-learning	Code Available	0
Emergence of Addictive Behaviors in Reinforcement Learning Agents	Nov 14, 2018	Q-Learning, reinforcement-learning	Unverified	0
Deep Q learning for fooling neural networks	Nov 13, 2018	Q-Learning, Reinforcement Learning	Code Available	0
Managing App Install Ad Campaigns in RTB: A Q-Learning Approach	Nov 11, 2018	Q-Learning	Unverified	0
Deep Reinforcement Learning via L-BFGS Optimization	Nov 6, 2018	Atari Games, Deep Reinforcement Learning	Unverified	0
Deep Reinforcement Learning for Green Security Games with Real-Time Information	Nov 6, 2018	Deep Reinforcement Learning, Q-Learning	Unverified	0
Reinforcement Learning based Dynamic Model Selection for Short-Term Load Forecasting	Nov 5, 2018	BIG-bench Machine Learning, Load Forecasting	Unverified	0
Double Q-PID algorithm for mobile robot control	Nov 1, 2018	Active Learning, Q-Learning	Code Available	0
Approximate Dynamic Oracle for Dependency Parsing with Reinforcement Learning	Nov 1, 2018	Dependency Parsing, Imitation Learning	Unverified	0
Structure Learning of Deep Neural Networks with Q-Learning	Oct 31, 2018	image-classification, Image Classification	Unverified	0
Distributive Dynamic Spectrum Access through Deep Reinforcement Learning: A Reservoir Computing Based Approach	Oct 28, 2018	BIG-bench Machine Learning, Deep Reinforcement Learning	Unverified	0
Multi-Agent Reinforcement Learning Based Resource Allocation for UAV Networks	Oct 24, 2018	Multi-agent Reinforcement Learning, Q-Learning	Unverified	0
Learning Negotiating Behavior Between Cars in Intersections using Deep Q-Learning	Oct 24, 2018	Q-Learning	Unverified	0
Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement	Oct 22, 2018	Policy Gradient Methods, Q-Learning	Code Available	0
Optimization of Molecules via Deep Reinforcement Learning	Oct 19, 2018	Deep Reinforcement Learning, Molecular Graph Generation	Code Available	1
Finding the best design parameters for optical nanostructures using reinforcement learning	Oct 18, 2018	BIG-bench Machine Learning, Q-Learning	Unverified	0
Assessing the Potential of Classical Q-learning in General Game Playing	Oct 14, 2018	Board Games, Deep Reinforcement Learning	Code Available	0
Learning to Sketch with Deep Q Networks and Demonstrated Strokes	Oct 14, 2018	Q-Learning	Unverified	0
Learning to Reason	Oct 12, 2018	Automated Theorem Proving, Q-Learning	Unverified	0
Reinforcement Evolutionary Learning Method for self-learning	Oct 7, 2018	Incremental Learning, Marketing	Unverified	0
Scaling All-Goals Updates in Reinforcement Learning Using Convolutional Neural Networks	Oct 6, 2018	All, Montezuma's Revenge	Code Available	0
Deep Quality-Value (DQV) Learning	Sep 30, 2018	Atari Games, Deep Reinforcement Learning	Code Available	0
Reinforcement Learning in R	Sep 29, 2018	Q-Learning, reinforcement-learning	Unverified	0
Hybrid Policies Using Inverse Rewards for Reinforcement Learning	Sep 27, 2018	OpenAI Gym, Q-Learning	Unverified	0
What Would pi* Do?: Imitation Learning via Off-Policy Reinforcement Learning	Sep 27, 2018	Imitation Learning, Q-Learning	Unverified	0
Accelerated Value Iteration via Anderson Mixing	Sep 27, 2018	Atari Games, Q-Learning	Unverified	0
Convergent Reinforcement Learning with Function Approximation: A Bilevel Optimization Perspective	Sep 27, 2018	Bilevel Optimization, Q-Learning	Unverified	0
A Convergent Variant of the Boltzmann Softmax Operator in Reinforcement Learning	Sep 27, 2018	Atari Games, Q-Learning	Unverified	0
The wisdom of the crowd: reliable deep reinforcement learning through ensembles of Q-functions	Sep 27, 2018	Deep Reinforcement Learning, Policy Gradient Methods	Unverified	0
Learning through Probing: a decentralized reinforcement learning architecture for social dilemmas	Sep 26, 2018	Deep Reinforcement Learning, Multi-agent Reinforcement Learning	Unverified	0
Floyd-Warshall Reinforcement Learning: Learning from Past Experiences to Reach New Goals	Sep 25, 2018	Q-Learning, reinforcement-learning	Unverified	0
Target Transfer Q-Learning and Its Convergence Analysis	Sep 21, 2018	Q-Learning, Reinforcement Learning	Unverified	0
Model-Free Adaptive Optimal Control of Episodic Fixed-Horizon Manufacturing Processes using Reinforcement Learning	Sep 18, 2018	Model Predictive Control, Q-Learning	Code Available	0
Hidden Markov Model Estimation-Based Q-learning for Partially Observable Markov Decision Process	Sep 17, 2018	Q-Learning	Unverified	0
Optimal Matrix Momentum Stochastic Approximation and Applications to Q-learning	Sep 17, 2018	Q-Learning, Reinforcement Learning	Unverified	0
Deterministic Implementations for Reproducibility in Deep Reinforcement Learning	Sep 15, 2018	Deep Reinforcement Learning, Q-Learning	Code Available	0
Sampled Policy Gradient for Learning to Play the Game Agar.io	Sep 15, 2018	Game Design, Q-Learning	Code Available	0
Towards Better Interpretability in Deep Q-Networks	Sep 15, 2018	Deep Reinforcement Learning, Q-Learning	Code Available	0
Negative Update Intervals in Deep Multi-Agent Reinforcement Learning	Sep 13, 2018	Multi-agent Reinforcement Learning, Q-Learning	Code Available	1
Directed Exploration in PAC Model-Free Reinforcement Learning	Aug 31, 2018	Efficient Exploration, model	Unverified	0
MARL-FWC: Optimal Coordination of Freeway Traffic Control Measures	Aug 27, 2018	Multi-agent Reinforcement Learning, Q-Learning	Unverified	0
BlockQNN: Efficient Block-wise Neural Network Architecture Generation	Aug 16, 2018	GPU, image-classification	Code Available	0
Automatic Derivation Of Formulas Using Reforcement Learning	Aug 15, 2018	Q-Learning	Unverified	0
A Framework for Automated Cellular Network Tuning with Reinforcement Learning	Aug 13, 2018	Management, Q-Learning	Code Available	0
Multi-Agent Deep Reinforcement Learning for Dynamic Power Allocation in Wireless Networks	Aug 1, 2018	Deep Reinforcement Learning, Q-Learning	Code Available	0