SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 13811390 of 1918 papers

TitleStatusHype
Two Timescale Convergent Q-learning for Sleep--Scheduling in Wireless Sensor Networks0
Two-Timescale Networks for Nonlinear Value Function Approximation0
Two-Timescale Q-Learning with Function Approximation in Zero-Sum Stochastic Games0
UAV Aided Search and Rescue Operation Using Reinforcement Learning0
UAV-Assisted Space-Air-Ground Integrated Networks: A Technical Review of Recent Learning Algorithms0
UAV Base Station Trajectory Optimization Based on Reinforcement Learning in Post-disaster Search and Rescue Operations0
UAV Swarm Deployment and Trajectory for 3D Area Coverage via Reinforcement Learning0
UCB Exploration via Q-Ensembles0
Unbiased Methods for Multi-Goal Reinforcement Learning0
Uncertainty Weighted Offline Reinforcement Learning0
Show:102550
← PrevPage 139 of 192Next →

No leaderboard results yet.