SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

(Image credit: Playing Atari with Deep Reinforcement Learning)
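As a rough illustration of what learning such a policy can look like, here is a minimal tabular Q-learning sketch. The five-state corridor environment, the hyperparameters, and the names `step`, `train`, and `greedy_policy` are all assumptions made for this example; none of them come from the listed papers.

```python
import random

N_STATES = 5      # hypothetical corridor: states 0..4, goal at state 4
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def step(state, action):
    """Toy deterministic environment (an assumption for illustration)."""
    if action == 0:
        next_state = max(0, state - 1)
    else:
        next_state = min(N_STATES - 1, state + 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Learn a Q-table with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Explore with probability epsilon, or when both actions tie.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.choice(ACTIONS)
            else:
                action = max(ACTIONS, key=lambda a: q[state][a])
            next_state, reward, done = step(state, action)
            # Q-learning target: reward plus discounted best next value.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Read the policy off the table: best-valued action in each state."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES)]


q_table = train()
policy = greedy_policy(q_table)
```

Here `policy[s]` is the action the agent would take in state `s`; in this toy setting the non-terminal states should end up preferring the move toward the goal.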

Papers

Showing 1351-1400 of 1918 papers

Title | Status | Hype
The reinforcement learning-based multi-agent cooperative approach for the adaptive speed regulation on a metallurgical pickling line | | 0
The Sample-Communication Complexity Trade-off in Federated Q-Learning | | 0
The Sample Complexity of Teaching-by-Reinforcement on Q-Learning | | 0
The tree reconstruction game: phylogenetic reconstruction using reinforcement learning | | 0
The Value of Chess Squares | | 0
The wisdom of the crowd: reliable deep reinforcement learning through ensembles of Q-functions | | 0
Throughput and Latency in the Distributed Q-Learning Random Access mMTC Networks | | 0
Is Q-Learning Minimax Optimal? A Tight Sample Complexity Analysis | | 0
Time-Scale Separation in Q-Learning: Extending TD(λ) for Action-Value Function Decomposition | | 0
Towards a Deep Reinforcement Learning Approach for Tower Line Wars | | 0
A step toward a reinforcement learning de novo genome assembler | | 0
Towards Characterizing Divergence in Deep Q-Learning | | 0
Towards Learning to Speak and Hear Through Multi-Agent Communication over a Continuous Acoustic Channel | | 0
Towards Resilience for Multi-Agent QD-Learning | | 0
Towards Real-World Applications of Personalized Anesthesia Using Policy Constraint Q Learning for Propofol Infusion Control | | 0
Towards Secure and Efficient Data Scheduling for Vehicular Social Networks | | 0
Autonomous Airline Revenue Management: A Deep Reinforcement Learning Approach to Seat Inventory Control and Overbooking | | 0
Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization | | 0
Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning | | 0
Towards Unknown-aware Deep Q-Learning | | 0
Toward Synergic Learning for Autonomous Manipulation of Deformable Tissues via Surgical Robots: An Approximate Q-Learning Approach | | 0
Trading the Twitter Sentiment with Reinforcement Learning | | 0
Traffic Signal Control and Speed Offset Coordination Using Q-Learning for Arterial Road Networks | | 0
Transfer Learning in Multi-Agent Reinforcement Learning with Double Q-Networks for Distributed Resource Sharing in V2X Communication | | 0
Transferred Q-learning | | 0
Transfer Reinforcement Learning under Unobserved Contextual Information | | 0
Tuning Path Tracking Controllers for Autonomous Cars Using Reinforcement Learning | | 0
Two Phase Q-learning for Bidding-based Vehicle Sharing | | 0
Two-stage WECC Composite Load Modeling: A Double Deep Q-Learning Networks Approach | | 0
Two-Step Q-Learning | | 0
Two Timescale Convergent Q-learning for Sleep-Scheduling in Wireless Sensor Networks | | 0
Two-Timescale Networks for Nonlinear Value Function Approximation | | 0
Two-Timescale Q-Learning with Function Approximation in Zero-Sum Stochastic Games | | 0
UAV Aided Search and Rescue Operation Using Reinforcement Learning | | 0
UAV-Assisted Space-Air-Ground Integrated Networks: A Technical Review of Recent Learning Algorithms | | 0
UAV Base Station Trajectory Optimization Based on Reinforcement Learning in Post-disaster Search and Rescue Operations | | 0
UAV Swarm Deployment and Trajectory for 3D Area Coverage via Reinforcement Learning | | 0
UCB Exploration via Q-Ensembles | | 0
Unbiased Methods for Multi-Goal Reinforcement Learning | | 0
Uncertainty Weighted Offline Reinforcement Learning | | 0
Understanding Hindsight Goal Relabeling from a Divergence Minimization Perspective | | 0
Understanding Reinforcement Learning Algorithms: The Progress from Basic Q-learning to Proximal Policy Optimization | | 0
Understanding the theoretical properties of projected Bellman equation, linear Q-learning, and approximate value iteration | | 0
Unified continuous-time q-learning for mean-field game and mean-field control problems | | 0
Unified ODE Analysis of Smooth Q-Learning Algorithms | | 0
Unified Reinforcement Q-Learning for Mean Field Game and Control Problems | | 0
Unifying Ensemble Methods for Q-learning via Social Choice Theory | | 0
Unifying Top-down and Bottom-up for Recurrent Visual Attention | | 0
Universal Approximation Theorem for Deep Q-Learning via FBSDE System | | 0
Universal Approximation Theorem of Deep Q-Networks | | 0
Page 28 of 39

No leaderboard results yet.