
Q-Learning

Q-learning is a model-free reinforcement learning algorithm. Its goal is to learn a policy that tells an agent which action to take in each state.

(Image credit: Playing Atari with Deep Reinforcement Learning)
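
As a rough illustration of the idea described above, here is a minimal sketch of tabular Q-learning with an epsilon-greedy behaviour policy. The environment is a hypothetical five-state chain invented for this example (it is not taken from any paper listed here), and the function names `q_learning` and `greedy_policy` as well as all hyperparameter values are assumptions chosen for readability.

```python
import random


def q_learning(n_states=5, n_actions=2, episodes=500,
               alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D chain (illustrative only).

    States are 0..n_states-1. Action 0 moves left, action 1 moves right.
    Entering the last state yields reward 1 and ends the episode.
    """
    rng = random.Random(seed)
    q = [[0.0] * n_actions for _ in range(n_states)]
    goal = n_states - 1

    for _ in range(episodes):
        state = 0
        for _ in range(100):  # cap the episode length
            # Epsilon-greedy action selection, breaking ties at random.
            if rng.random() < epsilon:
                action = rng.randrange(n_actions)
            else:
                best = max(q[state])
                action = rng.choice(
                    [a for a in range(n_actions) if q[state][a] == best])

            # Assumed chain dynamics: walls at both ends.
            if action == 0:
                next_state = max(0, state - 1)
            else:
                next_state = min(goal, state + 1)
            done = next_state == goal
            reward = 1.0 if done else 0.0

            # Q-learning update: bootstrap from the best next action.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])

            state = next_state
            if done:
                break
    return q


def greedy_policy(q):
    """Return the index of the highest-valued action for each state."""
    return [max(range(len(row)), key=row.__getitem__) for row in q]
```

Calling `greedy_policy(q_learning())` returns one action index per state. In this toy chain the learned policy moves right from every non-terminal state; the entry for the terminal state is never updated and carries no meaning.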

Papers

Showing 1301–1325 of 1918 papers

| Title | Status | Hype |
| --- | --- | --- |
| Runtime Adaptation in Wireless Sensor Nodes Using Structured Learning | | 0 |
| Self-Imitation Learning via Generalized Lower Bound Q-learning | | 0 |
| Safety-guaranteed Reinforcement Learning based on Multi-class Support Vector Machine | | 0 |
| Human and Multi-Agent collaboration in a human-MARL teaming framework | | 0 |
| Deep Reinforcement Learning for Neural Control | | 0 |
| Decorrelated Double Q-learning | | 0 |
| Exploration by Maximizing Rényi Entropy for Reward-Free RL Framework | | 0 |
| Zeroth-Order Supervised Policy Improvement | | 0 |
| Q-greedyUCB: a New Exploration Policy for Adaptive and Resource-efficient Scheduling | | 0 |
| Privacy-Cost Management in Smart Meters with Mutual Information-Based Reinforcement Learning | | 0 |
| Multi-Agent Reinforcement Learning in a Realistic Limit Order Book Market Simulation | | 0 |
| Model-Free Algorithm and Regret Analysis for MDPs with Long-Term Constraints | | 0 |
| Fitted Q-Learning for Relational Domains | | 0 |
| Self-Supervised Reinforcement Learning for Recommender Systems | | 0 |
| Reinforcement Learning-Based Joint Self-Optimisation Method for the Fuzzy Logic Handover Algorithm in 5G HetNets | | 0 |
| Balancing a CartPole System with Reinforcement Learning -- A Tutorial | | 0 |
| A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret | | 0 |
| Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory | | 0 |
| Conservative Q-Learning for Offline Reinforcement Learning | Code | 1 |
| A Multi-step and Resilient Predictive Q-learning Algorithm for IoT with Human Operators in the Loop: A Case Study in Water Supply Networks | | 0 |
| Logical Team Q-learning: An approach towards factored policies in cooperative MARL | | 0 |
| Sample Complexity of Asynchronous Q-Learning: Sharper Analysis and Variance Reduction | | 0 |
| A Novel Update Mechanism for Q-Networks Based On Extreme Learning Machines | Code | 0 |
| Multi-Agent Determinantal Q-Learning | Code | 1 |
| Mitigating Bias in Face Recognition Using Skewness-Aware Reinforcement Learning | | 0 |
Page 53 of 77

No leaderboard results yet.