SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 426450 of 1918 papers

TitleStatusHype
A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation0
Breaking the Sample Complexity Barrier to Regret-Optimal Model-Free Reinforcement Learning0
Coverage Analysis of Multi-Environment Q-Learning Algorithms for Wireless Network Optimization0
Coverage-aware and Reinforcement Learning Using Multi-agent Approach for HD Map QoS in a Realistic Environment0
Credit Assignment: Challenges and Opportunities in Developing Human-like AI Agents0
Credit-cognisant reinforcement learning for multi-agent cooperation0
Criticality-Based Varying Step-Number Algorithm for Reinforcement Learning0
Cross Learning in Deep Q-Networks0
A Reinforcement Learning Approach to Parameter Selection for Distributed Optimal Power Flow0
Curriculum Q-Learning for Visual Vocabulary Acquisition0
Cycles and collusion in congestion games under Q-learning0
A reinforcement learning approach to improve communication performance and energy utilization in fog-based IoT0
DASA: Delay-Adaptive Multi-Agent Stochastic Approximation0
Data-Based Efficient Off-Policy Stabilizing Optimal Control Algorithms for Discrete-Time Linear Systems via Damping Coefficients0
Data-Driven H-infinity Control with a Real-Time and Efficient Reinforcement Learning Algorithm: An Application to Autonomous Mobility-on-Demand Systems0
Data-driven inventory management for new products: An adjusted Dyna-Q approach with transfer learning0
Data-Driven Knowledge Transfer in Batch Q^* Learning0
Data-efficient Deep Reinforcement Learning for Dexterous Manipulation0
Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control0
Data-Efficient Quadratic Q-Learning Using LMIs0
DDPG based on multi-scale strokes for financial time series trading strategy0
Breaking the Deadly Triad with a Target Network0
DECAF: Learning to be Fair in Multi-agent Resource Allocation0
Decentralised Q-Learning for Multi-Agent Markov Decision Processes with a Satisfiability Criterion0
An Attempt to Model Human Trust with Reinforcement Learning0
Show:102550
← PrevPage 18 of 77Next →

No leaderboard results yet.