SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 18011825 of 1918 papers

TitleStatusHype
Data-Driven Knowledge Transfer in Batch Q^* Learning0
Data-efficient Deep Reinforcement Learning for Dexterous Manipulation0
Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control0
Data-Efficient Quadratic Q-Learning Using LMIs0
DDPG based on multi-scale strokes for financial time series trading strategy0
DDPG Learning for Aerial RIS-Assisted MU-MISO Communications0
DECAF: Learning to be Fair in Multi-agent Resource Allocation0
Decentralised Q-Learning for Multi-Agent Markov Decision Processes with a Satisfiability Criterion0
Decentralized Cooperative Multi-Agent Reinforcement Learning with Exploration0
On Improving Model-Free Algorithms for Decentralized Multi-Agent Reinforcement Learning0
Decentralized Microgrid Energy Management: A Multi-agent Correlated Q-learning Approach0
Decentralized model-free reinforcement learning in stochastic games with average-reward objective0
Decentralized Multi-Agent Reinforcement Learning: An Off-Policy Method0
Decentralized Multi-Robot Formation Control Using Reinforcement Learning0
Decentralized Q-Learning for Stochastic Teams and Games0
Decentralized Q-Learning in Zero-sum Markov Games0
Decentralized Semantic Traffic Control in AVs Using RL and DQN for Dynamic Roadblocks0
Deceptive Reinforcement Learning Under Adversarial Manipulations on Cost Signals0
Decision-making at Unsignalized Intersection for Autonomous Vehicles: Left-turn Maneuver with Deep Reinforcement Learning0
Decoding surface codes with deep reinforcement learning and probabilistic policy reuse0
Decoding trust: A reinforcement learning perspective0
Decorrelated Double Q-learning0
DeepCQ+: Robust and Scalable Routing with Multi-Agent Deep Reinforcement Learning for Highly Dynamic Networks0
Deep-Dispatch: A Deep Reinforcement Learning-Based Vehicle Dispatch Algorithm for Advanced Air Mobility0
Deep Episodic Value Iteration for Model-based Meta-Reinforcement Learning0
Show:102550
← PrevPage 73 of 77Next →

No leaderboard results yet.