SOTAVerified

Q-Learning

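Q-learning is a model-free reinforcement learning algorithm. Its goal is to learn a policy that tells an agent which action to take in each state, which it does by estimating the value Q(s, a) of taking action a in state s.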

(Image credit: Playing Atari with Deep Reinforcement Learning)
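
To make the idea of learning a policy concrete, here is a minimal sketch of tabular Q-learning. The environment is an assumption made up for illustration and does not come from any paper listed on this page: a hypothetical 1-D chain of five states in which moving right from the second-to-last state reaches the goal and earns a reward of 1. The function names and hyperparameters (`alpha`, `gamma`, `epsilon`) are likewise illustrative choices.

```python
import random


def train_q_table(n_states=5, episodes=500, alpha=0.5, gamma=0.9,
                  epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D chain environment.

    States are 0..n_states-1; actions are 0 (left) and 1 (right).
    Entering the last state yields reward 1 and ends the episode.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]  # q[state][action]
    goal = n_states - 1

    for _ in range(episodes):
        state = 0
        for _ in range(100):  # step cap so early random episodes terminate
            # Epsilon-greedy action selection, with random tie-breaking.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1

            # Deterministic transition; moving left from state 0 stays put.
            next_state = max(0, state - 1) if action == 0 else state + 1
            reward = 1.0 if next_state == goal else 0.0

            # Q-learning update: move Q(s, a) toward r + gamma * max_a' Q(s', a').
            # The goal is terminal, so it contributes no future value.
            future = 0.0 if next_state == goal else gamma * max(q[next_state])
            q[state][action] += alpha * (reward + future - q[state][action])

            state = next_state
            if state == goal:
                break
    return q


def greedy_policy(q):
    """Return the highest-valued action index for each state."""
    return [max(range(len(row)), key=row.__getitem__) for row in q]
```

Calling `greedy_policy(train_q_table())` returns one action per state. In this toy setting the learned policy for every non-terminal state should be action 1 (move right), since that is the shortest route to the reward.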

Papers

Showing 1076–1100 of 1918 papers

Title | Status | Hype
Learning-Based Strategy Design for Robot-Assisted Reminiscence Therapy Based on a Developed Model for People with Dementia | | 0
Event-Based Communication in Distributed Q-Learning | | 0
Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation | Code | 0
Deep Reinforcement Learning for Dynamic Band Switch in Cellular-Connected UAV | | 0
DQLEL: Deep Q-Learning for Energy-Optimized LoS/NLoS UWB Node Selection | | 0
An Independent Study of Reinforcement Learning and Autonomous Driving | | 0
DQ-GAT: Towards Safe and Efficient Autonomous Driving with Deep Q-Learning and Graph Attention Networks | | 0
Maximizing Influence with Graph Neural Networks | | 0
Modified Double DQN: addressing stability | | 0
An Elementary Proof that Q-learning Converges Almost Surely | | 0
Offline Decentralized Multi-Agent Reinforcement Learning | | 0
SABER: Data-Driven Motion Planner for Autonomously Navigating Heterogeneous Robots | Code | 0
A DQN-based Approach to Finding Precise Evidences for Fact Verification | Code | 0
Value-Based Reinforcement Learning for Continuous Control Robotic Manipulation in Multi-Task Sparse Reward Settings | | 0
A Distributed Intelligence Architecture for B5G Network Automation | | 0
Double Deep Q-learning Based Real-Time Optimization Strategy for Microgrids | | 0
Integrating Deep Learning and Augmented Reality to Enhance Situational Awareness in Firefighting Environments | | 0
Constraints Penalized Q-learning for Safe Offline Reinforcement Learning | | 0
Q-SMASH: Q-Learning-based Self-Adaptation of Human-Centered Internet of Things | | 0
Transfer Learning in Multi-Agent Reinforcement Learning with Double Q-Networks for Distributed Resource Sharing in V2X Communication | | 0
A Penalized Shared-parameter Algorithm for Estimating Optimal Dynamic Treatment Regimens | | 0
Reinforced Hybrid Genetic Algorithm for the Traveling Salesman Problem | | 0
Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning | Code | 0
Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement Learning | Code | 0
The Least Restriction for Offline Reinforcement Learning | | 0

No leaderboard results yet.