SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 9511000 of 1918 papers

TitleStatusHype
Dropout Q-Functions for Doubly Efficient Reinforcement LearningCode1
Uncertainty-Based Offline Reinforcement Learning with Diversified Q-EnsembleCode1
A Modified Q-Learning Algorithm for Rate-Profiling of Polarization Adjusted Convolutional (PAC) Codes0
Cellular traffic offloading via Opportunistic Networking with Reinforcement Learning0
Learning the Markov Decision Process in the Sparse Gaussian EliminationCode1
Learning Explicit Credit Assignment for Multi-agent Joint Q-learning0
Polyphonic Music Composition: An Adversarial Inverse Reinforcement Learning Approach0
Q-Learning Scheduler for Multi-Task Learning through the use of Histogram of Task Uncertainty0
Unifying Top-down and Bottom-up for Recurrent Visual Attention0
Value Refinement Network (VRN)0
Continuous Deep Q-Learning in Optimal Control Problems: Normalized Advantage Functions Analysis0
An Attempt to Model Human Trust with Reinforcement Learning0
Robust and Data-efficient Q-learning by Composite Value-estimation0
^2-exploration for Reinforcement Learning0
Bootstrapped Hindsight Experience replay with Counterintuitive Prioritization0
Adaptive Q-learning for Interaction-Limited Reinforcement Learning0
Offline Reinforcement Learning with In-sample Q-LearningCode1
Decentralized Cooperative Multi-Agent Reinforcement Learning with Exploration0
Towards Unknown-aware Deep Q-Learning0
Q-learning for real time control of heterogeneous microagent collectives0
Convergent and Efficient Deep Q Learning Algorithm0
Density Estimation for Conservative Q-Learning0
Text Generation with Efficient (Soft) Q-Learning0
Untangling Braids with Multi-agent Q-Learning0
Online Robust Reinforcement Learning with Model Uncertainty0
Deep Reinforcement Q-Learning for Intelligent Traffic Signal Control with Partial DetectionCode1
On the Estimation Bias in Double Q-LearningCode0
Deep Reinforcement Learning with Adjustments0
Smart Home Energy Management: Sequence-to-Sequence Load Forecasting and Q-Learning0
Parameter-free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy GradientsCode0
Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic MethodsCode0
MEPG: A Minimalist Ensemble Policy Gradient Framework for Deep Reinforcement Learning0
Off-line approximate dynamic programming for the vehicle routing problem with a highly variable customer basis and stochastic demands0
Search For Deep Graph Neural Networks0
Greedy UnMixing for Q-Learning in Multi-Agent Reinforcement Learning0
Regularize! Don't Mix: Multi-Agent Reinforcement Learning without Explicit Centralized Structures0
Learning from Peers: Deep Transfer Reinforcement Learning for Joint Radio and Cache Resource Allocation in 5G RAN Slicing0
Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback0
Optimal Cycling of a Heterogenous Battery Bank via Reinforcement Learning0
Deep hierarchical reinforcement agents for automated penetration testing0
Bootstrapped Meta-LearningCode0
User Tampering in Reinforcement Learning Recommender Systems0
Deep Active Inference for Pixel-Based Discrete Control: Evaluation on the Car Racing ProblemCode0
Convergence of Batch Asynchronous Stochastic Approximation With Applications to Reinforcement Learning0
Deep SIMBAD: Active Landmark-based Self-localization Using Ranking -based Scene Descriptor0
Learning-Based Strategy Design for Robot-Assisted Reminiscence Therapy Based on a Developed Model for People with Dementia0
Event-Based Communication in Distributed Q-Learning0
Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge DistillationCode0
Deep Reinforcement Learning for Dynamic Band Switch in Cellular-Connected UAV0
DQLEL: Deep Q-Learning for Energy-Optimized LoS/NLoS UWB Node Selection0
Show:102550
← PrevPage 20 of 39Next →

No leaderboard results yet.