Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

(Image credit: Playing Atari with Deep Reinforcement Learning)
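
To make the idea of "learning a policy" concrete, the following is a minimal sketch of tabular Q-learning. It assumes a hypothetical five-state chain environment (move left or right, reward 1 for reaching the last state). The function names `train_q_table` and `greedy_policy`, the environment, and all hyperparameter values are illustrative assumptions and are not taken from any paper listed on this page.

```python
import random


def train_q_table(n_states=5, episodes=500, alpha=0.5, gamma=0.9,
                  epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D chain.

    States are 0..n_states-1; action 0 moves left, action 1 moves right.
    Entering the last state yields reward 1 and ends the episode.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]  # q[state][action]
    goal = n_states - 1

    for _ in range(episodes):
        state = 0
        for _ in range(100):  # cap on steps per episode
            # Epsilon-greedy action selection with random tie-breaking.
            if rng.random() < epsilon:
                action = rng.randrange(2)
            else:
                best = max(q[state])
                action = rng.choice([a for a in (0, 1) if q[state][a] == best])

            # Deterministic transition of the toy environment.
            next_state = max(0, state - 1) if action == 0 else state + 1
            done = next_state == goal
            reward = 1.0 if done else 0.0

            # Q-learning update: move Q(s, a) toward r + gamma * max_a' Q(s', a').
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])

            state = next_state
            if done:
                break
    return q


def greedy_policy(q):
    """Return, for each state, the action with the highest learned Q-value."""
    return [max(range(len(row)), key=row.__getitem__) for row in q]
```

Calling `greedy_policy(train_q_table())` gives one action per state; in this toy setting the learned policy should move right in every non-terminal state. The entry for the terminal state is never updated, so it carries no meaning.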

Papers

Showing 1051–1100 of 1918 papers

Title | Status | Hype
Smooth Q-learning: Accelerate Convergence of Q-learning Using Similarity |  | 0
Design and Comparison of Reward Functions in Reinforcement Learning for Energy Management of Sensor Nodes |  | 0
Energy-aware optimization of UAV base stations placement via decentralized multi-agent Q-learning |  | 0
A reinforcement learning approach to improve communication performance and energy utilization in fog-based IoT |  | 0
SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning | Code | 1
Sample-Efficient Reinforcement Learning for Linearly-Parameterized MDPs with a Generative Model |  | 0
Reputation Bootstrapping for Composite Services using CP-nets |  | 0
A Comparison of Reward Functions in Q-Learning Applied to a Cart Position Problem | Code | 0
Verification of Dissipativity and Evaluation of Storage Function in Economic Nonlinear MPC using Q-Learning |  | 0
Online Adaptive Optimal Control Algorithm Based on Synchronous Integral Reinforcement Learning With Explorations |  | 0
Deep Reinforcement Learning for Optimal Stopping with Application in Financial Engineering | Code | 0
Reinforcement Learning With Sparse-Executing Actions via Sparsity Regularization |  | 0
Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning | Code | 1
Learn to Intervene: An Adaptive Learning Policy for Restless Bandits in Application to Preventive Healthcare |  | 0
Efficient Off-Policy Q-Learning for Data-Based Discrete-Time LQR Problems |  | 0
Interpretable performance analysis towards offline reinforcement learning: A dataset perspective |  | 0
Fast constraint satisfaction problem and learning-based algorithm for solving Minesweeper |  | 0
Reinforcement Learning with Expert Trajectory For Quantitative Trading |  | 0
Survey on Multi-Agent Q-Learning frameworks for resource management in wireless sensor network |  | 0
HASCO: Towards Agile HArdware and Software CO-design for Tensor Computation | Code | 1
Robotic Surgery With Lean Reinforcement Learning | Code | 0
Action Candidate Based Clipped Double Q-learning for Discrete and Continuous Action Tasks | Code | 0
CARL-DTN: Context Adaptive Reinforcement Learning based Routing Algorithm in Delay Tolerant Network |  | 0
RP-DQN: An application of Q-Learning to Vehicle Routing Problems |  | 0
Model-aided Deep Reinforcement Learning for Sample-efficient UAV Trajectory Design in IoT Networks |  | 0
Reinforcement Learning for Traffic Signal Control: Comparison with Commercial Systems |  | 0
A Simulated Experiment to Explore Robotic Dialogue Strategies for People with Dementia |  | 0
Low-rank State-action Value-function Approximation | Code | 0
Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills |  | 0
Prospect-theoretic Q-learning |  | 0
Autoequivariant Network Search via Group Decomposition | Code | 0
Optimal Market Making by Reinforcement Learning | Code | 1
Towards Resilience for Multi-Agent QD-Learning |  | 0
Distributed Deep Reinforcement Learning for Collaborative Spectrum Sharing |  | 0
SOLO: Search Online, Learn Offline for Combinatorial Optimization Problems |  | 0
Federated Double Deep Q-learning for Joint Delay and Energy Minimization in IoT networks |  | 0
Convergence of Finite Memory Q-Learning for POMDPs and Near Optimality of Learned Policies under Filter Stability |  | 0
Variational quantum compiling with double Q-learning |  | 0
Regularized Softmax Deep Multi-Agent Q-Learning |  | 0
Reinforcement Learning based on Scenario-tree MPC for ASVs |  | 0
A Jointly Optimal Design of Control and Scheduling in Networked Systems under Denial-of-Service Attacks |  | 0
S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement Learning |  | 0
The Effect of Q-function Reuse on the Total Regret of Tabular, Model-Free, Reinforcement Learning |  | 0
Decentralized Microgrid Energy Management: A Multi-agent Correlated Q-learning Approach |  | 0
Correlated Deep Q-learning based Microgrid Energy Management |  | 0
UCB Momentum Q-learning: Correcting the bias without forgetting | Code | 0
Ensemble Bootstrapping for Q-Learning |  | 0
Potential Impacts of Smart Homes on Human Behavior: A Reinforcement Learning Approach |  | 0
Reinforcement learning approach for resource allocation in humanitarian logistics |  | 0
No-Regret Reinforcement Learning with Heavy-Tailed Rewards |  | 0
Page 22 of 39

No leaderboard results yet.