SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy that tells an agent which action to take in each state it encounters.

(Image credit: Playing Atari with Deep Reinforcement Learning)
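As a rough illustration of how such a policy can be learned, here is a minimal sketch of tabular Q-learning. The corridor environment, the function names `q_learning` and `greedy_policy`, and all hyperparameter values are assumptions made for this example; none of them come from the papers listed on this page.

```python
import random


def q_learning(n_states=5, episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D corridor (illustrative only).

    States are 0..n_states-1. Action 0 moves left, action 1 moves right.
    Entering the rightmost state yields reward 1 and ends the episode.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]  # q[state][action]
    goal = n_states - 1

    for _ in range(episodes):
        state = 0
        while state != goal:
            # Epsilon-greedy selection; ties are broken at random.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1

            # Deterministic transition; the left wall keeps the agent at state 0.
            next_state = max(0, state - 1) if action == 0 else state + 1
            reward = 1.0 if next_state == goal else 0.0

            # The terminal state contributes no future value.
            best_next = 0.0 if next_state == goal else max(q[next_state])

            # Q-learning update: move Q(s, a) toward r + gamma * max_a' Q(s', a').
            q[state][action] += alpha * (reward + gamma * best_next - q[state][action])
            state = next_state

    return q


def greedy_policy(q):
    """Return the highest-valued action index for each state."""
    return [max(range(len(row)), key=row.__getitem__) for row in q]


if __name__ == "__main__":
    table = q_learning()
    print(greedy_policy(table))
```

The learned policy is simply the greedy action in each row of the table. Deep Q-learning variants replace the table with a neural network but keep the same update target.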

Papers

Showing 1151-1200 of 1918 papers

| Title | Status | Hype |
| --- | --- | --- |
| SOLO: Search Online, Learn Offline for Combinatorial Optimization Problems | | 0 |
| Federated Double Deep Q-learning for Joint Delay and Energy Minimization in IoT networks | | 0 |
| Regularized Softmax Deep Multi-Agent Q-Learning | | 0 |
| Reinforcement Learning based on Scenario-tree MPC for ASVs | | 0 |
| Variational quantum compiling with double Q-learning | | 0 |
| Convergence of Finite Memory Q-Learning for POMDPs and Near Optimality of Learned Policies under Filter Stability | | 0 |
| S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement Learning | | 0 |
| A Jointly Optimal Design of Control and Scheduling in Networked Systems under Denial-of-Service Attacks | | 0 |
| The Effect of Q-function Reuse on the Total Regret of Tabular, Model-Free, Reinforcement Learning | | 0 |
| Decentralized Microgrid Energy Management: A Multi-agent Correlated Q-learning Approach | | 0 |
| Correlated Deep Q-learning based Microgrid Energy Management | | 0 |
| UCB Momentum Q-learning: Correcting the bias without forgetting | Code | 0 |
| Ensemble Bootstrapping for Q-Learning | | 0 |
| Potential Impacts of Smart Homes on Human Behavior: A Reinforcement Learning Approach | | 0 |
| Reinforcement learning approach for resource allocation in humanitarian logistics | | 0 |
| No-Regret Reinforcement Learning with Heavy-Tailed Rewards | | 0 |
| Balancing Rational and Other-Regarding Preferences in Cooperative-Competitive Environments | Code | 0 |
| Sequential Learning-based IaaS Composition | | 0 |
| Greedy-Step Off-Policy Reinforcement Learning | | 0 |
| Understanding algorithmic collusion with experience replay | Code | 0 |
| A Discrete-Time Switching System Analysis of Q-learning | | 0 |
| Cooperation and Reputation Dynamics with Reinforcement Learning | | 0 |
| Reversible Action Design for Combinatorial Optimization with Reinforcement Learning | | 0 |
| Is Q-Learning Minimax Optimal? A Tight Sample Complexity Analysis | | 0 |
| Hedging of Financial Derivative Contracts via Monte Carlo Tree Search | | 0 |
| Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent States | | 0 |
| Model-Augmented Q-learning | | 0 |
| Revisiting Prioritized Experience Replay: A Value Perspective | Code | 0 |
| Experience-Based Heuristic Search: Robust Motion Planning with Deep Q-Learning | | 0 |
| A review of motion planning algorithms for intelligent robotics | | 0 |
| Deep reinforcement learning-based image classification achieves perfect testing set accuracy for MRI brain tumors with a training set of only 30 images | | 0 |
| A step toward a reinforcement learning de novo genome assembler | | 0 |
| A Lyapunov Theory for Finite-Sample Guarantees of Asynchronous Q-Learning and TD-Learning Variants | | 0 |
| QoS-Aware Power Minimization of Distributed Many-Core Servers using Transfer Q-Learning | | 0 |
| Variation-resistant Q-learning: Controlling and Utilizing Estimation Bias in Reinforcement Learning for Better Performance | Code | 0 |
| CoordiQ : Coordinated Q-learning for Electric Vehicle Charging Recommendation | | 0 |
| Reinforcement Learning based Per-antenna Discrete Power Control for Massive MIMO Systems | | 0 |
| Reinforcement Learning Assisted Beamforming for Inter-cell Interference Mitigation in 5G Massive MIMO Networks | | 0 |
| Robust Android Malware Detection System against Adversarial Attacks using Q-Learning | | 0 |
| Channel Estimation via Successive Denoising in MIMO OFDM Systems: A Reinforcement Learning Approach | | 0 |
| Solving optimal stopping problems with Deep Q-Learning | | 0 |
| Fire Threat Detection From Videos with Q-Rough Sets | | 0 |
| Breaking the Deadly Triad with a Target Network | | 0 |
| Reinforcement learning based recommender systems: A survey | | 0 |
| Continuous Deep Q-Learning with Simulator for Stabilization of Uncertain Discrete-Time Systems | Code | 0 |
| Learning Augmented Index Policy for Optimal Service Placement at the Network Edge | | 0 |
| Robust and Scalable Routing with Multi-Agent Deep Reinforcement Learning for MANETs | | 0 |
| Safe Coupled Deep Q-Learning for Recommendation Systems | | 0 |
| Preventing Value Function Collapse in Ensemble Q-Learning by Maximizing Representation Diversity | | 0 |
| Success-Rate Targeted Reinforcement Learning by Disorientation Penalty | | 0 |
Page 24 of 39

No leaderboard results yet.