SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 13511400 of 1918 papers

TitleStatusHype
Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies0
Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes0
Offline Reinforcement Learning and Sequence Modeling for Downlink Link Adaptation0
Offline Reinforcement Learning for Wireless Network Optimization with Mixture Datasets0
Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient0
Offline Reinforcement Learning with Imbalanced Datasets0
Offline RL With Realistic Datasets: Heteroskedasticity and Support Constraints0
Offline Robot Reinforcement Learning with Uncertainty-Guided Human Expert Sampling0
Composite Q-learning: Multi-scale Q-function Decomposition and Separable Optimization0
Off-policy Multi-step Q-learning0
On Assessing The Safety of Reinforcement Learning algorithms Using Formal Methods0
On Bellman's principle of optimality and Reinforcement learning for safety-constrained Markov decision process0
On-board Deep Q-Network for UAV-assisted Online Power Transfer and Data Collection0
On Convergence of Average-Reward Off-Policy Control Algorithms in Weakly Communicating MDPs0
On Convergence of Average-Reward Q-Learning in Weakly Communicating Markov Decision Processes0
On Decentralizing Federated Reinforcement Learning in Multi-Robot Scenarios0
On Designing Multi-UAV aided Wireless Powered Dynamic Communication via Hierarchical Deep Reinforcement Learning0
On Global Convergence Rates for Federated Policy Gradient under Heterogeneous Environment0
On Information Asymmetry in Competitive Multi-Agent Reinforcement Learning: Convergence and Optimality0
Online Adaptive Optimal Control Algorithm Based on Synchronous Integral Reinforcement Learning With Explorations0
Online Antenna Tuning in Heterogeneous Cellular Networks with Deep Reinforcement Learning0
On-line Building Energy Optimization using Deep Reinforcement Learning0
Online Frequency Scheduling by Learning Parallel Actions0
Online inductive learning from answer sets for efficient reinforcement learning exploration0
Online Learning for Offloading and Autoscaling in Energy Harvesting Mobile Edge Computing0
Online Robust Reinforcement Learning with Model Uncertainty0
Online Statistical Inference for Nonlinear Stochastic Approximation with Markovian Data0
Asymptotic Analysis of Sample-averaged Q-learning0
Online Target Q-learning with Reverse Experience Replay: Efficiently finding the Optimal Policy for Linear MDPs0
Online Transfer Learning in Reinforcement Learning Domains0
Online waveform selection for cognitive radar0
On optimal tracking portfolio in incomplete markets: The reinforcement learning approach0
On Practical Robust Reinforcement Learning: Practical Uncertainty Set and Double-Agent Algorithm0
On the Convergence and Sample Complexity Analysis of Deep Q-Networks with ε-Greedy Exploration0
On the Convergence of Approximate and Regularized Policy Iteration Schemes0
On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs0
On the Convergence Rates of Federated Q-Learning across Heterogeneous Environments0
On the Global Convergence of Fitted Q-Iteration with Two-layer Neural Network Parametrization0
On the Reduction of Variance and Overestimation of Deep Q-Learning0
OPA-Pack: Object-Property-Aware Robotic Bin Packing0
OpenSense: An Open-World Sensing Framework for Incremental Learning and Dynamic Sensor Scheduling on Embedded Edge Devices0
Operator Deep Q-Learning: Zero-Shot Reward Transferring in Reinforcement Learning0
Optimal and Fast Real-time Resources Slicing with Deep Dueling Neural Networks0
Optimal Beam Association for High Mobility mmWave Vehicular Networks: Lightweight Parallel Reinforcement Learning Approach0
Optimal Control of District Cooling Energy Plant with Reinforcement Learning and MPC0
Optimal coordination of resources: A solution from reinforcement learning0
Optimal Cycling of a Heterogenous Battery Bank via Reinforcement Learning0
Optimal Decision-Making in Mixed-Agent Partially Observable Stochastic Environments via Reinforcement Learning0
Optimal Demand Response Using Device Based Reinforcement Learning0
Optimal Design and Implementation of an Open-source Emulation Platform for User-Centric Shared E-mobility Services0
Show:102550
← PrevPage 28 of 39Next →

No leaderboard results yet.