SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 551600 of 1918 papers

TitleStatusHype
DQLAP: Deep Q-Learning Recommender Algorithm with Update Policy for a Real Steam Turbine System0
β-DQN: Improving Deep Q-Learning By Evolving the Behavior0
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems0
A Modified Q-Learning Algorithm for Rate-Profiling of Polarization Adjusted Convolutional (PAC) Codes0
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems0
Bayesian Risk-Averse Q-Learning with Streaming Observations0
A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret0
A Deep Learning Inference Scheme Based on Pipelined Matrix Multiplication Acceleration Design and Non-uniform Quantization0
Bayesian Q-learning With Imperfect Expert Demonstrations0
A Maintenance Planning Framework using Online and Offline Deep Reinforcement Learning0
Pretrain Soft Q-Learning with Imperfect Demonstrations0
BCQQ: Batch-Constraint Quantum Q-Learning with Cyclic Data Re-uploading0
Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement Learning: An In Silico Validation0
A General Markov Decision Process Framework for Directly Learning Optimal Control Policies0
Double Q-learning: New Analysis and Sharper Finite-time Bound0
D-Point Trigonometric Path Planning based on Q-Learning in Uncertain Environments0
DQLEL: Deep Q-Learning for Energy-Optimized LoS/NLoS UWB Node Selection0
Dynamic Decision Making in Engineering System Design: A Deep Q-Learning Approach0
Design of Artificial Intelligence Agents for Games using Deep Reinforcement Learning0
Designing Rewards for Fast Learning0
Bandwidth Reservation for Time-Critical Vehicular Applications: A Multi-Operator Environment0
Bandit approach to conflict-free multi-agent Q-learning in view of photonic implementation0
Algorithmic Collusion in Auctions: Evidence from Controlled Laboratory Experiments0
A Machine Learning Approach for Task and Resource Allocation in Mobile Edge Computing Based Networks0
Double Deep Q-Learning for Optimal Execution0
Deviations from the Nash equilibrium and emergence of tacit collusion in a two-player optimal execution game with reinforcement learning0
Design and Comparison of Reward Functions in Reinforcement Learning for Energy Management of Sensor Nodes0
DGFN: Double Generative Flow Networks0
Batch Recurrent Q-Learning for Backchannel Generation Towards Engaging Agents0
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation0
"Did You Hear That?" Learning to Play Video Games from Audio Cues0
Differentiable Quantum Architecture Search for Quantum Reinforcement Learning0
Differentially Private Deep Q-Learning for Pattern Privacy Preservation in MEC Offloading0
Diff-Transfer: Model-based Robotic Manipulation Skill Transfer via Differentiable Physics Simulation0
Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task0
Depth and nonlinearity induce implicit exploration for RL0
A Machine Learning Approach for Prosumer Management in Intraday Electricity Markets0
Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning0
Deploying Reinforcement Learning in Water Transport0
Digital Twin Assisted Deep Reinforcement Learning for Online Admission Control in Sliced Network0
Digital Twin-Assisted Efficient Reinforcement Learning for Edge Task Scheduling0
Digital Twin-Assisted Knowledge Distillation Framework for Heterogeneous Federated Learning0
Diluted Near-Optimal Expert Demonstrations for Guiding Dialogue Stochastic Policy Optimisation0
Directed Exploration in PAC Model-Free Reinforcement Learning0
Dependency-Aware Computation Offloading in Mobile Edge Computing: A Reinforcement Learning Approach0
Balancing Two-Player Stochastic Games with Soft Q-Learning0
A Conservative Q-Learning approach for handling distribution shift in sepsis treatment strategies0
Double Deep Q-Learning in Opponent Modeling0
Density Estimation for Conservative Q-Learning0
A Lyapunov Theory for Finite-Sample Guarantees of Asynchronous Q-Learning and TD-Learning Variants0
Show:102550
← PrevPage 12 of 39Next →

No leaderboard results yet.