SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

(Image credit: Playing Atari with Deep Reinforcement Learning)
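As a rough illustration of what "learning a policy" means here, the following is a minimal sketch of tabular Q-learning. The environment (a five-state chain with a reward at the right end), the function names `train_q_table` and `greedy_policy`, and all hyperparameter values are assumptions made for this example; none of them come from the papers listed on this page.

```python
import random


def train_q_table(n_states=5, episodes=500, alpha=0.1, gamma=0.9,
                  epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D chain.

    States are 0..n_states-1, actions are 0 (left) and 1 (right).
    Entering the last state yields reward 1 and ends the episode.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    goal = n_states - 1
    for _ in range(episodes):
        state = 0
        while state != goal:
            # Epsilon-greedy action choice; ties are broken at random.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            # Deterministic transition of the assumed chain environment.
            next_state = max(0, state - 1) if action == 0 else state + 1
            reward = 1.0 if next_state == goal else 0.0
            # Q-learning update: move Q(s, a) toward r + gamma * max_a' Q(s', a').
            target = reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Return, for each state, the action with the highest Q-value."""
    return [max(range(len(row)), key=row.__getitem__) for row in q]
```

The learned policy is read off the table by taking the highest-valued action in each state, for example `greedy_policy(train_q_table())`. Deep Q-learning variants replace the table with a neural network, but the update target keeps the same form.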

Papers

Showing 1251-1300 of 1918 papers

Title | Status | Hype
MFC-EQ: Mean-Field Control with Envelope Q-Learning for Moving Decentralized Agents in Formation |  | 0
Millimeter Wave Communications with an Intelligent Reflector: Performance Optimization and Distributional Reinforcement Learning |  | 0
Mimicking Human Intuition: Cognitive Belief-Driven Q-Learning |  | 0
Minimax Optimal Q Learning with Nearest Neighbors |  | 0
Minimizing Age-of-Information for Fog Computing-supported Vehicular Networks with Deep Q-learning |  | 0
Minimizing the Outage Probability in a Markov Decision Process |  | 0
Misspecified Q-Learning with Sparse Linear Function Approximation: Tight Bounds on Approximation Error |  | 0
Mitigate Bias in Face Recognition using Skewness-Aware Reinforcement Learning |  | 0
Mitigating Bias in Face Recognition Using Skewness-Aware Reinforcement Learning |  | 0
Mitigating Relative Over-Generalization in Multi-Agent Reinforcement Learning |  | 0
Mixed-Precision Conjugate Gradient Solvers with RL-Driven Precision Tuning |  | 0
Mix Q-learning for Lane Changing: A Collaborative Decision-Making Method in Multi-Agent Deep Reinforcement Learning |  | 0
Model-aided Deep Reinforcement Learning for Sample-efficient UAV Trajectory Design in IoT Networks |  | 0
Model-Augmented Q-learning |  | 0
Model-based Multi-Agent Reinforcement Learning with Cooperative Prioritized Sweeping |  | 0
Model-Based Reinforcement Learning for Type 1 Diabetes Blood Glucose Control |  | 0
Model-based versus model-free feeding control and water quality monitoring for fish growth tracking in aquaculture systems |  | 0
Provably Efficient Model-Free Algorithm for MDPs with Peak Constraints |  | 0
Model-Free Algorithm and Regret Analysis for MDPs with Long-Term Constraints |  | 0
Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov Games |  | 0
Model-Free Characterizations of the Hamilton-Jacobi-Bellman Equation and Convex Q-Learning in Continuous Time |  | 0
Model-free Control of Chaos with Continuous Deep Q-learning |  | 0
Model-Free Mean-Field Reinforcement Learning: Mean-Field MDP and Mean-Field Q-Learning |  | 0
Model-free optimal controller for discrete-time Markovian jump linear systems: A Q-learning approach |  | 0
Model-free Posterior Sampling via Learning Rate Randomization |  | 0
Model-Free Reinforcement Learning for Automated Fluid Administration in Critical Care |  | 0
Whittle Index based Q-Learning for Wireless Edge Caching with Linear Function Approximation |  | 0
Model-free Resilient Controller Design based on Incentive Feedback Stackelberg Game and Q-learning |  | 0
Model-Free Robust Average-Reward Reinforcement Learning |  | 0
Modeling Fake News in Social Networks with Deep Multi-Agent Reinforcement Learning |  | 0
Modelling Bahdanau Attention using Election methods aided by Q-Learning |  | 0
Modelling Stock-market Investors as Reinforcement Learning Agents [Correction] |  | 0
Modelling the Dynamics of Multiagent Q-Learning in Repeated Symmetric Games: a Mean Field Theoretic Approach |  | 0
Modified Double DQN: addressing stability |  | 0
MODRL-TA: A Multi-Objective Deep Reinforcement Learning Framework for Traffic Allocation in E-Commerce Search |  | 0
Momentum Q-learning with Finite-Sample Convergence Guarantee |  | 0
Multi-agent Assessment with QoS Enhancement for HD Map Updates in a Vehicular Network |  | 0
Multi-agent Bayesian Deep Reinforcement Learning for Microgrid Energy Management under Communication Failures |  | 0
Multi-Agent Deep Reinforcement Learning for Energy Efficient Multi-Hop STAR-RIS-Assisted Transmissions |  | 0
Multi Agent DeepRL based Joint Power and Subchannel Allocation in IAB networks |  | 0
Multi-Agent Double Deep Q-Learning for Beamforming in mmWave MIMO Networks |  | 0
Multi-Agent Inverse Q-Learning from Demonstrations |  | 0
Multi-Agent Q-Learning Dynamics in Random Networks: Convergence due to Exploration and Sparsity |  | 0
Multi-Agent Q-Learning for Minimizing Demand-Supply Power Deficit in Microgrids |  | 0
Multi-Agent Q-Learning for Real-Time Load Balancing User Association and Handover in Mobile Networks |  | 0
Multi-Agent Reinforcement Learning Based Resource Allocation for UAV Networks |  | 0
Multi-Agent Reinforcement Learning for Offloading Cellular Communications with Cooperating UAVs |  | 0
Multi-agent Reinforcement Learning for Resource Allocation in IoT networks with Edge Computing |  | 0
Multi-Agent Reinforcement Learning for Markov Routing Games: A New Modeling Paradigm For Dynamic Traffic Assignment |  | 0
Multi-Agent Reinforcement Learning for Channel Assignment and Power Allocation in Platoon-Based C-V2X Systems |  | 0
Page 26 of 39

No leaderboard results yet.