SOTAVerified

Q-Learning

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.
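
As a rough illustration of how such a policy can be learned, here is a minimal tabular sketch. The chain environment, the function names `train_q_table` and `greedy_policy`, and all hyperparameter values are assumptions made for this example; they do not come from any paper listed on this page.

```python
import random

def train_q_table(n_states=5, episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D chain.

    States are 0..n_states-1, actions are 0 (left) and 1 (right),
    and the only reward is 1.0 for stepping into the last state.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    goal = n_states - 1
    for _ in range(episodes):
        state = 0
        while state != goal:
            # Epsilon-greedy choice; ties are broken at random so the agent
            # still moves around before any reward has been observed.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            next_state = max(0, state - 1) if action == 0 else state + 1
            reward = 1.0 if next_state == goal else 0.0
            # Q-learning update: move Q(s, a) toward r + gamma * max_a' Q(s', a').
            target = reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q

def greedy_policy(q):
    """Return, for each state, the index of the highest-valued action."""
    return [max(range(len(row)), key=row.__getitem__) for row in q]
```

Calling `greedy_policy(train_q_table())` gives one action per state; in this toy chain the learned policy should choose "right" in every non-goal state.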

(Image credit: Playing Atari with Deep Reinforcement Learning)

Papers

Showing 451-500 of 1918 papers

Title | Status | Hype
Numeric Reward Machines | | 0
Reinforcement Learning Problem Solving with Large Language Models | | 0
Using Deep Q-Learning to Dynamically Toggle between Push/Pull Actions in Computational Trust Mechanisms | | 0
Q-learning with temporal memory to navigate turbulence | | 0
Recursive Backwards Q-Learning in Deterministic Environments | | 0
Age of Information Minimization using Multi-agent UAVs based on AI-Enhanced Mean Field Resource Allocation | | 0
AFU: Actor-Free critic Updates in off-policy RL for continuous control | Code | 0
Unified ODE Analysis of Smooth Q-Learning Algorithms | | 0
Continuous-time Risk-sensitive Reinforcement Learning via Quadratic Variation Penalty | | 0
Data-Incremental Continual Offline Reinforcement Learning | | 0
From r to Q^*: Your Language Model is Secretly a Q-Function | | 0
Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL | | 0
Advancing Forest Fire Prevention: Deep Reinforcement Learning for Effective Firebreak Placement | | 0
Prelimit Coupling and Steady-State Convergence of Constant-stepsize Nonsmooth Contractive SA | | 0
Traffic Signal Control and Speed Offset Coordination Using Q-Learning for Arterial Road Networks | | 0
Deep Reinforcement Learning Control for Disturbance Rejection in a Nonlinear Dynamic System with Parametric Uncertainty | | 0
Growing Q-Networks: Solving Continuous Control Tasks with Adaptive Control Resolution | | 0
Superior Genetic Algorithms for the Target Set Selection Problem Based on Power-Law Parameter Choices and Simple Greedy Heuristics | Code | 0
Data-Driven Knowledge Transfer in Batch Q^* Learning | | 0
Utilizing Maximum Mean Discrepancy Barycenter for Propagating the Uncertainty of Value Functions in Reinforcement Learning | | 0
EnCoMP: Enhanced Covert Maneuver Planning with Adaptive Threat-Aware Visibility Estimation using Offline Reinforcement Learning | | 0
From Two-Dimensional to Three-Dimensional Environment with Q-Learning: Modeling Autonomous Navigation with Reinforcement Learning and no Libraries | Code | 0
Compressed Federated Reinforcement Learning with a Generative Model | Code | 0
DASA: Delay-Adaptive Multi-Agent Stochastic Approximation | | 0
Semantic-Aware Remote Estimation of Multiple Markov Sources Under Constraints | | 0
A Fairness-Oriented Reinforcement Learning Approach for the Operation and Control of Shared Micromobility Services | Code | 0
Reinforcement Learning for Online Testing of Autonomous Driving Systems: a Replication and Extension Study | | 0
State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards | | 0
Neural-Kernel Conditional Mean Embeddings | | 0
A Reinforcement Learning Approach to Dairy Farm Battery Management using Q Learning | | 0
Model-free Resilient Controller Design based on Incentive Feedback Stackelberg Game and Q-learning | | 0
Strategizing against Q-learners: A Control-theoretical Approach | | 0
Optimal Design and Implementation of an Open-source Emulation Platform for User-Centric Shared E-mobility Services | | 0
Symmetric Q-learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning | | 0
Scalable Online Exploration via Coverability | Code | 0
Finite-Time Error Analysis of Soft Q-Learning: Switching System Approach | | 0
Algorithmic Collusion and Price Discrimination: The Over-Usage of Data | | 0
Enhancing Classification Performance via Reinforcement Learning for Feature Selection | | 0
Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations | Code | 0
SMAUG: A Sliding Multidimensional Task Window-Based MARL Framework for Adaptive Real-Time Subtask Recognition | | 0
QF-tuner: Breaking Tradition in Reinforcement Learning | | 0
SPRINQL: Sub-optimal Demonstrations driven Offline Imitation Learning | Code | 0
Reinforcement Learning for Optimal Execution when Liquidity is Time-Varying | | 0
Stochastic Approximation with Delayed Updates: Finite-Time Rates under Markovian Sampling | | 0
An Index Policy Based on Sarsa and Q-learning for Heterogeneous Smart Target Tracking | | 0
Finite-Time Error Analysis of Online Model-Based Q-Learning with a Relaxed Sampling Model | | 0
Easy as ABCs: Unifying Boltzmann Q-Learning and Counterfactual Regret Minimization | | 0
Reinforcement learning to maximise wind turbine energy generation | | 0
Exploiting Estimation Bias in Clipped Double Q-Learning for Continous Control Reinforcement Learning Tasks | | 0
Intelligent Agricultural Management Considering N_2O Emission and Climate Variability with Uncertainties | | 0

No leaderboard results yet.