| A Study of Continual Learning Methods for Q-Learning | Jun 8, 2022 | Continual Learning, Q-Learning | —Unverified | 0 | 0 |
| A study of first-passage time minimization via Q-learning in heated gridworlds | Oct 5, 2021 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| A study on a Q-Learning algorithm application to a manufacturing assembly problem | Apr 17, 2023 | Decision Making, Q-Learning | —Unverified | 0 | 0 |
| A review of motion planning algorithms for intelligent robotics | Feb 4, 2021 | Motion Planning, Q-Learning | —Unverified | 0 | 0 |
| Asymptotic Convergence and Performance of Multi-Agent Q-Learning Dynamics | Jan 23, 2023 | Q-Learning | —Unverified | 0 | 0 |
| Asymptotic regularity of a generalised stochastic Halpern scheme with applications | Nov 7, 2024 | Q-Learning, Stochastic Optimization | —Unverified | 0 | 0 |
| Asymptotics of Reinforcement Learning with Neural Networks | Nov 13, 2019 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Unsynchronized Decentralized Q-Learning: Two Timescale Analysis By Persistence | Aug 7, 2023 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Asynchronous Deep Double Duelling Q-Learning for Trading-Signal Execution in Limit Order Book Markets | Jan 20, 2023 | Deep Reinforcement Learning, Management | —Unverified | 0 | 0 |
| Asynchronous Stochastic Approximation and Average-Reward Reinforcement Learning | Sep 5, 2024 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| A Technique to Create Weaker Abstract Board Game Agents via Reinforcement Learning | Sep 1, 2022 | Board Games, Q-Learning | —Unverified | 0 | 0 |
| A Theoretical Analysis of Deep Q-Learning | Jan 1, 2019 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| A Theory of Regularized Markov Decision Processes | Jan 31, 2019 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Attitude Control of Highly Maneuverable Aircraft Using an Improved Q-learning | Oct 22, 2022 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| A Tutorial Introduction to Reinforcement Learning | Apr 3, 2023 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| A unified decision making framework for supply and demand management in microgrid networks | Nov 14, 2017 | Decision Making, Management | —Unverified | 0 | 0 |
| A Unified Switching System Perspective and O.D.E. Analysis of Q-Learning Algorithms | Dec 4, 2019 | Q-Learning | —Unverified | 0 | 0 |
| A Unified Switching System Perspective and Convergence Analysis of Q-Learning Algorithms | Dec 1, 2020 | Q-Learning | —Unverified | 0 | 0 |
| Automatic Derivation Of Formulas Using Reforcement Learning | Aug 15, 2018 | Q-Learning | —Unverified | 0 | 0 |
| Automatic Discovery of Multi-perspective Process Model using Reinforcement Learning | Nov 30, 2022 | Model Discovery, Q-Learning | —Unverified | 0 | 0 |
| Automatic formation of the structure of abstract machines in hierarchical reinforcement learning with state clustering | Jun 13, 2018 | Clustering, Hierarchical Reinforcement Learning | —Unverified | 0 | 0 |
| Automating Control of Overestimation Bias for Reinforcement Learning | Oct 26, 2021 | Continuous Control, Q-Learning | —Unverified | 0 | 0 |
| Automating proton PBS treatment planning for head and neck cancers using policy gradient-based deep reinforcement learning | Sep 17, 2024 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Autonomous Control of a Line Follower Robot Using a Q-Learning Controller | Jan 23, 2020 | Friction, Q-Learning | —Unverified | 0 | 0 |
| Autonomous CRM Control via CLV Approximation with Deep Reinforcement Learning in Discrete and Continuous Action Space | Apr 8, 2015 | Deep Reinforcement Learning, Management | —Unverified | 0 | 0 |
| Autonomous Driving with Deep Reinforcement Learning in CARLA Simulation | Jun 20, 2023 | Autonomous Driving, Autonomous Vehicles | —Unverified | 0 | 0 |
| Autonomous Penetration Testing using Reinforcement Learning | May 15, 2019 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Autonomous Vehicle Decision-Making Framework for Considering Malicious Behavior at Unsignalized Intersections | Sep 11, 2024 | Autonomous Vehicles, Decision Making | —Unverified | 0 | 0 |
| Autonomous Vehicle Fleet Coordination With Deep Reinforcement Learning | Jan 1, 2018 | Autonomous Vehicles, Decision Making | —Unverified | 0 | 0 |
| Autonomous Warehouse Robot using Deep Q-Learning | Feb 21, 2022 | Deep Reinforcement Learning, Navigate | —Unverified | 0 | 0 |
| Avoiding Catastrophic States with Intrinsic Fear | Jan 1, 2018 | Atari Games, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Balanced Q-learning: Combining the Influence of Optimistic and Pessimistic Targets | Nov 3, 2021 | Q-Learning | —Unverified | 0 | 0 |
| Balancing a CartPole System with Reinforcement Learning -- A Tutorial | Jun 8, 2020 | OpenAI Gym, Q-Learning | —Unverified | 0 | 0 |
| Balancing Profit, Risk, and Sustainability for Portfolio Management | Jun 6, 2022 | Management, Portfolio Optimization | —Unverified | 0 | 0 |
| Balancing Two-Player Stochastic Games with Soft Q-Learning | Feb 9, 2018 | Q-Learning, Reinforcement Learning | —Unverified | 0 | 0 |
| Bandit approach to conflict-free multi-agent Q-learning in view of photonic implementation | Dec 20, 2022 | Decision Making, Multi-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Bandwidth Reservation for Time-Critical Vehicular Applications: A Multi-Operator Environment | Mar 22, 2025 | Deep Reinforcement Learning, Fairness | —Unverified | 0 | 0 |
| Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement Learning: An In Silico Validation | May 18, 2020 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| BCQQ: Batch-Constraint Quantum Q-Learning with Cyclic Data Re-uploading | Apr 27, 2023 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Batch Recurrent Q-Learning for Backchannel Generation Towards Engaging Agents | Aug 6, 2019 | Imitation Learning, Q-Learning | —Unverified | 0 | 0 |
| Bayesian Q-learning With Imperfect Expert Demonstrations | Oct 1, 2022 | Atari Games, Q-Learning | —Unverified | 0 | 0 |
| Bayesian Risk-Averse Q-Learning with Streaming Observations | May 18, 2023 | Q-Learning | —Unverified | 0 | 0 |
| BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems | Nov 15, 2017 | Deep Reinforcement Learning, Efficient Exploration | —Unverified | 0 | 0 |
| BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems | Aug 17, 2016 | Deep Reinforcement Learning, Efficient Exploration | —Unverified | 0 | 0 |
| β-DQN: Improving Deep Q-Learning By Evolving the Behavior | Jan 1, 2025 | Deep Reinforcement Learning, Efficient Exploration | —Unverified | 0 | 0 |
| A Multi-Agent Reinforcement Learning Approach For Safe and Efficient Behavior Planning Of Connected Autonomous Vehicles | Mar 9, 2020 | Autonomous Vehicles, Multi-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Benchmarking projective simulation in navigation problems | Apr 23, 2018 | Benchmarking, Q-Learning | —Unverified | 0 | 0 |
| Best Possible Q-Learning | Feb 2, 2023 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning | Aug 6, 1999 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Bias or Optimality? Disentangling Bayesian Inference and Learning Biases in Human Decision-Making | May 12, 2025 | Bayesian Inference, Decision Making | —Unverified | 0 | 0 |