| Stochastic Approximation with Unbounded Markovian Noise: A General-Purpose Theorem | Oct 29, 2024 | Q-LearningStochastic Optimization | —Unverified | 0 | 0 |
| Stochastic Gradient Descent with Dependent Data for Offline Reinforcement Learning | Feb 6, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Stochastic Lipschitz Q-Learning | Apr 24, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 | 0 |
| Stochastic Q-learning for Large Discrete Action Spaces | May 16, 2024 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Stochastic Variance Reduction for Deep Q-learning | May 20, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Strategizing against Q-learners: A Control-theoretical Approach | Mar 13, 2024 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Striving for Simplicity in Off-Policy Deep Reinforcement Learning | Sep 25, 2019 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Structural Similarity for Improved Transfer in Reinforcement Learning | Jul 27, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Structured Q-learning For Antibody Design | Sep 10, 2022 | Combinatorial OptimizationMolecular Docking | —Unverified | 0 | 0 |
| Structure Learning of Deep Neural Networks with Q-Learning | Oct 31, 2018 | image-classificationImage Classification | —Unverified | 0 | 0 |
| Structure learning with Temporal Gaussian Mixture for model-based Reinforcement Learning | Nov 18, 2024 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Successive Over Relaxation Q-Learning | Mar 9, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 | 0 |
| Success-Rate Targeted Reinforcement Learning by Disorientation Penalty | Jan 1, 2021 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Sufficient Exploration for Convex Q-learning | Oct 17, 2022 | OpenAI GymQ-Learning | —Unverified | 0 | 0 |
| Supervised Advantage Actor-Critic for Recommender Systems | Nov 5, 2021 | Q-LearningRecommendation Systems | —Unverified | 0 | 0 |
| Supervised Q-walk for Learning Vector Representation of Nodes in Networks | Oct 3, 2017 | ClassificationGeneral Classification | —Unverified | 0 | 0 |
| Suppressing Overestimation in Q-Learning through Adversarial Behaviors | Oct 10, 2023 | Q-Learning | —Unverified | 0 | 0 |
| Survey on Multi-Agent Q-Learning frameworks for resource management in wireless sensor network | May 5, 2021 | ManagementQ-Learning | —Unverified | 0 | 0 |
| SVQN: Sequential Variational Soft Q-Learning Networks | Jan 1, 2020 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Symmetric Q-learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning | Mar 12, 2024 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Tabular and Deep Learning for the Whittle Index | Jun 4, 2024 | Deep LearningQ-Learning | —Unverified | 0 | 0 |
| Model-based Offline Reinforcement Learning with Lower Expectile Q-Learning | Jun 30, 2024 | D4RLOffline RL | —Unverified | 0 | 0 |
| Tactical Reward Shaping: Bypassing Reinforcement Learning with Strategy-Based Goals | Oct 8, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Taming Lagrangian Chaos with Multi-Objective Reinforcement Learning | Dec 19, 2022 | Multi-Objective Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Target-Based Temporal Difference Learning | Apr 24, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 | 0 |
| Target Network and Truncation Overcome The Deadly Triad in Q-Learning | Mar 5, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Target Transfer Q-Learning and Its Convergence Analysis | Sep 21, 2018 | Q-LearningReinforcement Learning | —Unverified | 0 | 0 |
| Task Independent Capsule-Based Agents for Deep Q-Learning | Jan 11, 2022 | Deep Reinforcement LearningObject Recognition | —Unverified | 0 | 0 |
| TD Learning with Constrained Gradients | Jan 1, 2018 | Q-Learning | —Unverified | 0 | 0 |
| Teaching a Robot to Walk Using Reinforcement Learning | Dec 13, 2021 | OpenAI GymQ-Learning | —Unverified | 0 | 0 |
| Temporal Difference Learning with Compressed Updates: Error-Feedback meets Reinforcement Learning | Jan 3, 2023 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Temporal Difference Models: Model-Free Deep RL for Model-Based Control | Feb 25, 2018 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates | Oct 28, 2021 | Q-LearningScheduling | —Unverified | 0 | 0 |
| Text Generation with Efficient (Soft) Q-Learning | Sep 29, 2021 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| MinMaxMin Q-learning | Feb 3, 2024 | MuJoCoQ-Learning | —Unverified | 0 | 0 |
| SQT -- std Q-target | Feb 3, 2024 | MuJoCoQ-Learning | —Unverified | 0 | 0 |
| The association problem in wireless networks: a Policy Gradient Reinforcement Learning approach | Jun 11, 2013 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| The Best Time for an Update: Risk-Sensitive Minimization of Age-Based Metrics | Jan 3, 2024 | Q-Learning | —Unverified | 0 | 0 |
| The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and Beyond | May 18, 2023 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| The Effect of Q-function Reuse on the Total Regret of Tabular, Model-Free, Reinforcement Learning | Mar 7, 2021 | Q-LearningTransfer Learning | —Unverified | 0 | 0 |
| The Efficacy of Pessimism in Asynchronous Q-Learning | Mar 14, 2022 | Q-Learning | —Unverified | 0 | 0 |
| Evolution of cooperation with Q-learning: the impact of information perception | Jul 29, 2024 | DiversityQ-Learning | —Unverified | 0 | 0 |
| The Gambler's Problem and Beyond | Dec 31, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| The Game Imitation: Deep Supervised Convolutional Networks for Quick Video Game AI | Feb 18, 2017 | Decision MakingImitation Learning | —Unverified | 0 | 0 |
| The impact of surplus sharing on the outcomes of specific investments under negotiated transfer pricing: An agent-based simulation with fuzzy Q-learning agents | Jan 28, 2023 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| The Integration of Machine Learning into Automated Test Generation: A Systematic Mapping Study | Jun 21, 2022 | BIG-bench Machine LearningQ-Learning | —Unverified | 0 | 0 |
| The Least Restriction for Offline Reinforcement Learning | Jul 5, 2021 | Offline RLQ-Learning | —Unverified | 0 | 0 |
| Deep Q-Learning: Theoretical Insights from an Asymptotic Analysis | Aug 25, 2020 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| The Point to Which Soft Actor-Critic Converges | Mar 1, 2023 | Q-Learning | —Unverified | 0 | 0 |
| The QLBS Q-Learner Goes NuQLear: Fitted Q Iteration, Inverse RL, and Option Portfolios | Jan 17, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |