| Q-Learning for Continuous Actions with Cross-Entropy Guided Policies | Mar 25, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Q-Learning for MDPs with General Spaces: Convergence and Near Optimality via Quantization under Weak Continuity | Nov 12, 2021 | Q-LearningQuantization | —Unverified | 0 |
| Mean-Field Controls with Q-learning for Cooperative MARL: Convergence and Complexity Analysis | Feb 10, 2020 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Q-learning for Optimal Control of Continuous-time Systems | Oct 11, 2014 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Q-learning for POMDP: An application to learning locomotion gaits | Sep 30, 2019 | Q-Learning | —Unverified | 0 |
| Q-learning for real time control of heterogeneous microagent collectives | Sep 29, 2021 | Q-Learning | —Unverified | 0 |
| Q-Learning for Stochastic Control under General Information Structures and Non-Markovian Environments | Oct 31, 2023 | Q-LearningQuantization | —Unverified | 0 |
| q-Learning in Continuous Time | Jul 2, 2022 | Learning TheoryQ-Learning | —Unverified | 0 |
| Q-Learning in enormous action spaces via amortized approximate maximization | Jan 22, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Q-Learning in Regularized Mean-field Games | Mar 24, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Q-Learning Inspired Self-Tuning for Energy Efficiency in HPC | Jun 26, 2019 | Q-Learning | —Unverified | 0 |
| Q-learning optimization in a multi-agents system for image segmentation | Nov 23, 2013 | Image SegmentationQ-Learning | —Unverified | 0 |
| Q-learning pour la r\'esolution des anaphores pronominales en langue arabe (Q-learning for pronominal anaphora resolution in Arabic texts) | Jul 1, 2019 | Q-Learning | —Unverified | 0 |
| Q-Learning Scheduler for Multi-Task Learning through the use of Histogram of Task Uncertainty | Sep 29, 2021 | Multi-Task LearningQ-Learning | —Unverified | 0 |
| Q-Learning Scheduler for Multi Task Learning Through the use of Histogram of Task Uncertainty | May 1, 2022 | Multi-Task LearningQ-Learning | —Unverified | 0 |
| Q-learning with temporal memory to navigate turbulence | Apr 26, 2024 | Decision MakingNavigate | —Unverified | 0 |
| Q-Learning with Basic Emotions | Sep 6, 2016 | Q-Learning | —Unverified | 0 |
| Q-Learning with Clustered-SMART (cSMART) Data: Examining Moderators in the Construction of Clustered Adaptive Interventions | May 1, 2025 | Q-Learning | —Unverified | 0 |
| Q-Learning with Differential Entropy of Q-Tables | Jun 26, 2020 | Q-Learning | —Unverified | 0 |
| Q-learning with Logarithmic Regret | Jun 16, 2020 | Q-Learning | —Unverified | 0 |
| Q-learning with Nearest Neighbors | Feb 12, 2018 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Q-learning with online random forests | Apr 7, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Q-learning with UCB Exploration is Sample Efficient for Infinite-Horizon MDP | Jan 27, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Q-learning with Uniformly Bounded Variance: Large Discounting is Not a Barrier to Fast Learning | Feb 24, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Q-MIND: Defeating Stealthy DoS Attacks in SDN with a Machine-learning based Defense Framework | Jul 27, 2019 | Anomaly DetectionBIG-bench Machine Learning | —Unverified | 0 |
| Q-Networks for Binary Vector Actions | Dec 4, 2015 | Q-Learningreinforcement-learning | —Unverified | 0 |
| QoS-Aware Power Minimization of Distributed Many-Core Servers using Transfer Q-Learning | Feb 2, 2021 | Q-Learning | —Unverified | 0 |
| Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning | Nov 7, 2024 | Offline RLPolicy Gradient Methods | —Unverified | 0 |
| Q-SMASH: Q-Learning-based Self-Adaptation of Human-Centered Internet of Things | Jul 13, 2021 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions | Sep 18, 2023 | Imitation LearningOffline RL | —Unverified | 0 |
| QT-TDM: Planning With Transformer Dynamics Model and Autoregressive Q-Learning | Jul 26, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Quadratic Q-network for Learning Continuous Control for Autonomous Vehicles | Nov 29, 2019 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping | Oct 1, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Quantitative Trading using Deep Q Learning | Apr 3, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Quantum Architecture Search via Continual Reinforcement Learning | Dec 10, 2021 | Continual LearningDeep Reinforcement Learning | —Unverified | 0 |
| Quantum deep Q learning with distributed prioritized experience replay | Apr 19, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Quantum deep recurrent reinforcement learning | Oct 26, 2022 | Decision MakingQ-Learning | —Unverified | 0 |
| Quantum-Inspired Reinforcement Learning in the Presence of Epistemic Ambivalence | Mar 6, 2025 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Quantum Observables for continuous control of the Quantum Approximate Optimization Algorithm via Reinforcement Learning | Nov 21, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Deep Reinforcement Learning via L-BFGS Optimization | Nov 6, 2018 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Q-WSL: Optimizing Goal-Conditioned RL with Weighted Supervised Learning via Dynamic Programming | Oct 9, 2024 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Reward Prediction Error as an Exploration Objective in Deep RL | Jun 19, 2019 | Atari GamesContinuous Control | —Unverified | 0 |
| QXplore: Q-Learning Exploration by Maximizing Temporal Difference Error | Sep 25, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Random-Key Algorithms for Optimizing Integrated Operating Room Scheduling | Jan 17, 2025 | Combinatorial OptimizationDecoder | —Unverified | 0 |
| Rank-One Modified Value Iteration | May 3, 2025 | Q-Learning | —Unverified | 0 |
| RansomAI: AI-powered Ransomware for Stealthy Encryption | Jun 27, 2023 | Q-LearningRaspberry Pi 4 | —Unverified | 0 |
| RCsearcher: Reaction Center Identification in Retrosynthesis via Deep Q-Learning | Jan 28, 2023 | Deep Reinforcement LearningGraph Neural Network | —Unverified | 0 |
| Real-time Active Vision for a Humanoid Soccer Robot Using Deep Reinforcement Learning | Nov 27, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Realtime Spectrum Monitoring via Reinforcement Learning -- A Comparison Between Q-Learning and Heuristic Methods | Jul 11, 2023 | ManagementQ-Learning | —Unverified | 0 |
| Real-World Offline Reinforcement Learning from Vision Language Model Feedback | Nov 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |