| DQLAP: Deep Q-Learning Recommender Algorithm with Update Policy for a Real Steam Turbine System | Oct 12, 2022 | Deep LearningFault Detection | —Unverified | 0 |
| β-DQN: Improving Deep Q-Learning By Evolving the Behavior | Jan 1, 2025 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |
| BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems | Aug 17, 2016 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |
| A Modified Q-Learning Algorithm for Rate-Profiling of Polarization Adjusted Convolutional (PAC) Codes | Oct 4, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems | Nov 15, 2017 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |
| Bayesian Risk-Averse Q-Learning with Streaming Observations | May 18, 2023 | Q-Learning | —Unverified | 0 |
| A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret | Jun 8, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A Deep Learning Inference Scheme Based on Pipelined Matrix Multiplication Acceleration Design and Non-uniform Quantization | Oct 10, 2021 | Edge-computingQ-Learning | —Unverified | 0 |
| Bayesian Q-learning With Imperfect Expert Demonstrations | Oct 1, 2022 | Atari GamesQ-Learning | —Unverified | 0 |
| A Maintenance Planning Framework using Online and Offline Deep Reinforcement Learning | Aug 1, 2022 | Asset ManagementDeep Reinforcement Learning | —Unverified | 0 |
| Pretrain Soft Q-Learning with Imperfect Demonstrations | May 9, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| BCQQ: Batch-Constraint Quantum Q-Learning with Cyclic Data Re-uploading | Apr 27, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement Learning: An In Silico Validation | May 18, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| A General Markov Decision Process Framework for Directly Learning Optimal Control Policies | May 28, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Double Q-learning: New Analysis and Sharper Finite-time Bound | Jan 1, 2021 | Q-Learning | —Unverified | 0 |
| D-Point Trigonometric Path Planning based on Q-Learning in Uncertain Environments | Oct 26, 2019 | PositionQ-Learning | —Unverified | 0 |
| DQLEL: Deep Q-Learning for Energy-Optimized LoS/NLoS UWB Node Selection | Aug 24, 2021 | Q-Learning | —Unverified | 0 |
| Dynamic Decision Making in Engineering System Design: A Deep Q-Learning Approach | Dec 28, 2023 | Decision MakingQ-Learning | —Unverified | 0 |
| Design of Artificial Intelligence Agents for Games using Deep Reinforcement Learning | May 10, 2019 | Deep Reinforcement LearningOpenAI Gym | —Unverified | 0 |
| Designing Rewards for Fast Learning | May 30, 2022 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Bandwidth Reservation for Time-Critical Vehicular Applications: A Multi-Operator Environment | Mar 22, 2025 | Deep Reinforcement LearningFairness | —Unverified | 0 |
| Bandit approach to conflict-free multi-agent Q-learning in view of photonic implementation | Dec 20, 2022 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| Algorithmic Collusion in Auctions: Evidence from Controlled Laboratory Experiments | Jun 15, 2023 | Q-Learning | —Unverified | 0 |
| A Machine Learning Approach for Task and Resource Allocation in Mobile Edge Computing Based Networks | Jul 20, 2020 | BIG-bench Machine LearningEdge-computing | —Unverified | 0 |
| Double Deep Q-Learning for Optimal Execution | Dec 17, 2018 | Q-Learning | —Unverified | 0 |
| Deviations from the Nash equilibrium and emergence of tacit collusion in a two-player optimal execution game with reinforcement learning | Aug 21, 2024 | Q-Learning | —Unverified | 0 |
| Design and Comparison of Reward Functions in Reinforcement Learning for Energy Management of Sensor Nodes | Jun 2, 2021 | energy managementManagement | —Unverified | 0 |
| DGFN: Double Generative Flow Networks | Oct 30, 2023 | Drug DiscoveryQ-Learning | —Unverified | 0 |
| Batch Recurrent Q-Learning for Backchannel Generation Towards Engaging Agents | Aug 6, 2019 | Imitation LearningQ-Learning | —Unverified | 0 |
| DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation | Oct 15, 2024 | Decision MakingOffline RL | —Unverified | 0 |
| "Did You Hear That?" Learning to Play Video Games from Audio Cues | Jun 10, 2019 | Game DesignNavigate | —Unverified | 0 |
| Differentiable Quantum Architecture Search for Quantum Reinforcement Learning | Sep 19, 2023 | Q-LearningQuantum Machine Learning | —Unverified | 0 |
| Differentially Private Deep Q-Learning for Pattern Privacy Preservation in MEC Offloading | Feb 9, 2023 | Edge-computingQ-Learning | —Unverified | 0 |
| Diff-Transfer: Model-based Robotic Manipulation Skill Transfer via Differentiable Physics Simulation | Oct 7, 2023 | Q-Learning | —Unverified | 0 |
| Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task | Oct 15, 2024 | ARCDecision Making | —Unverified | 0 |
| Depth and nonlinearity induce implicit exploration for RL | May 29, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A Machine Learning Approach for Prosumer Management in Intraday Electricity Markets | Mar 11, 2022 | BIG-bench Machine LearningManagement | —Unverified | 0 |
| Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning | Feb 5, 2024 | D4RLQ-Learning | —Unverified | 0 |
| Deploying Reinforcement Learning in Water Transport | Dec 14, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Digital Twin Assisted Deep Reinforcement Learning for Online Admission Control in Sliced Network | Oct 7, 2023 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Digital Twin-Assisted Efficient Reinforcement Learning for Edge Task Scheduling | Aug 2, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Digital Twin-Assisted Knowledge Distillation Framework for Heterogeneous Federated Learning | Mar 10, 2023 | Federated LearningKnowledge Distillation | —Unverified | 0 |
| Diluted Near-Optimal Expert Demonstrations for Guiding Dialogue Stochastic Policy Optimisation | Nov 25, 2020 | Imitation LearningQ-Learning | —Unverified | 0 |
| Directed Exploration in PAC Model-Free Reinforcement Learning | Aug 31, 2018 | Efficient Explorationmodel | —Unverified | 0 |
| Dependency-Aware Computation Offloading in Mobile Edge Computing: A Reinforcement Learning Approach | Sep 18, 2019 | Cloud ComputingEdge-computing | —Unverified | 0 |
| Balancing Two-Player Stochastic Games with Soft Q-Learning | Feb 9, 2018 | Q-LearningReinforcement Learning | —Unverified | 0 |
| A Conservative Q-Learning approach for handling distribution shift in sepsis treatment strategies | Mar 25, 2022 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| Double Deep Q-Learning in Opponent Modeling | Nov 24, 2022 | Mixture-of-ExpertsQ-Learning | —Unverified | 0 |
| Density Estimation for Conservative Q-Learning | Sep 29, 2021 | Density EstimationQ-Learning | —Unverified | 0 |
| A Lyapunov Theory for Finite-Sample Guarantees of Asynchronous Q-Learning and TD-Learning Variants | Feb 2, 2021 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |