| Depth and nonlinearity induce implicit exploration for RL | May 29, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Distributed 3D-Beam Reforming for Hovering-Tolerant UAVs Communication over Coexistence: A Deep-Q Learning for Intelligent Space-Air-Ground Integrated Networks | Jul 18, 2023 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Distributed Deep Q-Learning | Aug 18, 2015 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Distributed Deep Reinforcement Learning for Collaborative Spectrum Sharing | Apr 6, 2021 | Combinatorial OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| Distributed Edge Caching via Reinforcement Learning in Fog Radio Access Networks | Feb 27, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A Machine Learning Approach for Prosumer Management in Intraday Electricity Markets | Mar 11, 2022 | BIG-bench Machine LearningManagement | —Unverified | 0 |
| Deploying Reinforcement Learning in Water Transport | Dec 14, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Distributed Multi-Agent Deep Q-Learning for Fast Roaming in IEEE 802.11ax Wi-Fi Systems | Mar 25, 2023 | Q-Learning | —Unverified | 0 |
| Distributed Q-Learning with State Tracking for Multi-agent Networked Control | Dec 22, 2020 | Q-LearningState Estimation | —Unverified | 0 |
| Distributed Reinforcement Learning for Cooperative Multi-Robot Object Manipulation | Mar 21, 2020 | ObjectQ-Learning | —Unverified | 0 |
| BIBI System Description: Building with CNNs and Breaking with Deep Reinforcement Learning | Sep 1, 2017 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Distributional Advantage Actor-Critic | Jun 10, 2018 | Q-Learningquantile regression | —Unverified | 0 |
| Biomimetic Ultra-Broadband Perfect Absorbers Optimised with Reinforcement Learning | Oct 28, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Distributionally Robust Reinforcement Learning | Feb 23, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Distributional Reinforcement Learning-based Energy Arbitrage Strategies in Imbalance Settlement Mechanism | Dec 23, 2023 | Distributional Reinforcement LearningQ-Learning | —Unverified | 0 |
| Distribution-Free Uncertainty Quantification in Mechanical Ventilation Treatment: A Conformal Deep Q-Learning Framework | Dec 17, 2024 | Conformal PredictionDeep Reinforcement Learning | —Unverified | 0 |
| Distributive Dynamic Spectrum Access through Deep Reinforcement Learning: A Reservoir Computing Based Approach | Oct 28, 2018 | BIG-bench Machine LearningDeep Reinforcement Learning | —Unverified | 0 |
| Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition | Feb 2, 2023 | DiversityQ-Learning | —Unverified | 0 |
| DO-IQS: Dynamics-Aware Offline Inverse Q-Learning for Optimal Stopping with Unknown Gain Functions | Mar 5, 2025 | Q-Learning | —Unverified | 0 |
| Domain Adversarial Reinforcement Learning for Partial Domain Adaptation | May 10, 2019 | Domain AdaptationPartial Domain Adaptation | —Unverified | 0 |
| Double A3C: Deep Reinforcement Learning on OpenAI Gym Games | Mar 4, 2023 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Double Deep Q-Learning-based Path Selection and Service Placement for Latency-Sensitive Beyond 5G Applications | Sep 18, 2023 | Q-Learning | —Unverified | 0 |
| Dependency-Aware Computation Offloading in Mobile Edge Computing: A Reinforcement Learning Approach | Sep 18, 2019 | Cloud ComputingEdge-computing | —Unverified | 0 |
| Double Deep Q-Learning for Optimal Execution | Dec 17, 2018 | Q-Learning | —Unverified | 0 |
| Balancing Two-Player Stochastic Games with Soft Q-Learning | Feb 9, 2018 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Double Q(σ) and Q(σ, λ): Unifying Reinforcement Learning Control Algorithms | Nov 5, 2017 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A Conservative Q-Learning approach for handling distribution shift in sepsis treatment strategies | Mar 25, 2022 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| Double Q-Learning for Citizen Relocation During Natural Hazards | Sep 8, 2022 | Q-Learning | —Unverified | 0 |
| Double Q-learning: New Analysis and Sharper Finite-time Bound | Jan 1, 2021 | Q-Learning | —Unverified | 0 |
| Analytically Tractable Bayesian Deep Q-Learning | Jun 21, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Bootstrapped Hindsight Experience replay with Counterintuitive Prioritization | Sep 29, 2021 | Q-Learning | —Unverified | 0 |
| D-Point Trigonometric Path Planning based on Q-Learning in Uncertain Environments | Oct 26, 2019 | PositionQ-Learning | —Unverified | 0 |
| DQ-GAT: Towards Safe and Efficient Autonomous Driving with Deep Q-Learning and Graph Attention Networks | Aug 11, 2021 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| DQLAP: Deep Q-Learning Recommender Algorithm with Update Policy for a Real Steam Turbine System | Oct 12, 2022 | Deep LearningFault Detection | —Unverified | 0 |
| DQLEL: Deep Q-Learning for Energy-Optimized LoS/NLoS UWB Node Selection | Aug 24, 2021 | Q-Learning | —Unverified | 0 |
| DRIFT: Deep Reinforcement Learning for Functional Software Testing | Jul 16, 2020 | Deep Reinforcement LearningGraph Neural Network | —Unverified | 0 |
| DRILL-- Deep Reinforcement Learning for Refinement Operators in ALC | Jun 29, 2021 | Deep Reinforcement LearningKnowledge Graphs | —Unverified | 0 |
| Driving Decision and Control for Autonomous Lane Change based on Deep Reinforcement Learning | Apr 23, 2019 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Breaking the Deadly Triad with a Target Network | Jan 21, 2021 | Q-Learning | —Unverified | 0 |
| DRL-Based Dynamic Channel Access and SCLAR Maximization for Networks Under Jamming | Feb 2, 2024 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL | Apr 15, 2024 | GPUOffline RL | —Unverified | 0 |
| Breaking the Sample Complexity Barrier to Regret-Optimal Model-Free Reinforcement Learning | Oct 9, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Density Estimation for Conservative Q-Learning | Sep 29, 2021 | Density EstimationQ-Learning | —Unverified | 0 |
| Bridging the Gap Between Value and Policy Based Reinforcement Learning | Feb 28, 2017 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Dynamic Decision Making in Engineering System Design: A Deep Q-Learning Approach | Dec 28, 2023 | Decision MakingQ-Learning | —Unverified | 0 |
| Bridging the Performance Gap Between Target-Free and Target-Based Reinforcement Learning With Iterated Q-Learning | Jun 4, 2025 | Q-Learning | —Unverified | 0 |
| A Lyapunov Theory for Finite-Sample Guarantees of Asynchronous Q-Learning and TD-Learning Variants | Feb 2, 2021 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Dynamic Optimization of Storage Systems Using Reinforcement Learning Techniques | Dec 29, 2024 | CPUQ-Learning | —Unverified | 0 |
| Dynamic Retail Pricing via Q-Learning -- A Reinforcement Learning Framework for Enhanced Revenue Management | Nov 27, 2024 | Decision MakingManagement | —Unverified | 0 |
| Addressing the issue of stochastic environments and local decision-making in multi-objective reinforcement learning | Nov 16, 2022 | Decision MakingMulti-Objective Reinforcement Learning | —Unverified | 0 |