| Adapting Double Q-Learning for Continuous Reinforcement Learning | Sep 25, 2023 | MuJoCo, Q-Learning | —Unverified | 0 | 0 |
| Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback | Jun 20, 2023 | MuJoCo, Q-Learning | —Unverified | 0 | 0 |
| Adaptive Knowledge-based Multi-Objective Evolutionary Algorithm for Hybrid Flow Shop Scheduling Problems with Multiple Parallel Batch Processing Stages | Sep 27, 2024 | Q-Learning, Scheduling | —Unverified | 0 | 0 |
| Adaptive Modulation and Coding based on Reinforcement Learning for 5G Networks | Nov 25, 2019 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Adaptive Q-learning for Interaction-Limited Reinforcement Learning | Sep 29, 2021 | Offline RL, Q-Learning | —Unverified | 0 | 0 |
| Adaptive Services Function Chain Orchestration For Digital Health Twin Use Cases: Heuristic-boosted Q-Learning Approach | Apr 25, 2023 | Q-Learning, Scheduling | —Unverified | 0 | 0 |
| Adaptive Stochastic Resource Control: A Machine Learning Approach | Jan 15, 2014 | BIG-bench Machine Learning, Clustering | —Unverified | 0 | 0 |
| Adaptive Structural Hyper-Parameter Configuration by Q-Learning | Mar 2, 2020 | Evolutionary Algorithms, Q-Learning | —Unverified | 0 | 0 |
| A Data-Ensemble-Based Approach for Sample-Efficient LQ Control of Linear Time-Varying Systems | Jun 30, 2025 | Q-Learning | —Unverified | 0 | 0 |
| Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets | Jan 1, 2021 | D4RL, MuJoCo | —Unverified | 0 | 0 |
| Addressing the issue of stochastic environments and local decision-making in multi-objective reinforcement learning | Nov 16, 2022 | Decision Making, Multi-Objective Reinforcement Learning | —Unverified | 0 | 0 |
| A Deep Learning Inference Scheme Based on Pipelined Matrix Multiplication Acceleration Design and Non-uniform Quantization | Oct 10, 2021 | Edge-computing, Q-Learning | —Unverified | 0 | 0 |
| A deep Q-Learning based Path Planning and Navigation System for Firefighting Environments | Nov 12, 2020 | Q-Learning | —Unverified | 0 | 0 |
| A Deep Q-Learning based Smart Scheduling of EVs for Demand Response in Smart Grids | Jan 5, 2024 | Q-Learning, Scheduling | —Unverified | 0 | 0 |
| A Deep Q-learning/genetic Algorithms Based Novel Methodology For Optimizing Covid-19 Pandemic Government Actions | May 15, 2020 | Q-Learning | —Unverified | 0 | 0 |
| A Deep Q-Learning Method for Downlink Power Allocation in Multi-Cell Networks | Apr 30, 2019 | Benchmarking, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| A deep Q-learning method for optimizing visual search strategies in backgrounds of dynamic noise | Jan 28, 2022 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| A Deep Reinforcement Learning Approach towards Pendulum Swing-up Problem based on TF-Agents | Jun 17, 2021 | Deep Reinforcement Learning, Position | —Unverified | 0 | 0 |
| A Deep Reinforcement Learning Approach for Interactive Search with Sentence-level Feedback | Oct 3, 2023 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| A Deep Reinforcement Learning Approach for Adaptive Traffic Routing in Next-gen Networks | Feb 7, 2024 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| A Deep Reinforcement Learning Approach to Efficient Drone Mobility Support | May 11, 2020 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| A Deep Reinforcement Learning Approach to Battery Management in Dairy Farming via Proximal Policy Optimization | Jul 1, 2024 | Deep Reinforcement Learning, energy management | —Unverified | 0 | 0 |
| A Deep Reinforcement Learning Architecture for Multi-stage Optimal Control | Nov 25, 2019 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| A Deep Reinforcement Learning Framework for Contention-Based Spectrum Sharing | Oct 5, 2021 | Deep Reinforcement Learning, Fairness | —Unverified | 0 | 0 |
| A Deep Reinforcement Learning Trader without Offline Training | Mar 1, 2023 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| A Differentiable Physics Engine for Deep Learning in Robotics | Nov 5, 2016 | CPU, Deep Learning | —Unverified | 0 | 0 |
| A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms | Mar 27, 2020 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| A Double Q-Learning Approach for Navigation of Aerial Vehicles with Connectivity Constraint | Feb 24, 2020 | Q-Learning, Reinforcement Learning | —Unverified | 0 | 0 |
| A Dual-Hormone Closed-Loop Delivery System for Type 1 Diabetes Using Deep Reinforcement Learning | Oct 9, 2019 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Advancing Algorithmic Trading: A Multi-Technique Enhancement of Deep Q-Network Models | Nov 9, 2023 | Algorithmic Trading, Q-Learning | —Unverified | 0 | 0 |
| Advancing ECG Diagnosis Using Reinforcement Learning on Global Waveform Variations Related to P Wave and PR Interval | Jan 10, 2024 | Q-Learning, Rhythm | —Unverified | 0 | 0 |
| Advancing Forest Fire Prevention: Deep Reinforcement Learning for Effective Firebreak Placement | Apr 12, 2024 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Adversarial Agents For Attacking Inaudible Voice Activated Devices | Jul 23, 2023 | CyberBattleSim, Q-Learning | —Unverified | 0 | 0 |
| Aerial Base Station Positioning and Power Control for Securing Communications: A Deep Q-Network Approach | Dec 21, 2021 | Position, Q-Learning | —Unverified | 0 | 0 |
| A Family of Cognitively Realistic Parsing Environments for Deep Reinforcement Learning | Jan 16, 2022 | Deep Reinforcement Learning, Hierarchical Reinforcement Learning | —Unverified | 0 | 0 |
| A Finite Sample Complexity Bound for Distributionally Robust Q-learning | Feb 26, 2023 | Q-Learning | —Unverified | 0 | 0 |
| A finite time analysis of distributed Q-learning | May 23, 2024 | Decision Making, Multi-agent Reinforcement Learning | —Unverified | 0 | 0 |
| A Finite-Time Analysis of Q-Learning with Neural Network Function Approximation | Dec 10, 2019 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| A Finite Time Analysis of Temporal Difference Learning With Linear Function Approximation | Jun 6, 2018 | Q-Learning, Reinforcement Learning | —Unverified | 0 | 0 |
| A Flexible Framework for Incorporating Patient Preferences Into Q-Learning | Jul 22, 2023 | Q-Learning | —Unverified | 0 | 0 |
| A Framework for Provably Stable and Consistent Training of Deep Feedforward Networks | May 20, 2023 | Q-Learning, reinforcement-learning | —Unverified | 0 | 0 |
| A General Control-Theoretic Approach for Reinforcement Learning: Theory and Algorithms | Jun 20, 2024 | Learning Theory, Q-Learning | —Unverified | 0 | 0 |
| A General Framework for Learning Mean-Field Games | Mar 13, 2020 | Decision Making, Multi-agent Reinforcement Learning | —Unverified | 0 | 0 |
| A General-Purpose Theorem for High-Probability Bounds of Stochastic Approximation with Polyak Averaging | May 27, 2025 | Q-Learning | —Unverified | 0 | 0 |
| Agent-state based policies in POMDPs: Beyond belief-state MDPs | Sep 24, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Age of Information Minimization using Multi-agent UAVs based on AI-Enhanced Mean Field Resource Allocation | Apr 24, 2024 | Q-Learning, Scheduling | —Unverified | 0 | 0 |
| Age-of-information minimization via opportunistic sampling by an energy harvesting source | Jan 8, 2022 | Q-Learning | —Unverified | 0 | 0 |
| Age of Trust (AoT): A Continuous Verification Framework for Wireless Networks | Jun 4, 2024 | Philosophy, Q-Learning | —Unverified | 0 | 0 |
| A Geometric Nash Approach in Tuning the Learning Rate in Q-Learning Algorithm | Aug 9, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance | Nov 17, 2021 | continuous-control, Continuous Control | —Unverified | 0 | 0 |