| Gradient Temporal-Difference Learning with Regularized Corrections | Jul 1, 2020 | Q-Learning | CodeCode Available | 1 |
| Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time Systems with Lipschitz Continuous Controls | Oct 27, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Hybrid RL: Using Both Offline and Online Data Can Make RL Efficient | Oct 13, 2022 | Montezuma's RevengeQ-Learning | CodeCode Available | 1 |
| A Hysteretic Q-learning Coordination Framework for Emerging Mobility Systems in Smart Cities | Nov 5, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A Hybrid Q-Learning Sine-Cosine-based Strategy for Addressing the Combinatorial Test Suite Minimization Problem | Apr 27, 2018 | Q-Learning | —Unverified | 0 |
| Adaptive Stochastic Resource Control: A Machine Learning Approach | Jan 15, 2014 | BIG-bench Machine LearningClustering | —Unverified | 0 |
| A Hybrid PAC Reinforcement Learning Algorithm | Sep 5, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A Graph Attention Learning Approach to Antenna Tilt Optimization | Dec 27, 2021 | Graph AttentionQ-Learning | —Unverified | 0 |
| Adaptive Services Function Chain Orchestration For Digital Health Twin Use Cases: Heuristic-boosted Q-Learning Approach | Apr 25, 2023 | Q-LearningScheduling | —Unverified | 0 |
| A Comparison of Classical and Deep Reinforcement Learning Methods for HVAC Control | Aug 10, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Agnostic Q-learning with Function Approximation in Deterministic Systems: Near-Optimal Bounds on Approximation Error and Sample Complexity | Dec 1, 2020 | Q-Learning | —Unverified | 0 |
| Agnostic Q-learning with Function Approximation in Deterministic Systems: Tight Bounds on Approximation Error and Sample Complexity | Feb 17, 2020 | Q-Learning | —Unverified | 0 |
| Adaptive Q-learning for Interaction-Limited Reinforcement Learning | Sep 29, 2021 | Offline RLQ-Learning | —Unverified | 0 |
| Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance | Nov 17, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| A Geometric Nash Approach in Tuning the Learning Rate in Q-Learning Algorithm | Aug 9, 2024 | Q-Learning | —Unverified | 0 |
| Adaptive Modulation and Coding based on Reinforcement Learning for 5G Networks | Nov 25, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A Comparative Study of AI-based Intrusion Detection Techniques in Critical Infrastructures | Jul 24, 2020 | Intrusion DetectionManagement | —Unverified | 0 |
| QADQN: Quantum Attention Deep Q-Network for Financial Market Prediction | Aug 6, 2024 | Decision MakingQ-Learning | —Unverified | 0 |
| Age of Trust (AoT): A Continuous Verification Framework for Wireless Networks | Jun 4, 2024 | PhilosophyQ-Learning | —Unverified | 0 |
| Age-of-information minimization via opportunistic sampling by an energy harvesting source | Jan 8, 2022 | Q-Learning | —Unverified | 0 |
| Adaptive Knowledge-based Multi-Objective Evolutionary Algorithm for Hybrid Flow Shop Scheduling Problems with Multiple Parallel Batch Processing Stages | Sep 27, 2024 | Q-LearningScheduling | —Unverified | 0 |
| Age of Information Minimization using Multi-agent UAVs based on AI-Enhanced Mean Field Resource Allocation | Apr 24, 2024 | Q-LearningScheduling | —Unverified | 0 |
| Agent-state based policies in POMDPs: Beyond belief-state MDPs | Sep 24, 2024 | Q-Learning | —Unverified | 0 |
| Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback | Jun 20, 2023 | MuJoCoQ-Learning | —Unverified | 0 |
| A Comparative Analysis of Portfolio Optimization Using Mean-Variance, Hierarchical Risk Parity, and Reinforcement Learning Approaches on the Indian Stock Market | May 27, 2023 | Portfolio OptimizationQ-Learning | —Unverified | 0 |
| A Comparative Analysis of Deep Reinforcement Learning-enabled Freeway Decision-making for Automated Vehicles | Aug 4, 2020 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| A General-Purpose Theorem for High-Probability Bounds of Stochastic Approximation with Polyak Averaging | May 27, 2025 | Q-Learning | —Unverified | 0 |
| Reinforcement Learning for an Efficient and Effective Malware Investigation during Cyber Incident Response | Aug 4, 2024 | Decision MakingMalware Analysis | —Unverified | 0 |
| A General Framework for Learning Mean-Field Games | Mar 13, 2020 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| A General Control-Theoretic Approach for Reinforcement Learning: Theory and Algorithms | Jun 20, 2024 | Learning TheoryQ-Learning | —Unverified | 0 |
| A Reinforcement Learning Perspective on the Optimal Control of Mutation Probabilities for the (1+1) Evolutionary Algorithm: First Results on the OneMax Problem | May 9, 2019 | Evolutionary AlgorithmsQ-Learning | —Unverified | 0 |
| A storage expansion planning framework using reinforcement learning and simulation-based optimization | Jan 10, 2020 | Decision MakingQ-Learning | —Unverified | 0 |
| A short variational proof of equivalence between policy gradients and soft Q learning | Dec 22, 2017 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Adapting Double Q-Learning for Continuous Reinforcement Learning | Sep 25, 2023 | MuJoCoQ-Learning | —Unverified | 0 |
| A Framework for Provably Stable and Consistent Training of Deep Feedforward Networks | May 20, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Toward Packet Routing with Fully-distributed Multi-agent Deep Reinforcement Learning | May 9, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Actuator Trajectory Planning for UAVs with Overhead Manipulator using Reinforcement Learning | Aug 24, 2023 | Motion PlanningNavigate | —Unverified | 0 |
| A Flexible Framework for Incorporating Patient Preferences Into Q-Learning | Jul 22, 2023 | Q-Learning | —Unverified | 0 |
| ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning | Dec 22, 2024 | D4RLQ-Learning | —Unverified | 0 |
| Artificial Intelligence and Auction Design | Feb 12, 2022 | Q-Learning | —Unverified | 0 |
| A Finite Time Analysis of Temporal Difference Learning With Linear Function Approximation | Jun 6, 2018 | Q-LearningReinforcement Learning | —Unverified | 0 |
| A Finite-Time Analysis of Q-Learning with Neural Network Function Approximation | Dec 10, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Achieving Stable Training of Reinforcement Learning Agents in Bimodal Environments through Batch Learning | Jul 3, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A finite time analysis of distributed Q-learning | May 23, 2024 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| A Finite Sample Complexity Bound for Distributionally Robust Q-learning | Feb 26, 2023 | Q-Learning | —Unverified | 0 |
| Active Perception and Representation for Robotic Manipulation | Mar 15, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 |
| An Agile Adaptation Method for Multi-mode Vehicle Communication Networks | Jul 18, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Artificial Intelligence and Dual Contract | Mar 22, 2023 | Q-Learning | —Unverified | 0 |
| A Family of Cognitively Realistic Parsing Environments for Deep Reinforcement Learning | Jan 16, 2022 | Deep Reinforcement LearningHierarchical Reinforcement Learning | —Unverified | 0 |
| Active Measure Reinforcement Learning for Observation Cost Minimization | May 26, 2020 | Decision MakingQ-Learning | —Unverified | 0 |