| Active inference: demystified and compared | Sep 24, 2019 | Atari Games, OpenAI Gym | Code Available | 0 | 5 |
| Synthesis of Temporally-Robust Policies for Signal Temporal Logic Tasks using Reinforcement Learning | Dec 10, 2023 | Q-Learning | Code Available | 0 | 5 |
| Taming the Noise in Reinforcement Learning via Soft Updates | Dec 28, 2015 | Q-Learning, reinforcement-learning | Code Available | 0 | 5 |
| Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation | May 31, 2024 | Q-Learning | Code Available | 0 | 5 |
| Instance Weighted Incremental Evolution Strategies for Reinforcement Learning in Dynamic Environments | Oct 9, 2020 | Incremental Learning, Q-Learning | Code Available | 0 | 5 |
| Performing Deep Recurrent Double Q-Learning for Atari Games | Aug 16, 2019 | Atari Games, Deep Reinforcement Learning | Code Available | 0 | 5 |
| A Semantic-Aware Multiple Access Scheme for Distributed, Dynamic 6G-Based Applications | Jan 12, 2024 | Decision Making, Deep Reinforcement Learning | Code Available | 0 | 5 |
| The Effects of Memory Replay in Reinforcement Learning | Oct 18, 2017 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 | 5 |
| Agent Performing Autonomous Stock Trading under Good and Bad Situations | Jun 6, 2023 | Decision Making, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Towards Better Interpretability in Deep Q-Networks | Sep 15, 2018 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 | 5 |
| A Hysteretic Q-learning Coordination Framework for Emerging Mobility Systems in Smart Cities | Nov 5, 2020 | Q-Learning, reinforcement-learning | Unverified | 0 | 0 |
| A Tutorial Introduction to Reinforcement Learning | Apr 3, 2023 | Q-Learning, reinforcement-learning | Unverified | 0 | 0 |
| Attitude Control of Highly Maneuverable Aircraft Using an Improved Q-learning | Oct 22, 2022 | continuous-control, Continuous Control | Unverified | 0 | 0 |
| A Hybrid Q-Learning Sine-Cosine-based Strategy for Addressing the Combinatorial Test Suite Minimization Problem | Apr 27, 2018 | Q-Learning | Unverified | 0 | 0 |
| Adaptive Stochastic Resource Control: A Machine Learning Approach | Jan 15, 2014 | BIG-bench Machine Learning, Clustering | Unverified | 0 | 0 |
| A Theory of Regularized Markov Decision Processes | Jan 31, 2019 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 | 0 |
| A Theoretical Analysis of Deep Q-Learning | Jan 1, 2019 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 | 0 |
| A Hybrid PAC Reinforcement Learning Algorithm | Sep 5, 2020 | Q-Learning, reinforcement-learning | Unverified | 0 | 0 |
| A Technique to Create Weaker Abstract Board Game Agents via Reinforcement Learning | Sep 1, 2022 | Board Games, Q-Learning | Unverified | 0 | 0 |
| Asynchronous Stochastic Approximation and Average-Reward Reinforcement Learning | Sep 5, 2024 | Q-Learning, reinforcement-learning | Unverified | 0 | 0 |
| A Graph Attention Learning Approach to Antenna Tilt Optimization | Dec 27, 2021 | Graph Attention, Q-Learning | Unverified | 0 | 0 |
| Adaptive Services Function Chain Orchestration For Digital Health Twin Use Cases: Heuristic-boosted Q-Learning Approach | Apr 25, 2023 | Q-Learning, Scheduling | Unverified | 0 | 0 |
| A Comparison of Classical and Deep Reinforcement Learning Methods for HVAC Control | Aug 10, 2023 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 | 0 |
| Asynchronous Deep Double Duelling Q-Learning for Trading-Signal Execution in Limit Order Book Markets | Jan 20, 2023 | Deep Reinforcement Learning, Management | Unverified | 0 | 0 |
| Unsynchronized Decentralized Q-Learning: Two Timescale Analysis By Persistence | Aug 7, 2023 | Multi-agent Reinforcement Learning, Q-Learning | Unverified | 0 | 0 |
| Agnostic Q-learning with Function Approximation in Deterministic Systems: Near-Optimal Bounds on Approximation Error and Sample Complexity | Dec 1, 2020 | Q-Learning | Unverified | 0 | 0 |
| Asymptotics of Reinforcement Learning with Neural Networks | Nov 13, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 | 0 |
| Asymptotic regularity of a generalised stochastic Halpern scheme with applications | Nov 7, 2024 | Q-Learning, Stochastic Optimization | Unverified | 0 | 0 |
| Agnostic Q-learning with Function Approximation in Deterministic Systems: Tight Bounds on Approximation Error and Sample Complexity | Feb 17, 2020 | Q-Learning | Unverified | 0 | 0 |
| Adaptive Q-learning for Interaction-Limited Reinforcement Learning | Sep 29, 2021 | Offline RL, Q-Learning | Unverified | 0 | 0 |
| Deep Q Learning Driven CT Pancreas Segmentation with Geometry-Aware U-Net | Apr 19, 2019 | Medical Image Analysis, Pancreas Segmentation | Unverified | 0 | 0 |
| Asymptotic Convergence and Performance of Multi-Agent Q-Learning Dynamics | Jan 23, 2023 | Q-Learning | Unverified | 0 | 0 |
| Deep Q-Learning-based Distribution Network Reconfiguration for Reliability Improvement | May 2, 2023 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 | 0 |
| A review of motion planning algorithms for intelligent robotics | Feb 4, 2021 | Motion Planning, Q-Learning | Unverified | 0 | 0 |
| Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance | Nov 17, 2021 | continuous-control, Continuous Control | Unverified | 0 | 0 |
| Deep Primal-Dual Reinforcement Learning: Accelerating Actor-Critic using Bellman Duality | Dec 7, 2017 | Q-Learning, reinforcement-learning | Unverified | 0 | 0 |
| Deep Q-Learning for Directed Acyclic Graph Generation | Jun 5, 2019 | Deep Reinforcement Learning, Graph Generation | Unverified | 0 | 0 |
| A study on a Q-Learning algorithm application to a manufacturing assembly problem | Apr 17, 2023 | Decision Making, Q-Learning | Unverified | 0 | 0 |
| Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications | Feb 15, 2023 | Decision Making, Management | Unverified | 0 | 0 |
| Deep Q-Learning for Same-Day Delivery with Vehicles and Drones | Oct 25, 2019 | Decision Making, Q-Learning | Unverified | 0 | 0 |
| Deep Q-Learning for Self-Organizing Networks Fault Management and Radio Performance Improvement | Jul 10, 2017 | Deep Reinforcement Learning, Management | Unverified | 0 | 0 |
| A study of first-passage time minimization via Q-learning in heated gridworlds | Oct 5, 2021 | Q-Learning, reinforcement-learning | Unverified | 0 | 0 |
| Deep Q Learning from Dynamic Demonstration with Behavioral Cloning | Jan 1, 2021 | Deep Reinforcement Learning, OpenAI Gym | Unverified | 0 | 0 |
| Deep Q-Learning Market Makers in a Multi-Agent Simulated Stock Market | Dec 8, 2021 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0 | 0 |
| Deep Q-learning of global optimizer of multiply model parameters for viscoelastic imaging | Apr 1, 2022 | Decision Making, Diagnostic | Unverified | 0 | 0 |
| Deep Q-Learning versus Proximal Policy Optimization: Performance Comparison in a Material Sorting Task | Jun 2, 2023 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 | 0 |
| Deep Q-Learning with Gradient Target Tracking | Mar 20, 2025 | Q-Learning | Unverified | 0 | 0 |
| Deep Q-Learning with Low Switching Cost | Jan 1, 2021 | Atari Games, Deep Reinforcement Learning | Unverified | 0 | 0 |
| Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire Evacuation Environment | May 23, 2019 | OpenAI Gym, Q-Learning | Unverified | 0 | 0 |
| A Geometric Nash Approach in Tuning the Learning Rate in Q-Learning Algorithm | Aug 9, 2024 | Q-Learning | Unverified | 0 | 0 |