| Double Successive Over-Relaxation Q-Learning with an Extension to Deep Reinforcement Learning | Sep 10, 2024 | Deep Reinforcement Learning, OpenAI Gym | Code Available | 0 |
| On Solving the 2-Dimensional Greedy Shooter Problem for UAVs | Nov 2, 2019 | Q-Learning, reinforcement-learning | Code Available | 0 |
| Q-Learning Lagrange Policies for Multi-Action Restless Bandits | Jun 22, 2021 | Multi-Armed Bandits, Q-Learning | Code Available | 0 |
| Balancing Rational and Other-Regarding Preferences in Cooperative-Competitive Environments | Feb 24, 2021 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 0 |
| ADDQ: Adaptive Distributional Double Q-Learning | Jun 24, 2025 | Distributional Reinforcement Learning, MuJoCo | Code Available | 0 |
| Learning To Play Atari Games Using Dueling Q-Learning and Hebbian Plasticity | May 22, 2024 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Learning to Play in a Day: Faster Deep Reinforcement Learning by Optimality Tightening | Nov 5, 2016 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Using deep Q-learning to understand the tax evasion behavior of risk-averse firms | Jan 29, 2018 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| On the Estimation Bias in Double Q-Learning | Sep 29, 2021 | Q-Learning, Value prediction | Code Available | 0 |
| Self-Learning Cloud Controllers: Fuzzy Q-Learning for Knowledge Evolution | Jul 2, 2015 | Q-Learning, Self-Learning | Code Available | 0 |
| Contextualized Hybrid Ensemble Q-learning: Learning Fast with Control Priors | Jun 28, 2024 | Car Racing, Q-Learning | Code Available | 0 |
| Self Punishment and Reward Backfill for Deep Q-Learning | Apr 10, 2020 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Enhancing Robot Assistive Behaviour with Reinforcement Learning and Theory of Mind | Nov 11, 2024 | Q-Learning | Code Available | 0 |
| Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement Learning | Jul 5, 2021 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation | Sep 29, 2017 | Deep Reinforcement Learning, Navigate | Code Available | 0 |
| Double Q-PID algorithm for mobile robot control | Nov 1, 2018 | Active Learning, Q-Learning | Code Available | 0 |
| Adaptive Symmetric Reward Noising for Reinforcement Learning | May 24, 2019 | Autonomous Driving, Q-Learning | Code Available | 0 |
| Learning Visual Tracking and Reaching with Deep Reinforcement Learning on a UR10e Robotic Arm | Aug 28, 2023 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Stochastic approximation with cone-contractive operators: Sharp $\ell_\infty$-bounds for Q-learning | May 15, 2019 | Q-Learning, Reinforcement Learning | Code Available | 0 |
| Distributionally Robust Deep Q-Learning | May 25, 2025 | Q-Learning | Code Available | 0 |
| Least-Squares Policy Iteration | Dec 4, 2003 | Q-Learning, reinforcement-learning | Code Available | 0 |
| Leveraging Digital Cousins for Ensemble Q-Learning in Large-Scale Wireless Networks | Feb 12, 2024 | Ensemble Learning, Management | Code Available | 0 |
| Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation | Sep 1, 2021 | Deep Reinforcement Learning, General Reinforcement Learning | Code Available | 0 |
| Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning | Oct 5, 2022 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods | Sep 22, 2021 | continuous-control, Continuous Control | Code Available | 0 |
| A critical assessment of reinforcement learning methods for microswimmer navigation in complex flows | May 8, 2025 | Autonomous Navigation, Hyperparameter Optimization | Code Available | 0 |
| Approximating two value functions instead of one: towards characterizing a new family of Deep Reinforcement Learning algorithms | Sep 1, 2019 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Multi-intention Inverse Q-learning for Interpretable Behavior Representation | Nov 23, 2023 | Decision Making, Q-Learning | Code Available | 0 |
| Reinforcement Learning with A* and a Deep Heuristic | Nov 19, 2018 | Q-Learning, reinforcement-learning | Code Available | 0 |
| Evolution of cooperation in a bimodal mixture of conditional cooperators | Feb 11, 2025 | Q-Learning | Code Available | 0 |
| Route Planning for Last-Mile Deliveries Using Mobile Parcel Lockers: A Hybrid Q-Learning Network Approach | Sep 9, 2022 | Q-Learning | Code Available | 0 |
| Logical Specifications-guided Dynamic Task Sampling for Reinforcement Learning Agents | Feb 6, 2024 | continuous-control, Continuous Control | Code Available | 0 |
| Distributed-Training-and-Execution Multi-Agent Reinforcement Learning for Power Control in HetNet | Dec 15, 2022 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Code Available | 0 |
| Examining Policy Entropy of Reinforcement Learning Agents for Personalization Tasks | Nov 21, 2022 | Q-Learning, reinforcement-learning | Code Available | 0 |
| Reinforcement Learning with Deep Energy-Based Policies | Feb 27, 2017 | Q-Learning, reinforcement-learning | Code Available | 0 |
| Scaling All-Goals Updates in Reinforcement Learning Using Convolutional Neural Networks | Oct 6, 2018 | All, Montezuma's Revenge | Code Available | 0 |
| Reinforcement Learning with Dynamic Boltzmann Softmax Updates | Mar 14, 2019 | Atari Games, Q-Learning | Code Available | 0 |
| Conservative and Risk-Aware Offline Multi-Agent Reinforcement Learning | Feb 13, 2024 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 0 |
| Explainable and Safe Reinforcement Learning for Autonomous Air Mobility | Nov 24, 2022 | Adversarial Attack, Deep Reinforcement Learning | Code Available | 0 |
| QMR:Q-learning based Multi-objective optimization Routing protocol for Flying Ad Hoc Networks | Nov 27, 2019 | Q-Learning | Code Available | 0 |
| Lookahead-Bounded Q-Learning | Jun 28, 2020 | Q-Learning | Code Available | 0 |
| Introspective Experience Replay: Look Back When Surprised | Jun 7, 2022 | Q-Learning, reinforcement-learning | Code Available | 0 |
| Welfare and Fairness in Multi-objective Reinforcement Learning | Nov 30, 2022 | Fairness, Multi-Objective Reinforcement Learning | Code Available | 0 |
| Low-rank State-action Value-function Approximation | Apr 18, 2021 | Q-Learning | Code Available | 0 |
| Deep Quality-Value (DQV) Learning | Sep 30, 2018 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| M^2DQN: A Robust Method for Accelerating Deep Q-learning Network | Sep 16, 2022 | Q-Learning, reinforcement-learning | Code Available | 0 |
| Adaptive Discretization for Episodic Reinforcement Learning in Metric Spaces | Oct 17, 2019 | Q-Learning, reinforcement-learning | Code Available | 0 |
| Bootstrapped Q-learning with Context Relevant Observation Pruning to Generalize in Text-based Games | Sep 24, 2020 | Q-Learning, Reinforcement Learning (RL) | Code Available | 0 |
| Exploring reinforcement learning techniques for discrete and continuous control tasks in the MuJoCo environment | Jul 20, 2023 | continuous-control, Continuous Control | Code Available | 0 |
| Diagnosing Bottlenecks in Deep Q-learning Algorithms | Feb 26, 2019 | continuous-control, Continuous Control | Code Available | 0 |