| Double Q-PID algorithm for mobile robot control | Nov 1, 2018 | Active LearningQ-Learning | CodeCode Available | 0 | 5 |
| A Deep Q-Learning Agent for the L-Game with Variable Batch Training | Feb 17, 2018 | Q-LearningSelf-Learning | CodeCode Available | 0 | 5 |
| Double Successive Over-Relaxation Q-Learning with an Extension to Deep Reinforcement Learning | Sep 10, 2024 | Deep Reinforcement LearningOpenAI Gym | CodeCode Available | 0 | 5 |
| A Machine with Short-Term, Episodic, and Semantic Memory Systems | Dec 5, 2022 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 0 | 5 |
| Collaborative Multi-BS Power Management for Dense Radio Access Network using Deep Reinforcement Learning | Apr 17, 2023 | Deep Reinforcement LearningManagement | CodeCode Available | 0 | 5 |
| An intelligent financial portfolio trading strategy using deep Q-learning | Jul 8, 2019 | Q-Learning | CodeCode Available | 0 | 5 |
| DRL4AOI: A DRL Framework for Semantic-aware AOI Segmentation in Location-Based Services | Dec 6, 2024 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Combining No-regret and Q-learning | Oct 7, 2019 | counterfactualQ-Learning | CodeCode Available | 0 | 5 |
| A disembodied developmental robotic agent called Samu Bátfai | Nov 9, 2015 | Q-LearningReinforcement Learning | CodeCode Available | 0 | 5 |
| Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement Learning | Jul 5, 2021 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Goal-Conditioned Q-Learning as Knowledge Distillation | Aug 28, 2022 | Knowledge DistillationQ-Learning | CodeCode Available | 0 | 5 |
| DeepTraffic: Crowdsourced Hyperparameter Tuning of Deep Reinforcement Learning Systems for Multi-Agent Dense Traffic Navigation | Jan 9, 2018 | Autonomous DrivingAutonomous Navigation | CodeCode Available | 0 | 5 |
| DeepTPI: Test Point Insertion with Deep Reinforcement Learning | Jun 7, 2022 | Deep Reinforcement LearningGraph Neural Network | CodeCode Available | 0 | 5 |
| Meta-Value Learning: a General Framework for Learning with Learning Awareness | Jul 17, 2023 | Q-Learning | CodeCode Available | 0 | 5 |
| Active inference: demystified and compared | Sep 24, 2019 | Atari GamesOpenAI Gym | CodeCode Available | 0 | 5 |
| Balancing Value Underestimation and Overestimation with Realistic Actor-Critic | Oct 19, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| A Deep Learning Approach to Grasping the Invisible | Sep 11, 2019 | Deep LearningQ-Learning | CodeCode Available | 0 | 5 |
| Compressed Federated Reinforcement Learning with a Generative Model | Mar 26, 2024 | modelQ-Learning | CodeCode Available | 0 | 5 |
| Designing Neural Network Architectures using Reinforcement Learning | Nov 7, 2016 | General Classificationimage-classification | CodeCode Available | 0 | 5 |
| Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning | Jul 8, 2021 | Hierarchical Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Balancing Rational and Other-Regarding Preferences in Cooperative-Competitive Environments | Feb 24, 2021 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Momentum-based Accelerated Q-learning | Oct 23, 2019 | Atari GamesQ-Learning | CodeCode Available | 0 | 5 |
| Deep-Q Learning with Hybrid Quantum Neural Network on Solving Maze Problems | Apr 20, 2023 | Q-Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning with a Natural Language Action Space | Nov 14, 2015 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning for Traffic Light Control in Vehicular Networks | Mar 29, 2018 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Deep reinforcement learning for time series: playing idealized trading games | Mar 11, 2018 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods | Feb 28, 2018 | Deep Reinforcement LearningDiversity | CodeCode Available | 0 | 5 |
| Deterministic Implementations for Reproducibility in Deep Reinforcement Learning | Sep 15, 2018 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning | Oct 27, 2019 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 | 5 |
| AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization | May 28, 2024 | D4RLOffline RL | CodeCode Available | 0 | 5 |
| Conservative and Risk-Aware Offline Multi-Agent Reinforcement Learning | Feb 13, 2024 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning for Control of Probabilistic Boolean Networks | Sep 7, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning for Imbalanced Classification | Jan 5, 2019 | ClassificationDecision Making | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning for Multi-class Imbalanced Training | May 24, 2022 | Deep Reinforcement Learningimbalanced classification | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning Algorithms for Option Hedging | Apr 7, 2025 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Deep Recurrent Q-Learning vs Deep Q-Learning on a simple Partially Observable Markov Decision Process with Minecraft | Mar 11, 2019 | MinecraftQ-Learning | CodeCode Available | 0 | 5 |
| Deep Quality-Value (DQV) Learning | Sep 30, 2018 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| DeepQTest: Testing Autonomous Driving Systems with Reinforcement Learning and Real-world Weather Data | Oct 8, 2023 | Autonomous DrivingQ-Learning | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning Based Parameter Control in Differential Evolution | May 20, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning for Optimal Stopping with Application in Financial Engineering | May 19, 2021 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Deep Q-learning: a robust control approach | Jan 21, 2022 | OpenAI GymQ-Learning | CodeCode Available | 0 | 5 |
| Deep Ordinal Reinforcement Learning | May 6, 2019 | Deep Reinforcement LearningOpenAI Gym | CodeCode Available | 0 | 5 |
| Orchestrated Value Mapping for Reinforcement Learning | Mar 14, 2022 | Ensemble LearningQ-Learning | CodeCode Available | 0 | 5 |
| Deep Q-Learning based Reinforcement Learning Approach for Network Intrusion Detection | Nov 27, 2021 | Intrusion DetectionNetwork Intrusion Detection | CodeCode Available | 0 | 5 |
| Automaton-Guided Curriculum Generation for Reinforcement Learning Agents | Apr 11, 2023 | Decision MakingQ-Learning | CodeCode Available | 0 | 5 |
| Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation | Jul 24, 2023 | GPUQ-Learning | CodeCode Available | 0 | 5 |
| PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning | Jul 16, 2020 | Policy Gradient MethodsQ-Learning | CodeCode Available | 0 | 5 |
| Performing Deep Recurrent Double Q-Learning for Atari Games | Aug 16, 2019 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| ADDQ: Adaptive Distributional Double Q-Learning | Jun 24, 2025 | Distributional Reinforcement LearningMuJoCo | CodeCode Available | 0 | 5 |
| Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning | Dec 18, 2017 | Deep Reinforcement LearningEvolutionary Algorithms | CodeCode Available | 0 | 5 |