| Bridging the Gap Between Target Networks and Functional Regularization | Jun 4, 2021 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Dual Ensembled Multiagent Q-Learning with Hypernet Regularizer | Feb 4, 2025 | Q-LearningSMAC | CodeCode Available | 0 | 5 |
| Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement Learning | Jul 5, 2021 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Deterministic Implementations for Reproducibility in Deep Reinforcement Learning | Sep 15, 2018 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Designing Neural Network Architectures using Reinforcement Learning | Nov 7, 2016 | General Classificationimage-classification | CodeCode Available | 0 | 5 |
| A Multi-Agent Multi-Environment Mixed Q-Learning for Partially Decentralized Wireless Network Optimization | Sep 24, 2024 | Q-Learning | CodeCode Available | 0 | 5 |
| Active inference: demystified and compared | Sep 24, 2019 | Atari GamesOpenAI Gym | CodeCode Available | 0 | 5 |
| BlockQNN: Efficient Block-wise Neural Network Architecture Generation | Aug 16, 2018 | GPUimage-classification | CodeCode Available | 0 | 5 |
| Double Q-PID algorithm for mobile robot control | Nov 1, 2018 | Active LearningQ-Learning | CodeCode Available | 0 | 5 |
| Double Successive Over-Relaxation Q-Learning with an Extension to Deep Reinforcement Learning | Sep 10, 2024 | Deep Reinforcement LearningOpenAI Gym | CodeCode Available | 0 | 5 |
| Diagnosing Bottlenecks in Deep Q-learning Algorithms | Feb 26, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| DeepTPI: Test Point Insertion with Deep Reinforcement Learning | Jun 7, 2022 | Deep Reinforcement LearningGraph Neural Network | CodeCode Available | 0 | 5 |
| Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations | Mar 6, 2024 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 0 | 5 |
| Boosting Soft Q-Learning by Bounding | Jun 26, 2024 | Q-Learning | CodeCode Available | 0 | 5 |
| DeepTraffic: Crowdsourced Hyperparameter Tuning of Deep Reinforcement Learning Systems for Multi-Agent Dense Traffic Navigation | Jan 9, 2018 | Autonomous DrivingAutonomous Navigation | CodeCode Available | 0 | 5 |
| Bootstrapped Meta-Learning | Sep 9, 2021 | Efficient ExplorationFew-Shot Learning | CodeCode Available | 0 | 5 |
| Bootstrapped Q-learning with Context Relevant Observation Pruning to Generalize in Text-based Games | Sep 24, 2020 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 0 | 5 |
| Efficient Model-free Reinforcement Learning in Metric Spaces | May 1, 2019 | Q-Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning for Traffic Light Control in Vehicular Networks | Mar 29, 2018 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Deep reinforcement learning for time series: playing idealized trading games | Mar 11, 2018 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods | Feb 28, 2018 | Deep Reinforcement LearningDiversity | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning for Multi-class Imbalanced Training | May 24, 2022 | Deep Reinforcement Learningimbalanced classification | CodeCode Available | 0 | 5 |
| A Deep Q-Learning Agent for the L-Game with Variable Batch Training | Feb 17, 2018 | Q-LearningSelf-Learning | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning for Optimal Stopping with Application in Financial Engineering | May 19, 2021 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Deep-Q Learning with Hybrid Quantum Neural Network on Solving Maze Problems | Apr 20, 2023 | Q-Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning Based Parameter Control in Differential Evolution | May 20, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Action Candidate Based Clipped Double Q-learning for Discrete and Continuous Action Tasks | May 3, 2021 | Q-Learning | CodeCode Available | 0 | 5 |
| A Machine with Short-Term, Episodic, and Semantic Memory Systems | Dec 5, 2022 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 0 | 5 |
| Generalized Speedy Q-learning | Nov 1, 2019 | Q-LearningReinforcement Learning | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning for Control of Probabilistic Boolean Networks | Sep 7, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| An Empirical Study of Deep Reinforcement Learning in Continuing Tasks | Jan 12, 2025 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 | 5 |
| GHQ: Grouped Hybrid Q Learning for Heterogeneous Cooperative Multi-agent Reinforcement Learning | Mar 2, 2023 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Goal Recognition as Reinforcement Learning | Feb 13, 2022 | Q-Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning Algorithms for Option Hedging | Apr 7, 2025 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning for Imbalanced Classification | Jan 5, 2019 | ClassificationDecision Making | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning with a Natural Language Action Space | Nov 14, 2015 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation | Sep 1, 2021 | Deep Reinforcement LearningGeneral Reinforcement Learning | CodeCode Available | 0 | 5 |
| Heuristics, Answer Set Programming and Markov Decision Process for Solving a Set of Spatial Puzzles | Feb 16, 2019 | Q-LearningReinforcement Learning | CodeCode Available | 0 | 5 |
| Deep Q-Learning for Nash Equilibria: Nash-DQN | Apr 23, 2019 | Q-LearningReinforcement Learning | CodeCode Available | 0 | 5 |
| Deep Q learning for fooling neural networks | Nov 13, 2018 | Q-LearningReinforcement Learning | CodeCode Available | 0 | 5 |
| Deep Q-learning from Demonstrations | Apr 12, 2017 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Implications of Decentralized Q-learning Resource Allocation in Wireless Networks | May 30, 2017 | Q-LearningReinforcement Learning | CodeCode Available | 0 | 5 |
| Increasing the Action Gap: New Operators for Reinforcement Learning | Dec 15, 2015 | Atari GamesQ-Learning | CodeCode Available | 0 | 5 |
| Information-Directed Exploration for Deep Reinforcement Learning | Dec 18, 2018 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Instance Weighted Incremental Evolution Strategies for Reinforcement Learning in Dynamic Environments | Oct 9, 2020 | Incremental LearningQ-Learning | CodeCode Available | 0 | 5 |
| Policy Iterations for Reinforcement Learning Problems in Continuous Time and Space -- Fundamental Theory and Methods | May 9, 2017 | Decision MakingQ-Learning | CodeCode Available | 0 | 5 |
| Balancing Value Underestimation and Overestimation with Realistic Actor-Critic | Oct 19, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| A Deep Learning Approach to Grasping the Invisible | Sep 11, 2019 | Deep LearningQ-Learning | CodeCode Available | 0 | 5 |
| Angrier Birds: Bayesian reinforcement learning | Jan 6, 2016 | Efficient ExplorationQ-Learning | CodeCode Available | 0 | 5 |
| Deep Q-Learning based Reinforcement Learning Approach for Network Intrusion Detection | Nov 27, 2021 | Intrusion DetectionNetwork Intrusion Detection | CodeCode Available | 0 | 5 |