| Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic | Nov 7, 2016 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Factors of Influence of the Overestimation Bias of Q-Learning | Oct 11, 2022 | Q-Learning | CodeCode Available | 0 |
| MacLight: Multi-scene Aggregation Convolutional Learning for Traffic Signal Control | Dec 20, 2024 | Graph AttentionQ-Learning | CodeCode Available | 0 |
| ConQUR: Mitigating Delusional Bias in Deep Q-learning | Feb 27, 2020 | Atari GamesQ-Learning | CodeCode Available | 0 |
| DeepQTest: Testing Autonomous Driving Systems with Reinforcement Learning and Real-world Weather Data | Oct 8, 2023 | Autonomous DrivingQ-Learning | CodeCode Available | 0 |
| Making Deep Q-learning methods robust to time discretization | Jan 28, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Bootstrapped Meta-Learning | Sep 9, 2021 | Efficient ExplorationFew-Shot Learning | CodeCode Available | 0 |
| Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning | Jul 8, 2021 | Hierarchical Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Deep Q-learning from Demonstrations | Apr 12, 2017 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Reinforcement Learning with Low-Complexity Liquid State Machines | Jun 4, 2019 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| The Effects of Memory Replay in Reinforcement Learning | Oct 18, 2017 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Compressed Federated Reinforcement Learning with a Generative Model | Mar 26, 2024 | modelQ-Learning | CodeCode Available | 0 |
| Deep Q-Learning for Nash Equilibria: Nash-DQN | Apr 23, 2019 | Q-LearningReinforcement Learning | CodeCode Available | 0 |
| Mastering Percolation-like Games with Deep Learning | May 12, 2023 | Deep LearningQ-Learning | CodeCode Available | 0 |
| Towards Symbolic Reinforcement Learning with Common Sense | Apr 23, 2018 | Common Sense ReasoningDeep Reinforcement Learning | CodeCode Available | 0 |
| A Novel Update Mechanism for Q-Networks Based On Extreme Learning Machines | Jun 4, 2020 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Deep Q learning for fooling neural networks | Nov 13, 2018 | Q-LearningReinforcement Learning | CodeCode Available | 0 |
| Deep Q-Learning based Reinforcement Learning Approach for Network Intrusion Detection | Nov 27, 2021 | Intrusion DetectionNetwork Intrusion Detection | CodeCode Available | 0 |
| An intelligent financial portfolio trading strategy using deep Q-learning | Jul 8, 2019 | Q-Learning | CodeCode Available | 0 |
| Efficient Parallel Reinforcement Learning Framework using the Reactor Model | Dec 7, 2023 | OpenAI GymQ-Learning | CodeCode Available | 0 |
| A Fairness-Oriented Reinforcement Learning Approach for the Operation and Control of Shared Micromobility Services | Mar 23, 2024 | FairnessQ-Learning | CodeCode Available | 0 |
| Remember and Forget for Experience Replay | Jul 16, 2018 | Deep Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 |
| Meta-Black-Box-Optimization through Offline Q-function Learning | May 4, 2025 | BenchmarkingMamba | CodeCode Available | 0 |
| UNIQ: Offline Inverse Q-learning for Avoiding Undesirable Demonstrations | Oct 10, 2024 | Imitation LearningQ-Learning | CodeCode Available | 0 |
| Meta-Q-Learning | Sep 30, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Meta-Value Learning: a General Framework for Learning with Learning Awareness | Jul 17, 2023 | Q-Learning | CodeCode Available | 0 |
| Adversarial Learning of a Sampler Based on an Unnormalized Distribution | Jan 3, 2019 | FormQ-Learning | CodeCode Available | 0 |
| Deep Q-learning: a robust control approach | Jan 21, 2022 | OpenAI GymQ-Learning | CodeCode Available | 0 |
| Deep Ordinal Reinforcement Learning | May 6, 2019 | Deep Reinforcement LearningOpenAI Gym | CodeCode Available | 0 |
| Imitating from auxiliary imperfect demonstrations via Adversarial Density Weighted Regression | May 28, 2024 | Imitation LearningMuJoCo | CodeCode Available | 0 |
| Orchestrated Value Mapping for Reinforcement Learning | Mar 14, 2022 | Ensemble LearningQ-Learning | CodeCode Available | 0 |
| Angrier Birds: Bayesian reinforcement learning | Jan 6, 2016 | Efficient ExplorationQ-Learning | CodeCode Available | 0 |
| BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning | Oct 27, 2019 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 |
| Offline Contextual Bandits with Overparameterized Models | Jun 27, 2020 | Multi-Armed BanditsQ-Learning | CodeCode Available | 0 |
| An Empirical Study of Deep Reinforcement Learning in Continuing Tasks | Jan 12, 2025 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Simulation of Nanorobots with Artificial Intelligence and Reinforcement Learning for Advanced Cancer Cell Detection and Tracking | Nov 4, 2024 | Cell DetectionNavigate | CodeCode Available | 0 |
| PairVDN - Pair-wise Decomposed Value Functions | Mar 12, 2025 | Q-Learning | CodeCode Available | 0 |
| Finite-Sample Analysis of Nonlinear Stochastic Approximation with Applications in Reinforcement Learning | May 27, 2019 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Simultaneous Double Q-learning with Conservative Advantage Learning for Actor-Critic Methods | May 8, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Mixed-Integer Optimal Control via Reinforcement Learning: A Case Study on Hybrid Electric Vehicle Energy Management | May 2, 2023 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Variation-resistant Q-learning: Controlling and Utilizing Estimation Bias in Reinforcement Learning for Better Performance | Feb 1, 2021 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation | Jul 24, 2023 | GPUQ-Learning | CodeCode Available | 0 |
| Parameter-free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients | Sep 24, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| RadDQN: a Deep Q Learning-based Architecture for Finding Time-efficient Minimum Radiation Exposure Pathway | Feb 1, 2024 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Boosting Soft Q-Learning by Bounding | Jun 26, 2024 | Q-Learning | CodeCode Available | 0 |
| Automaton-Guided Curriculum Generation for Reinforcement Learning Agents | Apr 11, 2023 | Decision MakingQ-Learning | CodeCode Available | 0 |
| Variations on the Reinforcement Learning performance of Blackjack | Aug 9, 2023 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Model-Free Adaptive Optimal Control of Episodic Fixed-Horizon Manufacturing Processes using Reinforcement Learning | Sep 18, 2018 | Model Predictive ControlQ-Learning | CodeCode Available | 0 |
| Deterministic Implementations for Reproducibility in Deep Reinforcement Learning | Sep 15, 2018 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Designing Neural Network Architectures using Reinforcement Learning | Nov 7, 2016 | General Classificationimage-classification | CodeCode Available | 0 |