| UCB Momentum Q-learning: Correcting the bias without forgetting | Mar 1, 2021 | Q-Learning | CodeCode Available | 0 |
| PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning | Jul 16, 2020 | Policy Gradient MethodsQ-Learning | CodeCode Available | 0 |
| Performing Deep Recurrent Double Q-Learning for Atari Games | Aug 16, 2019 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Model-free and Bayesian Ensembling Model-based Deep Reinforcement Learning for Particle Accelerator Control Demonstrated on the FERMI FEL | Dec 17, 2020 | Deep Reinforcement Learningmodel | CodeCode Available | 0 |
| Single-partition adaptive Q-learning | Jul 14, 2020 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| Active inference: demystified and compared | Sep 24, 2019 | Atari GamesOpenAI Gym | CodeCode Available | 0 |
| BlockQNN: Efficient Block-wise Neural Network Architecture Generation | Aug 16, 2018 | GPUimage-classification | CodeCode Available | 0 |
| From Two-Dimensional to Three-Dimensional Environment with Q-Learning: Modeling Autonomous Navigation with Reinforcement Learning and no Libraries | Mar 27, 2024 | Autonomous NavigationDecision Making | CodeCode Available | 0 |
| Using Reward Machines for High-Level Task Specification and Decomposition in Reinforcement Learning | Jul 1, 2018 | Hierarchical Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Model-free Motion Planning of Autonomous Agents for Complex Tasks in Partially Observable Environments | Apr 30, 2023 | Motion PlanningQ-Learning | CodeCode Available | 0 |
| Automatic Data Augmentation by Learning the Deterministic Policy | Oct 18, 2019 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 0 |
| DeepTraffic: Crowdsourced Hyperparameter Tuning of Deep Reinforcement Learning Systems for Multi-Agent Dense Traffic Navigation | Jan 9, 2018 | Autonomous DrivingAutonomous Navigation | CodeCode Available | 0 |
| GAN Q-learning | May 13, 2018 | Distributional Reinforcement LearningOpenAI Gym | CodeCode Available | 0 |
| Personalized Exercise Recommendation with Semantically-Grounded Knowledge Tracing | Jul 15, 2025 | Knowledge TracingMath | CodeCode Available | 0 |
| Revisiting Fundamentals of Experience Replay | Jul 13, 2020 | Deep Reinforcement LearningDQN Replay Dataset | CodeCode Available | 0 |
| Bridging the Gap Between Target Networks and Functional Regularization | Jun 4, 2021 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Comprehensible Context-driven Text Game Playing | May 6, 2019 | Q-Learning | CodeCode Available | 0 |
| Generalized Speedy Q-learning | Nov 1, 2019 | Q-LearningReinforcement Learning | CodeCode Available | 0 |
| Generalized Value Iteration Networks: Life Beyond Lattices | Jun 8, 2017 | Q-Learning | CodeCode Available | 0 |
| Generating a Graph Colouring Heuristic with Deep Q-Learning and Graph Neural Networks | Apr 8, 2023 | Deep Reinforcement LearningGraph Neural Network | CodeCode Available | 0 |
| Revisiting Prioritized Experience Replay: A Value Perspective | Feb 5, 2021 | Atari GamesQ-Learning | CodeCode Available | 0 |
| Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning | Dec 18, 2017 | Deep Reinforcement LearningEvolutionary Algorithms | CodeCode Available | 0 |
| GHQ: Grouped Hybrid Q Learning for Heterogeneous Cooperative Multi-agent Reinforcement Learning | Mar 2, 2023 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Revisiting the Softmax Bellman Operator: New Benefits and New Perspective | Dec 2, 2018 | Atari GamesQ-Learning | CodeCode Available | 0 |
| Deep Jump Learning for Off-Policy Evaluation in Continuous Treatment Settings | Oct 29, 2020 | Change Point DetectionOff-policy evaluation | CodeCode Available | 0 |
| Goal-Conditioned Q-Learning as Knowledge Distillation | Aug 28, 2022 | Knowledge DistillationQ-Learning | CodeCode Available | 0 |
| Reward Delay Attacks on Deep Reinforcement Learning | Sep 8, 2022 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Goal Recognition as Reinforcement Learning | Feb 13, 2022 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Traceable Group-Wise Self-Optimizing Feature Transformation Learning: A Dual Optimization Perspective | Jun 29, 2023 | Feature EngineeringQ-Learning | CodeCode Available | 0 |
| DeepTPI: Test Point Insertion with Deep Reinforcement Learning | Jun 7, 2022 | Deep Reinforcement LearningGraph Neural Network | CodeCode Available | 0 |
| Composable Deep Reinforcement Learning for Robotic Manipulation | Mar 19, 2018 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Graph Backup: Data Efficient Backup Exploiting Markovian Transitions | May 31, 2022 | Atari Gamescounterfactual | CodeCode Available | 0 |
| Automata Learning meets Shielding | Dec 4, 2022 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| Dynamic-Weighted Simplex Strategy for Learning Enabled Cyber Physical Systems | Feb 6, 2019 | Autonomous DrivingQ-Learning | CodeCode Available | 0 |
| Momentum-based Accelerated Q-learning | Oct 23, 2019 | Atari GamesQ-Learning | CodeCode Available | 0 |
| SPRINQL: Sub-optimal Demonstrations driven Offline Imitation Learning | Feb 20, 2024 | Imitation LearningQ-Learning | CodeCode Available | 0 |
| Monte Carlo Q-learning for General Game Playing | Feb 16, 2018 | Board GamesQ-Learning | CodeCode Available | 0 |
| Deep Coordination Graphs | Sep 27, 2019 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Group Equivariant Deep Reinforcement Learning | Jul 1, 2020 | Deep Reinforcement LearningInductive Bias | CodeCode Available | 0 |
| Autoequivariant Network Search via Group Decomposition | Apr 10, 2021 | Inductive BiasNeural Architecture Search | CodeCode Available | 0 |
| Multi-Agent Advisor Q-Learning | Oct 26, 2021 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Action Candidate Driven Clipped Double Q-learning for Discrete and Continuous Action Tasks | Mar 22, 2022 | Q-Learning | CodeCode Available | 0 |
| Playing 2048 With Reinforcement Learning | Oct 20, 2021 | Playing the Game of 2048Q-Learning | CodeCode Available | 0 |
| Deep Active Inference for Pixel-Based Discrete Control: Evaluation on the Car Racing Problem | Sep 9, 2021 | Car RacingQ-Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning with a Natural Language Action Space | Nov 14, 2015 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL | Jul 20, 2024 | Few-Shot Text ClassificationQ-Learning | CodeCode Available | 0 |
| Decoding fairness: a reinforcement learning perspective | Dec 20, 2024 | FairnessImitation Learning | CodeCode Available | 0 |
| Deep-Q Learning with Hybrid Quantum Neural Network on Solving Maze Problems | Apr 20, 2023 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Multi-Agent Deep Reinforcement Learning for Dynamic Power Allocation in Wireless Networks | Aug 1, 2018 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Traffic Light Control with Reinforcement Learning | Aug 28, 2023 | Q-Learningreinforcement-learning | CodeCode Available | 0 |