| Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning | Oct 30, 2023 | Decision Making, Offline RL | Code Available | 1 |
| GAIL-PT: A Generic Intelligent Penetration Testing Framework with Generative Adversarial Imitation Learning | Apr 5, 2022 | Imitation Learning, Q-Learning | Code Available | 1 |
| Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time Systems with Lipschitz Continuous Controls | Oct 27, 2020 | continuous-control, Continuous Control | Code Available | 1 |
| HASCO: Towards Agile HArdware and Software CO-design for Tensor Computation | May 4, 2021 | Bayesian Optimization, Q-Learning | Code Available | 1 |
| Adaptive Contention Window Design using Deep Q-learning | Nov 18, 2020 | Q-Learning, Reinforcement Learning (RL) | Code Available | 1 |
| Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver? | Sep 26, 2019 | Feature Engineering, Q-Learning | Code Available | 1 |
| Is Q-learning Provably Efficient? | Jul 10, 2018 | Q-Learning, Reinforcement Learning | Code Available | 1 |
| Laser Learning Environment: A new environment for coordination-critical multi-agent tasks | Apr 4, 2024 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 1 |
| Learning the Markov Decision Process in the Sparse Gaussian Elimination | Sep 30, 2021 | Combinatorial Optimization, Q-Learning | Code Available | 1 |
| LS-IQ: Implicit Reward Regularization for Inverse Reinforcement Learning | Mar 1, 2023 | Continuous Control, Imitation Learning | Code Available | 1 |
| MAN: Multi-Action Networks Learning | Sep 19, 2022 | Atari Games, Deep Reinforcement Learning | Code Available | 1 |
| MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer | Jun 20, 2022 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 1 |
| Benchmarking Batch Deep Reinforcement Learning Algorithms | Oct 3, 2019 | Benchmarking, Deep Reinforcement Learning | Code Available | 1 |
| Benchmarking Deep Graph Generative Models for Optimizing New Drug Molecules for COVID-19 | Feb 9, 2021 | Benchmarking, Q-Learning | Code Available | 1 |
| Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver? | Dec 1, 2020 | Feature Engineering, Q-Learning | Code Available | 1 |
| Automated Cloud Provisioning on AWS using Deep Reinforcement Learning | Sep 13, 2017 | Cloud Computing, Deep Reinforcement Learning | Code Available | 1 |
| A Stochastic Game Framework for Efficient Energy Management in Microgrid Networks | Feb 6, 2020 | energy management, energy trading | Code Available | 1 |
| Addressing Function Approximation Error in Actor-Critic Methods | Feb 26, 2018 | Continuous Control, OpenAI Gym | Code Available | 1 |
| Boosting Continuous Control with Consistency Policy | Oct 10, 2023 | continuous-control, Continuous Control | Code Available | 1 |
| Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past | Jun 10, 2019 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 |
| When should we prefer Decision Transformers for Offline Reinforcement Learning? | May 23, 2023 | D4RL, Imitation Learning | Code Available | 1 |
| Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic Manipulation via Discretisation | Jun 23, 2021 | Continuous Control, Q-Learning | Code Available | 1 |
| Backprop-Free Reinforcement Learning with Active Neural Generative Coding | Jul 10, 2021 | Q-Learning, reinforcement-learning | Code Available | 1 |
| Conservative Q-Learning for Offline Reinforcement Learning | Jun 8, 2020 | continuous-control, Continuous Control | Code Available | 1 |
| An Optimistic Perspective on Offline Deep Reinforcement Learning | Jan 1, 2020 | Atari Games, Deep Reinforcement Learning | Code Available | 1 |