| Deep Active Inference for Partially Observable MDPs | Sep 8, 2020 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Pre-Training for Robots: Offline RL Enables Learning New Tasks from a Handful of Trials | Oct 11, 2022 | Offline RLQ-Learning | CodeCode Available | 1 |
| Towards Universal and Black-Box Query-Response Only Attack on LLMs with QROA | Jun 4, 2024 | Q-Learning | CodeCode Available | 1 |
| Randomized Ensembled Double Q-Learning: Learning Fast Without a Model | Jan 15, 2021 | MuJoCoQ-Learning | CodeCode Available | 1 |
| Reasoning with Latent Diffusion in Offline Reinforcement Learning | Sep 12, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Regularized Softmax Deep Multi-Agent Q-Learning | Dec 1, 2021 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Research on Robot Path Planning Based on Reinforcement Learning | Apr 22, 2024 | Q-Learningreinforcement-learning | CodeCode Available | 1 |
| CCLF: A Contrastive-Curiosity-Driven Learning Framework for Sample-Efficient Reinforcement Learning | May 2, 2022 | Data AugmentationQ-Learning | CodeCode Available | 1 |
| Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past | Jun 10, 2019 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Reward Machines for Cooperative Multi-Agent Reinforcement Learning | Jul 3, 2020 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Adaptive Contention Window Design using Deep Q-learning | Nov 18, 2020 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic Manipulation via Discretisation | Jun 23, 2021 | Continuous ControlQ-Learning | CodeCode Available | 1 |
| Sampling Efficient Deep Reinforcement Learning through Preference-Guided Stochastic Exploration | Jun 20, 2022 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| A Search-Based Testing Approach for Deep Reinforcement Learning Agents | Jun 15, 2022 | Autonomous DrivingDecision Making | CodeCode Available | 1 |
| ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives | Dec 8, 2021 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Backprop-Free Reinforcement Learning with Active Neural Generative Coding | Jul 10, 2021 | Q-Learningreinforcement-learning | CodeCode Available | 1 |
| Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor | Jan 4, 2018 | Continuous ControlDecision Making | CodeCode Available | 1 |
| Solving Continuous Control via Q-learning | Oct 22, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning | Feb 28, 2017 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation | Jul 1, 2021 | Data AugmentationQ-Learning | CodeCode Available | 1 |
| Automated Cloud Provisioning on AWS using Deep Reinforcement Learning | Sep 13, 2017 | Cloud ComputingDeep Reinforcement Learning | CodeCode Available | 1 |
| An Optimistic Perspective on Offline Reinforcement Learning | Jul 10, 2019 | Atari GamesDiversity | CodeCode Available | 1 |
| Table2Charts: Recommending Charts by Learning Shared Table Representations | Aug 24, 2020 | Q-LearningRecommendation Systems | CodeCode Available | 1 |
| TempoRL: Learning When to Act | Jun 9, 2021 | Continuous ControlQ-Learning | CodeCode Available | 1 |
| Towards Optimal Adversarial Robust Q-learning with Bellman Infinity-error | Feb 3, 2024 | Adversarial RobustnessDeep Reinforcement Learning | CodeCode Available | 1 |
| Towards Robust Offline Reinforcement Learning under Diverse Data Corruption | Oct 19, 2023 | Offline RLQ-Learning | CodeCode Available | 1 |
| Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning | Jun 7, 2021 | Multi-agent Reinforcement LearningOffline RL | CodeCode Available | 1 |
| A Recipe for Unbounded Data Augmentation in Visual Reinforcement Learning | May 27, 2024 | Data AugmentationQ-Learning | CodeCode Available | 1 |
| An Optimistic Perspective on Offline Deep Reinforcement Learning | Jan 1, 2020 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| A Stochastic Game Framework for Efficient Energy Management in Microgrid Networks | Feb 6, 2020 | energy managementenergy trading | CodeCode Available | 1 |
| Benchmarking Deep Graph Generative Models for Optimizing New Drug Molecules for COVID-19 | Feb 9, 2021 | BenchmarkingQ-Learning | CodeCode Available | 1 |
| Boosting Continuous Control with Consistency Policy | Oct 10, 2023 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning | Mar 9, 2023 | Offline RLQ-Learning | CodeCode Available | 1 |
| Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver? | Dec 1, 2020 | Feature EngineeringQ-Learning | CodeCode Available | 1 |
| Benchmarking Batch Deep Reinforcement Learning Algorithms | Oct 3, 2019 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 1 |
| Combining Reinforcement Learning with Lin-Kernighan-Helsgaun Algorithm for the Traveling Salesman Problem | Dec 8, 2020 | Combinatorial OptimizationQ-Learning | CodeCode Available | 1 |
| Addressing Function Approximation Error in Actor-Critic Methods | Feb 26, 2018 | Continuous ControlOpenAI Gym | CodeCode Available | 1 |
| Continuous Deep Q-Learning with Model-based Acceleration | Mar 2, 2016 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Deep Inverse Q-learning with Constraints | Aug 4, 2020 | Q-Learning | CodeCode Available | 1 |
| FACMAC: Factored Multi-Agent Centralised Policy Gradients | Mar 14, 2020 | MuJoCoMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Deep Reinforcement Learning-based Intelligent Traffic Signal Controls with Optimized CO2 emissions | Oct 19, 2023 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Deep Reinforcement Learning with Double Q-learning | Sep 22, 2015 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| When should we prefer Decision Transformers for Offline Reinforcement Learning? | May 23, 2023 | D4RLImitation Learning | CodeCode Available | 1 |
| Diffusion Policies creating a Trust Region for Offline Reinforcement Learning | May 30, 2024 | D4RLDenoising | CodeCode Available | 1 |
| Discriminator Soft Actor Critic without Extrinsic Rewards | Jan 19, 2020 | Imitation LearningQ-Learning | CodeCode Available | 1 |
| Distilling Reinforcement Learning Tricks for Video Games | Jul 1, 2021 | Q-Learningreinforcement-learning | CodeCode Available | 1 |
| IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies | Apr 20, 2023 | Offline RLQ-Learning | CodeCode Available | 1 |
| Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization | Mar 28, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models | Oct 9, 2020 | Deep Reinforcement LearningEpidemiology | CodeCode Available | 1 |
| Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning | May 17, 2021 | Offline RLQ-Learning | CodeCode Available | 1 |