| Robust Q-learning Algorithm for Markov Decision Processes under Wasserstein Uncertainty | Sep 30, 2022 | Q-Learning | CodeCode Available | 1 |
| Revisiting Discrete Soft Actor-Critic | Sep 21, 2022 | Atari GamesQ-Learning | CodeCode Available | 1 |
| MAN: Multi-Action Networks Learning | Sep 19, 2022 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games | Jul 18, 2022 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning | Jul 12, 2022 | Lifelong learningPolicy Gradient Methods | CodeCode Available | 1 |
| Reinforced Lin-Kernighan-Helsgaun Algorithms for the Traveling Salesman Problems | Jul 8, 2022 | Combinatorial OptimizationQ-Learning | CodeCode Available | 1 |
| On the Learning and Learnability of Quasimetrics | Jun 30, 2022 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer | Jun 20, 2022 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Sampling Efficient Deep Reinforcement Learning through Preference-Guided Stochastic Exploration | Jun 20, 2022 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| A Search-Based Testing Approach for Deep Reinforcement Learning Agents | Jun 15, 2022 | Autonomous DrivingDecision Making | CodeCode Available | 1 |
| Mildly Conservative Q-Learning for Offline Reinforcement Learning | Jun 9, 2022 | D4RLQ-Learning | CodeCode Available | 1 |
| CCLF: A Contrastive-Curiosity-Driven Learning Framework for Sample-Efficient Reinforcement Learning | May 2, 2022 | Data AugmentationQ-Learning | CodeCode Available | 1 |
| GAIL-PT: A Generic Intelligent Penetration Testing Framework with Generative Adversarial Imitation Learning | Apr 5, 2022 | Imitation LearningQ-Learning | CodeCode Available | 1 |
| Microservice Deployment in Edge Computing Based on Deep Q Learning | Feb 11, 2022 | Edge-computingQ-Learning | CodeCode Available | 1 |
| Addressing Maximization Bias in Reinforcement Learning with Two-Sample Testing | Jan 20, 2022 | Q-Learningreinforcement-learning | CodeCode Available | 1 |
| Safety and Liveness Guarantees through Reach-Avoid Reinforcement Learning | Dec 23, 2021 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives | Dec 8, 2021 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Regularized Softmax Deep Multi-Agent Q-Learning | Dec 1, 2021 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Offline Reinforcement Learning with Implicit Q-Learning | Oct 12, 2021 | D4RLOffline RL | CodeCode Available | 1 |
| Dropout Q-Functions for Doubly Efficient Reinforcement Learning | Oct 5, 2021 | Computational EfficiencyQ-Learning | CodeCode Available | 1 |
| Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble | Oct 4, 2021 | Adroid door-clonedAdroid door-human | CodeCode Available | 1 |
| Learning the Markov Decision Process in the Sparse Gaussian Elimination | Sep 30, 2021 | Combinatorial OptimizationQ-Learning | CodeCode Available | 1 |
| Offline Reinforcement Learning with In-sample Q-Learning | Sep 29, 2021 | D4RLOffline RL | CodeCode Available | 1 |
| Deep Reinforcement Q-Learning for Intelligent Traffic Signal Control with Partial Detection | Sep 29, 2021 | Q-LearningTraffic Signal Control | CodeCode Available | 1 |
| Backprop-Free Reinforcement Learning with Active Neural Generative Coding | Jul 10, 2021 | Q-Learningreinforcement-learning | CodeCode Available | 1 |
| Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation | Jul 1, 2021 | Data AugmentationQ-Learning | CodeCode Available | 1 |
| Distilling Reinforcement Learning Tricks for Video Games | Jul 1, 2021 | Q-Learningreinforcement-learning | CodeCode Available | 1 |
| Towards self-organized control: Using neural cellular automata to robustly control a cart-pole agent | Jun 29, 2021 | Q-Learning | CodeCode Available | 1 |
| Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic Manipulation via Discretisation | Jun 23, 2021 | Continuous ControlQ-Learning | CodeCode Available | 1 |
| IQ-Learn: Inverse soft-Q Learning for Imitation | Jun 23, 2021 | Atari GamesContinuous Control | CodeCode Available | 1 |
| Distributed Heuristic Multi-Agent Path Finding with Communication | Jun 21, 2021 | Multi-Agent Path FindingQ-Learning | CodeCode Available | 1 |
| Efficient (Soft) Q-Learning for Text Generation with Limited Good Data | Jun 14, 2021 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| TempoRL: Learning When to Act | Jun 9, 2021 | Continuous ControlQ-Learning | CodeCode Available | 1 |
| Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning | Jun 7, 2021 | Multi-agent Reinforcement LearningOffline RL | CodeCode Available | 1 |
| SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning | May 31, 2021 | FairnessMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning | May 17, 2021 | Offline RLQ-Learning | CodeCode Available | 1 |
| HASCO: Towards Agile HArdware and Software CO-design for Tensor Computation | May 4, 2021 | Bayesian OptimizationQ-Learning | CodeCode Available | 1 |
| Optimal Market Making by Reinforcement Learning | Apr 8, 2021 | Q-Learningreinforcement-learning | CodeCode Available | 1 |
| DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning | Feb 16, 2021 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Benchmarking Deep Graph Generative Models for Optimizing New Drug Molecules for COVID-19 | Feb 9, 2021 | BenchmarkingQ-Learning | CodeCode Available | 1 |
| Acting in Delayed Environments with Non-Stationary Markov Policies | Jan 28, 2021 | Cloud ComputingQ-Learning | CodeCode Available | 1 |
| Randomized Ensembled Double Q-Learning: Learning Fast Without a Model | Jan 15, 2021 | MuJoCoQ-Learning | CodeCode Available | 1 |
| Simulating SQL Injection Vulnerability Exploitation Using Q-Learning Reinforcement Learning Agents | Jan 8, 2021 | Q-Learningreinforcement-learning | CodeCode Available | 1 |
| Multi-Agent Trust Region Learning | Jan 1, 2021 | Atari GamesMuJoCo | CodeCode Available | 1 |
| Combining Reinforcement Learning with Lin-Kernighan-Helsgaun Algorithm for the Traveling Salesman Problem | Dec 8, 2020 | Combinatorial OptimizationQ-Learning | CodeCode Available | 1 |
| Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver? | Dec 1, 2020 | Feature EngineeringQ-Learning | CodeCode Available | 1 |
| Adaptive Contention Window Design using Deep Q-learning | Nov 18, 2020 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time Systems with Lipschitz Continuous Controls | Oct 27, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Learning Guidance Rewards with Trajectory-space Smoothing | Oct 23, 2020 | AttributeDeep Reinforcement Learning | CodeCode Available | 1 |
| Multi-Agent Collaboration via Reward Attribution Decomposition | Oct 16, 2020 | Dota 2Multi-agent Reinforcement Learning | CodeCode Available | 1 |