| URLB: Unsupervised Reinforcement Learning Benchmark | Oct 28, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Learning Domain Invariant Representations in Goal-conditioned Block MDPs | Oct 27, 2021 | Deep Reinforcement LearningDomain Generalization | CodeCode Available | 1 |
| Learning from demonstrations with SACR2: Soft Actor-Critic with Reward Relabeling | Oct 27, 2021 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Learning Diverse Policies in MOBA Games via Macro-Goals | Oct 27, 2021 | Deep Reinforcement LearningDiversity | —Unverified | 0 |
| Comparing Heuristics, Constraint Optimization, and Reinforcement Learning for an Industrial 2D Packing Problem | Oct 27, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Towards Robust Bisimulation Metric Learning | Oct 27, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Learning Collaborative Policies to Solve NP-hard Routing Problems | Oct 26, 2021 | Deep Reinforcement LearningTraveling Salesman Problem | CodeCode Available | 1 |
| The Difficulty of Passive Learning in Deep Reinforcement Learning | Oct 26, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Accelerating Distributed Deep Reinforcement Learning by In-Network Experience Sampling | Oct 26, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Neural PPO-Clip Attains Global Optimality: A Hinge Loss Perspective | Oct 26, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Applications of Multi-Agent Reinforcement Learning in Future Internet: A Comprehensive Survey | Oct 26, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Distributed Multi-Agent Deep Reinforcement Learning Framework for Whole-building HVAC Control | Oct 26, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A Deep Reinforcement Learning Approach for Audio-based Navigation and Audio Source Localization in Multi-speaker Environments | Oct 25, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Uniformly Conservative Exploration in Reinforcement Learning | Oct 25, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Recurrent Off-policy Baselines for Memory-based Continuous Control | Oct 25, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Deep Reinforcement Learning for Simultaneous Sensing and Channel Access in Cognitive Networks | Oct 24, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| A Distributed Deep Reinforcement Learning Technique for Application Placement in Edge and Fog Computing Environments | Oct 24, 2021 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 |
| Fully Distributed Actor-Critic Architecture for Multitask Deep Reinforcement Learning | Oct 23, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| ReLAX: Reinforcement Learning Agent eXplainer for Arbitrary Predictive Models | Oct 22, 2021 | counterfactualDecision Making | CodeCode Available | 0 |
| Anti-Concentrated Confidence Bonuses for Scalable Exploration | Oct 21, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Online Control of Stochastic Partial Differential Equations | Oct 21, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Locality-Sensitive Experience Replay for Online Recommendation | Oct 21, 2021 | Deep Reinforcement LearningRecommendation Systems | —Unverified | 0 |
| Neuro-Symbolic Reinforcement Learning with First-Order Logic | Oct 21, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deep Generative Models in Engineering Design: A Review | Oct 21, 2021 | Deep Reinforcement LearningDesign Synthesis | —Unverified | 0 |
| CIM-PPO:Proximal Policy Optimization with Liu-Correntropy Induced Metric | Oct 20, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |