| The Impact of Task Underspecification in Evaluating Deep Reinforcement Learning | Oct 16, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Entropy Regularized Reinforcement Learning with Cascading Networks | Oct 16, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| CUP: Critic-Guided Policy Reuse | Oct 15, 2022 | Deep Reinforcement Learning | CodeCode Available | 0 |
| DyFEn: Agent-Based Fee Setting in Payment Channel Networks | Oct 15, 2022 | BenchmarkingDeep Reinforcement Learning | —Unverified | 0 |
| Just Round: Quantized Observation Spaces Enable Memory Efficient Learning of Dynamic Locomotion | Oct 14, 2022 | Deep Reinforcement LearningQuantization | CodeCode Available | 0 |
| Adaptive patch foraging in deep reinforcement learning agents | Oct 14, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A Scalable Finite Difference Method for Deep Reinforcement Learning | Oct 14, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Monte Carlo Augmented Actor-Critic for Sparse Reward Deep Reinforcement Learning from Suboptimal Demonstrations | Oct 14, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Distributional Reward Estimation for Effective Multi-Agent Deep Reinforcement Learning | Oct 14, 2022 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Observed Adversaries in Deep Reinforcement Learning | Oct 13, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |