| Deep Learning of Intrinsically Motivated Options in the Arcade Learning Environment | Sep 29, 2021 | Atari GamesBenchmarking | —Unverified | 0 |
| Deep Dynamic Attention Model with Gate Mechanism for Solving Time-dependent Vehicle Routing Problems | Sep 29, 2021 | Combinatorial OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| Particle Based Stochastic Policy Optimization | Sep 29, 2021 | Deep Reinforcement LearningMuJoCo Games | —Unverified | 0 |
| Actor-Critic Policy Optimization in a Large-Scale Imperfect-Information Game | Sep 29, 2021 | counterfactualDeep Reinforcement Learning | —Unverified | 0 |
| Understanding the Generalization Gap in Visual Reinforcement Learning | Sep 29, 2021 | Data AugmentationDeep Reinforcement Learning | —Unverified | 0 |
| P4O: Efficient Deep Reinforcement Learning with Predictive Processing Proximal Policy Optimization | Sep 29, 2021 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Continuous Deep Q-Learning in Optimal Control Problems: Normalized Advantage Functions Analysis | Sep 29, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Variational oracle guiding for reinforcement learning | Sep 29, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Interpreting Reinforcement Policies through Local Behaviors | Sep 29, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Experience Replay More When It's a Key Transition in Deep Reinforcement Learning | Sep 29, 2021 | Deep Reinforcement LearningOpenAI Gym | —Unverified | 0 |
| WaveCorr: Deep Reinforcement Learning with Permutation Invariant Policy Networks for Portfolio Management | Sep 29, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| On the benefits of deep RL in accelerated MRI sampling | Sep 29, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| CausalDyna: Improving Generalization of Dyna-style Reinforcement Learning via Counterfactual-Based Data Augmentation | Sep 29, 2021 | counterfactualData Augmentation | —Unverified | 0 |
| Assessing Deep Reinforcement Learning Policies via Natural Corruptions at the Edge of Imperceptibility | Sep 29, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Symmetric Machine Theory of Mind | Sep 29, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Adversarial Style Transfer for Robust Policy Optimization in Reinforcement Learning | Sep 29, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Multi-batch Reinforcement Learning via Sample Transfer and Imitation Learning | Sep 29, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| An Optics Controlling Environment and Reinforcement Learning Benchmarks | Sep 29, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deep Reinforcement Learning for Equal Risk Option Pricing and Hedging under Dynamic Expectile Risk Measures | Sep 29, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| PDQN - A Deep Reinforcement Learning Method for Planning with Long Delays: Optimization of Manufacturing Dispatching | Sep 29, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Maximizing Ensemble Diversity in Deep Reinforcement Learning | Sep 29, 2021 | Atari GamesDecision Making | —Unverified | 0 |
| Programmatic Reinforcement Learning without Oracles | Sep 29, 2021 | Bilevel OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| Task-driven Discovery of Perceptual Schemas for Generalization in Reinforcement Learning | Sep 29, 2021 | Deep Reinforcement LearningObject | —Unverified | 0 |
| MARNET: Backdoor Attacks against Value-Decomposition Multi-Agent Reinforcement Learning | Sep 29, 2021 | Backdoor AttackDeep Reinforcement Learning | —Unverified | 0 |
| Variance Reduced Domain Randomization for Policy Gradient | Sep 29, 2021 | Deep Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |