| Particle Based Stochastic Policy Optimization | Sep 29, 2021 | Deep Reinforcement Learning, MuJoCo Games | Unverified | 0 |
| Deep Reinforcement Learning for Equal Risk Option Pricing and Hedging under Dynamic Expectile Risk Measures | Sep 29, 2021 | Deep Reinforcement Learning | Unverified | 0 |
| The Remarkable Effectiveness of Combining Policy and Value Networks in A*-based Deep RL for AI Planning | Sep 29, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| A Risk-Sensitive Policy Gradient Method | Sep 29, 2021 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Understanding the Generalization Gap in Visual Reinforcement Learning | Sep 29, 2021 | Data Augmentation, Deep Reinforcement Learning | Unverified | 0 |
| Mitigation of Adversarial Policy Imitation via Constrained Randomization of Policy (CRoP) | Sep 29, 2021 | Deep Reinforcement Learning, Imitation Learning | Unverified | 0 |
| Symmetric Machine Theory of Mind | Sep 29, 2021 | Deep Reinforcement Learning | Unverified | 0 |
| Efficient Reinforcement Learning Experimentation in PyTorch | Sep 29, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| An Optics Controlling Environment and Reinforcement Learning Benchmarks | Sep 29, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Reward Shifting for Optimistic Exploration and Conservative Exploitation | Sep 29, 2021 | continuous-control, Continuous Control | Unverified | 0 |
| That Escalated Quickly: Compounding Complexity by Editing Levels at the Frontier of Agent Capabilities | Sep 29, 2021 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Cooperative Task Offloading and Block Mining in Blockchain-based Edge Computing with Multi-agent Deep Reinforcement Learning | Sep 29, 2021 | channel selection, Deep Reinforcement Learning | Unverified | 0 |
| Information-Bottleneck-Based Behavior Representation Learning for Multi-agent Reinforcement learning | Sep 29, 2021 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Unverified | 0 |
| Generalizing Successor Features to continuous domains for Multi-task Learning | Sep 29, 2021 | continuous-control, Continuous Control | Unverified | 0 |
| Learning Controllable Elements Oriented Representations for Reinforcement Learning | Sep 29, 2021 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Detecting Worst-case Corruptions via Loss Landscape Curvature in Deep Reinforcement Learning | Sep 29, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Interpreting Reinforcement Policies through Local Behaviors | Sep 29, 2021 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Experience Replay More When It's a Key Transition in Deep Reinforcement Learning | Sep 29, 2021 | Deep Reinforcement Learning, OpenAI Gym | Unverified | 0 |
| P4O: Efficient Deep Reinforcement Learning with Predictive Processing Proximal Policy Optimization | Sep 29, 2021 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| Towards Unknown-aware Deep Q-Learning | Sep 29, 2021 | Deep Reinforcement Learning, Out of Distribution (OOD) Detection | Unverified | 0 |
| Continuous Deep Q-Learning in Optimal Control Problems: Normalized Advantage Functions Analysis | Sep 29, 2021 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| On the benefits of deep RL in accelerated MRI sampling | Sep 29, 2021 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Unverified | 0 |
| PDQN - A Deep Reinforcement Learning Method for Planning with Long Delays: Optimization of Manufacturing Dispatching | Sep 29, 2021 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Variance Reduced Domain Randomization for Policy Gradient | Sep 29, 2021 | Deep Reinforcement Learning, Policy Gradient Methods | Unverified | 0 |
| Actor-Critic Policy Optimization in a Large-Scale Imperfect-Information Game | Sep 29, 2021 | counterfactual, Deep Reinforcement Learning | Unverified | 0 |