| A Risk-Sensitive Policy Gradient Method | Sep 29, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Learning Efficient Online 3D Bin Packing on Packing Configuration Trees | Sep 29, 2021 | 3D Bin PackingDeep Reinforcement Learning | CodeCode Available | 2 |
| Generalizing Successor Features to continuous domains for Multi-task Learning | Sep 29, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Why Should I Trust You, Bellman? Evaluating the Bellman Objective with Off-Policy Data | Sep 29, 2021 | Deep Reinforcement LearningOff-policy evaluation | —Unverified | 0 |
| The Remarkable Effectiveness of Combining Policy and Value Networks in A*-based Deep RL for AI Planning | Sep 29, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Towards Unknown-aware Deep Q-Learning | Sep 29, 2021 | Deep Reinforcement LearningOut of Distribution (OOD) Detection | —Unverified | 0 |
| Reinforcement Learning with Predictive Consistent Representations | Sep 29, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Information-Bottleneck-Based Behavior Representation Learning for Multi-agent Reinforcement learning | Sep 29, 2021 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Mitigation of Adversarial Policy Imitation via Constrained Randomization of Policy (CRoP) | Sep 29, 2021 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Formulation and validation of a car-following model based on deep reinforcement learning | Sep 29, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Cooperative Task Offloading and Block Mining in Blockchain-based Edge Computing with Multi-agent Deep Reinforcement Learning | Sep 29, 2021 | channel selectionDeep Reinforcement Learning | —Unverified | 0 |
| Improving Safety in Deep Reinforcement Learning using Unsupervised Action Planning | Sep 29, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey | Sep 28, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Identifying Reasoning Flaws in Planning-Based RL Using Tree Explanations | Sep 28, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| An Offline Deep Reinforcement Learning for Maintenance Decision-Making | Sep 28, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Longitudinal Deep Truck: Deep learning and deep reinforcement learning for modeling and control of longitudinal dynamics of heavy duty trucks | Sep 28, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Adaptive Informative Path Planning Using Deep Reinforcement Learning for UAV-based Active Sensing | Sep 28, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Exploring More When It Needs in Deep Reinforcement Learning | Sep 28, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Deep Reinforcement Learning with Adjustments | Sep 28, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research | Sep 27, 2021 | Deep Reinforcement LearningNetHack | —Unverified | 0 |
| Efficiently Training On-Policy Actor-Critic Networks in Robotic Deep Reinforcement Learning with Demonstration-like Sampled Exploration | Sep 27, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| DRL-based Slice Placement under Realistic Network Load Conditions | Sep 27, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| PM-FSM: Policies Modulating Finite State Machine for Robust Quadrupedal Locomotion | Sep 26, 2021 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Deep Reinforcement Learning for Wireless Scheduling in Distributed Networked Control | Sep 26, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Emergent behavior and neural dynamics in artificial agents tracking turbulent plumes | Sep 25, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |