| Decentralized Cooperative Lane Changing at Freeway Weaving Areas Using Multi-Agent Deep Reinforcement Learning | Oct 5, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Automating Privilege Escalation with Deep Reinforcement Learning | Oct 4, 2021 | BIG-bench Machine LearningDeep Reinforcement Learning | —Unverified | 0 |
| Multi-Agent Path Planning Using Deep Reinforcement Learning | Oct 4, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Reinforcement Learning for Admission Control in Wireless Virtual Network Embedding | Oct 4, 2021 | Deep Reinforcement LearningNetwork Embedding | —Unverified | 0 |
| Behaviour-conditioned policies for cooperative reinforcement learning tasks | Oct 4, 2021 | Deep Reinforcement LearningMeta-Learning | —Unverified | 0 |
| DRL-Clusters: Buffer Management with Clustering based Deep Reinforcement Learning | Oct 3, 2021 | ClusteringDeep Reinforcement Learning | —Unverified | 0 |
| A Novel Automated Curriculum Strategy to Solve Hard Sokoban Planning Instances | Oct 3, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Solving the Real Robot Challenge using Deep Reinforcement Learning | Sep 30, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| A Privacy-preserving Distributed Training Framework for Cooperative Multi-agent Deep Reinforcement Learning | Sep 30, 2021 | Deep Reinforcement LearningPrivacy Preserving | —Unverified | 0 |
| Stability Constrained Reinforcement Learning for Real-Time Voltage Control | Sep 30, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Trajectory Planning with Deep Reinforcement Learning in High-Level Action Spaces | Sep 30, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Bitcoin Transaction Strategy Construction Based on Deep Reinforcement Learning | Sep 30, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Modeling Interactions of Autonomous Vehicles and Pedestrians with Deep Multi-Agent Reinforcement Learning for Collision Avoidance | Sep 30, 2021 | Autonomous VehiclesCollision Avoidance | —Unverified | 0 |
| Neural Network Verification in Control | Sep 30, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Maximizing Ensemble Diversity in Deep Reinforcement Learning | Sep 29, 2021 | Atari GamesDecision Making | —Unverified | 0 |
| Programmatic Reinforcement Learning without Oracles | Sep 29, 2021 | Bilevel OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| MARNET: Backdoor Attacks against Value-Decomposition Multi-Agent Reinforcement Learning | Sep 29, 2021 | Backdoor AttackDeep Reinforcement Learning | —Unverified | 0 |
| Task-driven Discovery of Perceptual Schemas for Generalization in Reinforcement Learning | Sep 29, 2021 | Deep Reinforcement LearningObject | —Unverified | 0 |
| Adversarial Style Transfer for Robust Policy Optimization in Reinforcement Learning | Sep 29, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| CausalDyna: Improving Generalization of Dyna-style Reinforcement Learning via Counterfactual-Based Data Augmentation | Sep 29, 2021 | counterfactualData Augmentation | —Unverified | 0 |
| Reinforcement Learning with Predictive Consistent Representations | Sep 29, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Assessing Deep Reinforcement Learning Policies via Natural Corruptions at the Edge of Imperceptibility | Sep 29, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deep Learning of Intrinsically Motivated Options in the Arcade Learning Environment | Sep 29, 2021 | Atari GamesBenchmarking | —Unverified | 0 |
| Improving Safety in Deep Reinforcement Learning using Unsupervised Action Planning | Sep 29, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Why Should I Trust You, Bellman? Evaluating the Bellman Objective with Off-Policy Data | Sep 29, 2021 | Deep Reinforcement LearningOff-policy evaluation | —Unverified | 0 |