| Off-Policy Evaluation via Off-Policy Classification | Jun 4, 2019 | ClassificationDeep Reinforcement Learning | —Unverified | 0 |
| Adversarial Exploitation of Policy Imitation | Jun 3, 2019 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Sequential Triggers for Watermarking of Deep Reinforcement Learning Policies | Jun 3, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| RL-Based Method for Benchmarking the Adversarial Resilience and Robustness of Deep Reinforcement Learning Policies | Jun 3, 2019 | BenchmarkingDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning Architecture for Continuous Power Allocation in High Throughput Satellites | Jun 3, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Decentralized Deep Reinforcement Learning for Delay-Power Tradeoff in Vehicular Communications | Jun 3, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Load Balancing for Ultra-Dense Networks: A Deep Reinforcement Learning Based Approach | Jun 3, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Air Learning: A Deep Reinforcement Learning Gym for Autonomous Aerial Robot Visual Navigation | Jun 2, 2019 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 0 |
| Neural Replicator Dynamics | Jun 1, 2019 | counterfactualDeep Reinforcement Learning | CodeCode Available | 0 |
| Decision-Making in Reinforcement Learning | Jun 1, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Enhanced Bayesian Compression via Deep Reinforcement Learning | Jun 1, 2019 | Deep Reinforcement LearningQuantization | —Unverified | 0 |
| Sequence Modeling of Temporal Credit Assignment for Episodic Reinforcement Learning | May 31, 2019 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Interval timing in deep reinforcement learning agents | May 31, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Effective Medical Test Suggestions Using Deep Reinforcement Learning | May 30, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| CopyCAT: Taking Control of Neural Policies with Constant Attacks | May 29, 2019 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Snooping Attacks on Deep Reinforcement Learning | May 28, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Hypothesis-Driven Skill Discovery for Hierarchical Deep Reinforcement Learning | May 27, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learning to Discretize: Solving 1D Scalar Conservation Laws via Deep Reinforcement Learning | May 27, 2019 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| AgentGraph: Towards Universal Dialogue Management with Structured Deep Reinforcement Learning | May 27, 2019 | Deep Reinforcement LearningDialogue Management | —Unverified | 0 |
| Prioritized Sequence Experience Replay | May 25, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Composing Task-Agnostic Policies with Deep Reinforcement Learning | May 25, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Transferable Cost-Aware Security Policy Implementation for Malware Detection Using Deep Reinforcement Learning | May 25, 2019 | Deep Reinforcement LearningMalware Detection | —Unverified | 0 |
| Learning to Reason in Large Theories without Imitation | May 25, 2019 | Automated Theorem ProvingDeep Reinforcement Learning | —Unverified | 0 |
| Adversarial Policies: Attacking Deep Reinforcement Learning | May 25, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Neural Temporal-Difference and Q-Learning Provably Converge to Global Optima | May 24, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |