| Off-Policy Evaluation via Off-Policy Classification | Jun 4, 2019 | Classification, Deep Reinforcement Learning | Unverified | 0 |
| Adversarial Exploitation of Policy Imitation | Jun 3, 2019 | Deep Reinforcement Learning, Imitation Learning | Unverified | 0 |
| Sequential Triggers for Watermarking of Deep Reinforcement Learning Policies | Jun 3, 2019 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| RL-Based Method for Benchmarking the Adversarial Resilience and Robustness of Deep Reinforcement Learning Policies | Jun 3, 2019 | Benchmarking, Deep Reinforcement Learning | Unverified | 0 |
| Deep Reinforcement Learning Architecture for Continuous Power Allocation in High Throughput Satellites | Jun 3, 2019 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Decentralized Deep Reinforcement Learning for Delay-Power Tradeoff in Vehicular Communications | Jun 3, 2019 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Load Balancing for Ultra-Dense Networks: A Deep Reinforcement Learning Based Approach | Jun 3, 2019 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Air Learning: A Deep Reinforcement Learning Gym for Autonomous Aerial Robot Visual Navigation | Jun 2, 2019 | Benchmarking, Deep Reinforcement Learning | Code Available | 0 |
| Neural Replicator Dynamics | Jun 1, 2019 | counterfactual, Deep Reinforcement Learning | Code Available | 0 |
| Decision-Making in Reinforcement Learning | Jun 1, 2019 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Enhanced Bayesian Compression via Deep Reinforcement Learning | Jun 1, 2019 | Deep Reinforcement Learning, Quantization | Unverified | 0 |
| Sequence Modeling of Temporal Credit Assignment for Episodic Reinforcement Learning | May 31, 2019 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| Interval timing in deep reinforcement learning agents | May 31, 2019 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Effective Medical Test Suggestions Using Deep Reinforcement Learning | May 30, 2019 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| CopyCAT: Taking Control of Neural Policies with Constant Attacks | May 29, 2019 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| Snooping Attacks on Deep Reinforcement Learning | May 28, 2019 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Hypothesis-Driven Skill Discovery for Hierarchical Deep Reinforcement Learning | May 27, 2019 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Learning to Discretize: Solving 1D Scalar Conservation Laws via Deep Reinforcement Learning | May 27, 2019 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| AgentGraph: Towards Universal Dialogue Management with Structured Deep Reinforcement Learning | May 27, 2019 | Deep Reinforcement Learning, Dialogue Management | Unverified | 0 |
| Prioritized Sequence Experience Replay | May 25, 2019 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Composing Task-Agnostic Policies with Deep Reinforcement Learning | May 25, 2019 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Transferable Cost-Aware Security Policy Implementation for Malware Detection Using Deep Reinforcement Learning | May 25, 2019 | Deep Reinforcement Learning, Malware Detection | Unverified | 0 |
| Learning to Reason in Large Theories without Imitation | May 25, 2019 | Automated Theorem Proving, Deep Reinforcement Learning | Unverified | 0 |
| Adversarial Policies: Attacking Deep Reinforcement Learning | May 25, 2019 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Neural Temporal-Difference and Q-Learning Provably Converge to Global Optima | May 24, 2019 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Estimating Risk and Uncertainty in Deep Reinforcement Learning | May 23, 2019 | Bayesian Inference, Deep Reinforcement Learning | Code Available | 0 |
| Multi-hop Reading Comprehension via Deep Reinforcement Learning based Document Traversal | May 23, 2019 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| COBRA: Data-Efficient Model-Based RL through Unsupervised Object Discovery and Curiosity-Driven Exploration | May 22, 2019 | continuous-control, Continuous Control | Code Available | 0 |
| Deep Reinforcement Learning for Detecting Malicious Websites | May 22, 2019 | Deep Reinforcement Learning, Phishing Website Detection | Unverified | 0 |
| Stochastic Variance Reduction for Deep Q-learning | May 20, 2019 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Deep Reinforcement Learning Based Parameter Control in Differential Evolution | May 20, 2019 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| In Support of Over-Parametrization in Deep Reinforcement Learning: an Empirical Study | May 17, 2019 | Deep Reinforcement Learning, OpenAI Gym | Unverified | 0 |
| REPLAB: A Reproducible Low-Cost Arm Benchmark Platform for Robotic Learning | May 17, 2019 | Benchmarking, Deep Reinforcement Learning | Unverified | 0 |
| Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem | May 17, 2019 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Deep Reinforcement Learning-Based Channel Allocation for Wireless LANs with Graph Convolutional Networks | May 17, 2019 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Learning to learn to communicate | May 16, 2019 | Deep Reinforcement Learning, Meta-Learning | Unverified | 0 |
| Knowledge-Based Sequential Decision-Making Under Uncertainty | May 16, 2019 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| Meta Reinforcement Learning with Task Embedding and Shared Policy | May 16, 2019 | Deep Reinforcement Learning, Meta-Learning | Code Available | 0 |
| Learning Active Spine Behaviors for Dynamic and Efficient Locomotion in Quadruped Robots | May 15, 2019 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| Deep reinforcement learning for scheduling in large-scale networked control systems | May 15, 2019 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Deep Reinforcement Learning for Scheduling in Cellular Networks | May 15, 2019 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| TauRieL: Targeting Traveling Salesman Problem with a deep reinforcement learning inspired architecture | May 14, 2019 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Trajectory-Based Off-Policy Deep Reinforcement Learning | May 14, 2019 | continuous-control, Continuous Control | Code Available | 0 |
| Task-Agnostic Dynamics Priors for Deep Reinforcement Learning | May 13, 2019 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Diagnosing Reinforcement Learning for Traffic Signal Control | May 12, 2019 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Graph Attention Memory for Visual Navigation | May 11, 2019 | Deep Reinforcement Learning, Graph Attention | Unverified | 0 |
| Optimizing Routerless Network-on-Chip Designs: An Innovative Learning-Based Framework | May 11, 2019 | Deep Reinforcement Learning, Efficient Exploration | Unverified | 0 |
| Intelligent User Association for Symbiotic Radio Networks using Deep Reinforcement Learning | May 10, 2019 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Do Autonomous Agents Benefit from Hearing? | May 10, 2019 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| GAN-powered Deep Distributional Reinforcement Learning for Resource Management in Network Slicing | May 10, 2019 | Deep Reinforcement Learning, Distributional Reinforcement Learning | Unverified | 0 |