| A gray-box approach for curriculum learning | Jun 17, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Universal Successor Features Based Deep Reinforcement Learning for Navigation | Jun 17, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Self-Tuning Sectorization: Deep Reinforcement Learning Meets Broadcast Beam Optimization | Jun 14, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Cross-View Policy Learning for Street Navigation | Jun 13, 2019 | Deep Reinforcement LearningNavigate | CodeCode Available | 0 |
| Deep Reinforcement Learning for Industrial Insertion Tasks with Visual Inputs and Natural Rewards | Jun 13, 2019 | Deep Reinforcement LearningFriction | CodeCode Available | 0 |
| Deep Reinforcement Learning for Cyber Security | Jun 13, 2019 | Deep Reinforcement LearningIntrusion Detection | —Unverified | 0 |
| Deep Reinforcement Learning for Unmanned Aerial Vehicle-Assisted Vehicular Networks | Jun 12, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deep learning control of artificial avatars in group coordination tasks | Jun 11, 2019 | Deep LearningDeep Reinforcement Learning | —Unverified | 0 |
| Dealing with Non-Stationarity in Multi-Agent Deep Reinforcement Learning | Jun 11, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning with Discrete Normalized Advantage Functions for Resource Management in Network Slicing | Jun 10, 2019 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Transfer Learning by Modeling a Distribution over Policies | Jun 9, 2019 | Deep Reinforcement LearningDiversity | —Unverified | 0 |
| Neural Heterogeneous Scheduler | Jun 9, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning | Jun 9, 2019 | Deep Reinforcement LearningGPU | CodeCode Available | 0 |
| Multi-modal Active Learning From Human Data: A Deep Reinforcement Learning Approach | Jun 7, 2019 | Active LearningDeep Reinforcement Learning | —Unverified | 0 |
| Improving Exploration in Soft-Actor-Critic with Normalizing Flows Policies | Jun 6, 2019 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| An Extensible Interactive Interface for Agent Design | Jun 6, 2019 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Multi-objective Optimization | Jun 6, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deep Q-Learning for Directed Acyclic Graph Generation | Jun 5, 2019 | Deep Reinforcement LearningGraph Generation | —Unverified | 0 |
| Reinforcement Learning with Low-Complexity Liquid State Machines | Jun 4, 2019 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Learning dynamic polynomial proofs | Jun 4, 2019 | BIG-bench Machine LearningDeep Reinforcement Learning | —Unverified | 0 |
| On-board Deep Q-Network for UAV-assisted Online Power Transfer and Data Collection | Jun 4, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Options as responses: Grounding behavioural hierarchies in multi-agent RL | Jun 4, 2019 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Off-Policy Evaluation via Off-Policy Classification | Jun 4, 2019 | ClassificationDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning Architecture for Continuous Power Allocation in High Throughput Satellites | Jun 3, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| RL-Based Method for Benchmarking the Adversarial Resilience and Robustness of Deep Reinforcement Learning Policies | Jun 3, 2019 | BenchmarkingDeep Reinforcement Learning | —Unverified | 0 |
| Decentralized Deep Reinforcement Learning for Delay-Power Tradeoff in Vehicular Communications | Jun 3, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Adversarial Exploitation of Policy Imitation | Jun 3, 2019 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Sequential Triggers for Watermarking of Deep Reinforcement Learning Policies | Jun 3, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Load Balancing for Ultra-Dense Networks: A Deep Reinforcement Learning Based Approach | Jun 3, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Air Learning: A Deep Reinforcement Learning Gym for Autonomous Aerial Robot Visual Navigation | Jun 2, 2019 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 0 |
| Neural Replicator Dynamics | Jun 1, 2019 | counterfactualDeep Reinforcement Learning | CodeCode Available | 0 |
| Enhanced Bayesian Compression via Deep Reinforcement Learning | Jun 1, 2019 | Deep Reinforcement LearningQuantization | —Unverified | 0 |
| Decision-Making in Reinforcement Learning | Jun 1, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Interval timing in deep reinforcement learning agents | May 31, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Sequence Modeling of Temporal Credit Assignment for Episodic Reinforcement Learning | May 31, 2019 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Effective Medical Test Suggestions Using Deep Reinforcement Learning | May 30, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| CopyCAT: Taking Control of Neural Policies with Constant Attacks | May 29, 2019 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Learning to Discretize: Solving 1D Scalar Conservation Laws via Deep Reinforcement Learning | May 27, 2019 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| AgentGraph: Towards Universal Dialogue Management with Structured Deep Reinforcement Learning | May 27, 2019 | Deep Reinforcement LearningDialogue Management | —Unverified | 0 |
| Hypothesis-Driven Skill Discovery for Hierarchical Deep Reinforcement Learning | May 27, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learning to Reason in Large Theories without Imitation | May 25, 2019 | Automated Theorem ProvingDeep Reinforcement Learning | —Unverified | 0 |
| Transferable Cost-Aware Security Policy Implementation for Malware Detection Using Deep Reinforcement Learning | May 25, 2019 | Deep Reinforcement LearningMalware Detection | —Unverified | 0 |
| Composing Task-Agnostic Policies with Deep Reinforcement Learning | May 25, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Prioritized Sequence Experience Replay | May 25, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Neural Temporal-Difference and Q-Learning Provably Converge to Global Optima | May 24, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Estimating Risk and Uncertainty in Deep Reinforcement Learning | May 23, 2019 | Bayesian InferenceDeep Reinforcement Learning | CodeCode Available | 0 |
| Multi-hop Reading Comprehension via Deep Reinforcement Learning based Document Traversal | May 23, 2019 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning for Detecting Malicious Websites | May 22, 2019 | Deep Reinforcement LearningPhishing Website Detection | —Unverified | 0 |
| COBRA: Data-Efficient Model-Based RL through Unsupervised Object Discovery and Curiosity-Driven Exploration | May 22, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Stochastic Variance Reduction for Deep Q-learning | May 20, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |