| Towards Understanding Chinese Checkers with Heuristics, Monte Carlo Tree Search, and Deep Reinforcement Learning | Mar 5, 2019 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Microscopic Traffic Simulation by Cooperative Multi-agent Deep Reinforcement Learning | Mar 4, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Budgeted Reinforcement Learning in Continuous State Space | Mar 3, 2019 | Autonomous DrivingDeep Reinforcement Learning | CodeCode Available | 0 |
| OmniDRL: Robust Pedestrian Detection using Deep Reinforcement Learning on Omnidirectional Cameras | Mar 2, 2019 | Deep Reinforcement LearningPedestrian Detection | —Unverified | 0 |
| TrojDRL: Trojan Attacks on Deep Reinforcement Learning Agents | Mar 1, 2019 | Data PoisoningDeep Reinforcement Learning | CodeCode Available | 0 |
| Learning To Follow Directions in Street View | Mar 1, 2019 | Deep Reinforcement LearningInstruction Following | CodeCode Available | 0 |
| Catalyst.RL: A Distributed Framework for Reproducible RL Research | Feb 28, 2019 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Neural Packet Classification | Feb 27, 2019 | ClassificationDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Adaptive Caching in Hierarchical Content Delivery Networks | Feb 27, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learning Multi-agent Communication under Limited-bandwidth Restriction for Internet Packet Routing | Feb 26, 2019 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Diagnosing Bottlenecks in Deep Q-learning Algorithms | Feb 26, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Coloring Big Graphs with AlphaGoZero | Feb 26, 2019 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Can Meta-Interpretive Learning outperform Deep Reinforcement Learning of Evaluable Game strategies? | Feb 26, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Joint Modeling of Dense and Incomplete Trajectories for Citywide Traffic Volume Inference | Feb 25, 2019 | Deep Reinforcement LearningGraph Embedding | —Unverified | 0 |
| Marathon Environments: Multi-Agent Continuous Control Benchmarks in a Modern Video Game Engine | Feb 25, 2019 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Flappy Hummingbird: An Open Source Dynamic Simulation of Flapping Wing Robots and Animals | Feb 25, 2019 | Deep Reinforcement LearningOpenAI Gym | CodeCode Available | 0 |
| Learning Deterministic Policy with Target for Power Control in Wireless Networks | Feb 21, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deep Reinforcement Learning using Genetic Algorithm for Parameter Optimization | Feb 19, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| DOM-Q-NET: Grounded RL on Structured Language | Feb 19, 2019 | Deep Reinforcement LearningGraph Neural Network | CodeCode Available | 0 |
| Investigating Generalisation in Continuous Deep Reinforcement Learning | Feb 19, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Autonomous Airline Revenue Management: A Deep Reinforcement Learning Approach to Seat Inventory Control and Overbooking | Feb 18, 2019 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Message-Dropout: An Efficient Training Method for Multi-Agent Deep Reinforcement Learning | Feb 18, 2019 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Leveraging Communication Topologies Between Learning Agents in Deep Reinforcement Learning | Feb 16, 2019 | BIG-bench Machine LearningDeep Reinforcement Learning | —Unverified | 0 |
| AutoQ: Automated Kernel-Wise Neural Network Quantization | Feb 15, 2019 | AutoMLDeep Reinforcement Learning | —Unverified | 0 |
| Network Offloading Policies for Cloud Robotics: a Learning-based Approach | Feb 15, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Neural-encoding Human Experts' Domain Knowledge to Warm Start Reinforcement Learning | Feb 15, 2019 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning Based High-level Driving Behavior Decision-making Model in Heterogeneous Traffic | Feb 15, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Off-Policy Actor-Critic in an Ensemble: Achieving Maximum General Entropy and Effective Environment Exploration in Deep Reinforcement Learning | Feb 14, 2019 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Active Perception in Adversarial Scenarios using Maximum Entropy Deep Reinforcement Learning | Feb 14, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity | Feb 14, 2019 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Deep Reinforcement Learning from Policy-Dependent Human Feedback | Feb 12, 2019 | Deep Reinforcement LearningMinecraft | —Unverified | 0 |
| Latent Space Reinforcement Learning for Steering Angle Prediction | Feb 11, 2019 | Deep Reinforcement LearningPrediction | —Unverified | 0 |
| Generalization through Simulation: Integrating Simulated and Real Data into Deep Reinforcement Learning for Vision-Based Autonomous Flight | Feb 11, 2019 | Collision AvoidanceDeep Reinforcement Learning | CodeCode Available | 0 |
| WiseMove: A Framework for Safe Deep Reinforcement Learning for Autonomous Driving | Feb 11, 2019 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| A Bandit Framework for Optimal Selection of Reinforcement Learning Agents | Feb 10, 2019 | Deep Reinforcement LearningInductive Bias | —Unverified | 0 |
| Novelty Search for Deep Reinforcement Learning Policy Network Weights by Action Sequence Edit Metric Distance | Feb 8, 2019 | Deep Reinforcement LearningEvolutionary Algorithms | CodeCode Available | 0 |
| Visual search and recognition for robot task execution and monitoring | Feb 7, 2019 | Common Sense ReasoningDeep Reinforcement Learning | —Unverified | 0 |
| Metaoptimization on a Distributed System for Deep Reinforcement Learning | Feb 7, 2019 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Artificial Intelligence for Prosthetics - challenge solutions | Feb 7, 2019 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 |
| Distilling Policy Distillation | Feb 6, 2019 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Adaptive Stress Testing for Autonomous Vehicles | Feb 5, 2019 | Autonomous VehiclesDecision Making | —Unverified | 0 |
| Learning to Schedule Communication in Multi-agent Reinforcement Learning | Feb 5, 2019 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Learning to Learn in Simulation | Feb 5, 2019 | Deep Reinforcement Learningobject-detection | —Unverified | 0 |
| Embodied Multimodal Multitask Learning | Feb 4, 2019 | Deep Reinforcement LearningDisentanglement | —Unverified | 0 |
| Joint Entity Linking with Deep Reinforcement Learning | Feb 1, 2019 | Deep Reinforcement LearningEntity Disambiguation | —Unverified | 0 |
| Policy Consolidation for Continual Reinforcement Learning | Feb 1, 2019 | Continual Learningcontinuous-control | CodeCode Available | 0 |
| Visual Rationalizations in Deep Reinforcement Learning for Atari Games | Feb 1, 2019 | Atari GamesDecision Making | —Unverified | 0 |
| A Theory of Regularized Markov Decision Processes | Jan 31, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement | Jan 30, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Safe, Efficient, and Comfortable Velocity Control based on Reinforcement Learning for Autonomous Driving | Jan 29, 2019 | Autonomous DrivingDeep Reinforcement Learning | CodeCode Available | 0 |