| Uniformly Conservative Exploration in Reinforcement Learning | Oct 25, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Recurrent Off-policy Baselines for Memory-based Continuous Control | Oct 25, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent agents | Oct 18, 2021 | Deep Reinforcement LearningJob Shop Scheduling | CodeCode Available | 1 |
| MARVEL: Raster Manga Vectorization via Primitive-wise Deep Reinforcement Learning | Oct 10, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations | Oct 9, 2021 | Deep Reinforcement LearningStarcraft | CodeCode Available | 1 |
| Augmenting Reinforcement Learning with Behavior Primitives for Diverse Manipulation Tasks | Oct 7, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Replay-Guided Adversarial Environment Design | Oct 6, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Deep Reinforcement Learning for Solving the Heterogeneous Capacitated Vehicle Routing Problem | Oct 6, 2021 | DecoderDeep Reinforcement Learning | CodeCode Available | 1 |
| Continuous-Time Fitted Value Iteration for Robust Policies | Oct 5, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Large Batch Experience Replay | Oct 4, 2021 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| Collective eXplainable AI: Explaining Cooperative Strategies and Agent Contribution in Multiagent Reinforcement Learning with Shapley Values | Oct 4, 2021 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Unified Data Collection for Visual-Inertial Calibration via Deep Reinforcement Learning | Sep 30, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning | Sep 29, 2021 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 1 |
| Emergent behavior and neural dynamics in artificial agents tracking turbulent plumes | Sep 25, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Enhancing Navigational Safety in Crowded Environments using Semantic-Deep-Reinforcement-Learning-based Navigation | Sep 23, 2021 | Deep Reinforcement LearningNavigate | CodeCode Available | 1 |
| ENERO: Efficient Real-Time WAN Routing Optimization with Deep Reinforcement Learning | Sep 22, 2021 | Deep Reinforcement LearningGraph Neural Network | CodeCode Available | 1 |
| Hierarchical Policy for Non-prehensile Multi-object Rearrangement with Deep Reinforcement Learning and Monte Carlo Tree Search | Sep 18, 2021 | Deep Reinforcement LearningObject | CodeCode Available | 1 |
| Focus on Impact: Indoor Exploration with Intrinsic Motivation | Sep 14, 2021 | Deep Reinforcement Learning | CodeCode Available | 1 |
| Learning to Navigate Intersections with Unsupervised Driver Trait Inference | Sep 14, 2021 | Autonomous NavigationAutonomous Vehicles | CodeCode Available | 1 |
| Learning Selective Communication for Multi-Agent Path Finding | Sep 12, 2021 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| DROP: Deep relocating option policy for optimal ride-hailing vehicle repositioning | Sep 9, 2021 | Deep Reinforcement Learning | CodeCode Available | 1 |
| Optimizing Quantum Variational Circuits with Deep Reinforcement Learning | Sep 7, 2021 | BIG-bench Machine LearningDeep Reinforcement Learning | CodeCode Available | 1 |
| Hierarchical Object-to-Zone Graph for Object Navigation | Sep 5, 2021 | Deep Reinforcement LearningObject | CodeCode Available | 1 |
| Learning to Synthesize Programs as Interpretable and Generalizable Policies | Aug 31, 2021 | Deep Reinforcement LearningProgram Synthesis | CodeCode Available | 1 |
| WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU | Aug 31, 2021 | CPUDecision Making | CodeCode Available | 1 |