| Recurrent Off-policy Baselines for Memory-based Continuous Control | Oct 25, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Uniformly Conservative Exploration in Reinforcement Learning | Oct 25, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent agents | Oct 18, 2021 | Deep Reinforcement LearningJob Shop Scheduling | CodeCode Available | 1 |
| MARVEL: Raster Manga Vectorization via Primitive-wise Deep Reinforcement Learning | Oct 10, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations | Oct 9, 2021 | Deep Reinforcement LearningStarcraft | CodeCode Available | 1 |
| Augmenting Reinforcement Learning with Behavior Primitives for Diverse Manipulation Tasks | Oct 7, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Replay-Guided Adversarial Environment Design | Oct 6, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Deep Reinforcement Learning for Solving the Heterogeneous Capacitated Vehicle Routing Problem | Oct 6, 2021 | DecoderDeep Reinforcement Learning | CodeCode Available | 1 |
| Continuous-Time Fitted Value Iteration for Robust Policies | Oct 5, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Large Batch Experience Replay | Oct 4, 2021 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| Collective eXplainable AI: Explaining Cooperative Strategies and Agent Contribution in Multiagent Reinforcement Learning with Shapley Values | Oct 4, 2021 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Unified Data Collection for Visual-Inertial Calibration via Deep Reinforcement Learning | Sep 30, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning | Sep 29, 2021 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 1 |
| Emergent behavior and neural dynamics in artificial agents tracking turbulent plumes | Sep 25, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Enhancing Navigational Safety in Crowded Environments using Semantic-Deep-Reinforcement-Learning-based Navigation | Sep 23, 2021 | Deep Reinforcement LearningNavigate | CodeCode Available | 1 |
| ENERO: Efficient Real-Time WAN Routing Optimization with Deep Reinforcement Learning | Sep 22, 2021 | Deep Reinforcement LearningGraph Neural Network | CodeCode Available | 1 |
| Hierarchical Policy for Non-prehensile Multi-object Rearrangement with Deep Reinforcement Learning and Monte Carlo Tree Search | Sep 18, 2021 | Deep Reinforcement LearningObject | CodeCode Available | 1 |
| Learning to Navigate Intersections with Unsupervised Driver Trait Inference | Sep 14, 2021 | Autonomous NavigationAutonomous Vehicles | CodeCode Available | 1 |
| Focus on Impact: Indoor Exploration with Intrinsic Motivation | Sep 14, 2021 | Deep Reinforcement Learning | CodeCode Available | 1 |
| Learning Selective Communication for Multi-Agent Path Finding | Sep 12, 2021 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| DROP: Deep relocating option policy for optimal ride-hailing vehicle repositioning | Sep 9, 2021 | Deep Reinforcement Learning | CodeCode Available | 1 |
| Optimizing Quantum Variational Circuits with Deep Reinforcement Learning | Sep 7, 2021 | BIG-bench Machine LearningDeep Reinforcement Learning | CodeCode Available | 1 |
| Hierarchical Object-to-Zone Graph for Object Navigation | Sep 5, 2021 | Deep Reinforcement LearningObject | CodeCode Available | 1 |
| WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU | Aug 31, 2021 | CPUDecision Making | CodeCode Available | 1 |
| Learning to Synthesize Programs as Interpretable and Generalizable Policies | Aug 31, 2021 | Deep Reinforcement LearningProgram Synthesis | CodeCode Available | 1 |
| Deep Reinforcement Learning at the Edge of the Statistical Precipice | Aug 30, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Reinforcement Learning based Condition-oriented Maintenance Scheduling for Flow Line Systems | Aug 27, 2021 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Responsive Regulation of Dynamic UAV Communication Networks Based on Deep Reinforcement Learning | Aug 25, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Diversity-based Trajectory and Goal Selection with Hindsight Experience Replay | Aug 17, 2021 | Deep Reinforcement LearningDiversity | CodeCode Available | 1 |
| Safe Deep Reinforcement Learning for Multi-Agent Systems with Continuous Action Spaces | Aug 9, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| The AI Economist: Optimal Economic Policy Design via Two-level Deep Reinforcement Learning | Aug 5, 2021 | counterfactualDeep Reinforcement Learning | CodeCode Available | 1 |
| Finding Failures in High-Fidelity Simulation using Adaptive Stress Testing and the Backward Algorithm | Jul 27, 2021 | Autonomous VehiclesDeep Reinforcement Learning | CodeCode Available | 1 |
| MarsExplorer: Exploration of Unknown Terrains via Deep Reinforcement Learning and Procedurally Generated Environments | Jul 21, 2021 | Deep Reinforcement LearningOpenAI Gym | CodeCode Available | 1 |
| Co-designing Intelligent Control of Building HVACs and Microgrids | Jul 18, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| A Gentle Introduction to Conformal Prediction and Distribution-Free Uncertainty Quantification | Jul 15, 2021 | Conformal PredictionDeep Reinforcement Learning | CodeCode Available | 1 |
| ReLLIE: Deep Reinforcement Learning for Customized Low-Light Image Enhancement | Jul 13, 2021 | Deep Reinforcement LearningImage Enhancement | CodeCode Available | 1 |
| Distributed Online Service Coordination Using Deep Reinforcement Learning | Jul 7, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Multi-Modal Mutual Information (MuMMI) Training for Robust Self-Supervised Deep Reinforcement Learning | Jul 6, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation | Jul 5, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Predicting Human Scanpaths in Visual Question Answering | Jun 19, 2021 | Deep Reinforcement LearningQuestion Answering | CodeCode Available | 1 |
| Towards Safe Reinforcement Learning via Constraining Conditional Value at Risk | Jun 18, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Tactile Sim-to-Real Policy Transfer via Real-to-Sim Image Translation | Jun 16, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Deep Reinforcement Learning for Conservation Decisions | Jun 15, 2021 | BIG-bench Machine LearningDeep Reinforcement Learning | CodeCode Available | 1 |
| Deep Reinforcement Learning based Group Recommender System | Jun 13, 2021 | Deep Reinforcement LearningRecommendation Systems | CodeCode Available | 1 |
| A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation | Jun 12, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Pretraining Representations for Data-Efficient Reinforcement Learning | Jun 9, 2021 | Atari GamesAtari Games 100k | CodeCode Available | 1 |
| Pretrained Encoders are All You Need | Jun 9, 2021 | AllContrastive Learning | CodeCode Available | 1 |
| Learning Markov State Abstractions for Deep Reinforcement Learning | Jun 8, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for Reinforcement Learning | Jun 8, 2021 | Continuous Control (100k environment steps)Continuous Control (500k environment steps) | CodeCode Available | 1 |
| Dynamic Sparse Training for Deep Reinforcement Learning | Jun 8, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |