| CheerBots: Chatbots toward Empathy and Emotion using Reinforcement Learning | Oct 8, 2021 | Chatbot, Deep Reinforcement Learning | Unverified | 0 |
| ABCP: Automatic Block-wise and Channel-wise Network Pruning via Joint Search | Oct 8, 2021 | Deep Reinforcement Learning, Network Pruning | Code Available | 0 |
| Explaining Deep Reinforcement Learning Agents In The Atari Domain through a Surrogate Model | Oct 7, 2021 | Atari Games, Decision Making | Unverified | 0 |
| Robotic Lever Manipulation using Hindsight Experience Replay and Shapley Additive Explanations | Oct 7, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Augmenting Reinforcement Learning with Behavior Primitives for Diverse Manipulation Tasks | Oct 7, 2021 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Learning Pessimism for Robust and Efficient Off-Policy Reinforcement Learning | Oct 7, 2021 | Continuous Control, Deep Reinforcement Learning | Unverified | 0 |
| Generalization in Deep RL for TSP Problems via Equivariance and Local Search | Oct 7, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| How to Sense the World: Leveraging Hierarchy in Multimodal Perception for Robust Reinforcement Learning Agents | Oct 7, 2021 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Learning Multi-Objective Curricula for Robotic Policy Learning | Oct 6, 2021 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 0 |
| Optimized Recommender Systems with Deep Reinforcement Learning | Oct 6, 2021 | Deep Reinforcement Learning, Recommendation Systems | Code Available | 0 |
| Can an AI agent hit a moving target? | Oct 6, 2021 | AI Agent, Decision Making | Unverified | 0 |
| Replay-Guided Adversarial Environment Design | Oct 6, 2021 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 1 |
| Adaptive control of a mechatronic system using constrained residual reinforcement learning | Oct 6, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| On The Transferability of Deep-Q Networks | Oct 6, 2021 | Deep Reinforcement Learning, Transfer Learning | Unverified | 0 |
| Improving Generalization of Deep Reinforcement Learning-based TSP Solvers | Oct 6, 2021 | Deep Reinforcement Learning, Graph Neural Network | Unverified | 0 |
| Heterogeneous Attentions for Solving Pickup and Delivery Problem via Deep Reinforcement Learning | Oct 6, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Deep Reinforcement Learning for Solving the Heterogeneous Capacitated Vehicle Routing Problem | Oct 6, 2021 | Decoder, Deep Reinforcement Learning | Code Available | 1 |
| Pretraining & Reinforcement Learning: Sharpening the Axe Before Cutting the Tree | Oct 6, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Decentralized Cooperative Lane Changing at Freeway Weaving Areas Using Multi-Agent Deep Reinforcement Learning | Oct 5, 2021 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Unverified | 0 |
| NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL | Oct 5, 2021 | Deep Reinforcement Learning | Code Available | 0 |
| DeepEdge: A Deep Reinforcement Learning based Task Orchestrator for Edge Computing | Oct 5, 2021 | Deep Reinforcement Learning, Edge-computing | Unverified | 0 |
| Hierarchical Primitive Composition: Simultaneous Activation of Skills with Inconsistent Action Dimensions in Multiple Hierarchies | Oct 5, 2021 | Deep Reinforcement Learning, Hierarchical Reinforcement Learning | Unverified | 0 |
| Continuous-Time Fitted Value Iteration for Robust Policies | Oct 5, 2021 | continuous-control, Continuous Control | Code Available | 1 |
| NaRLE: Natural Language Models using Reinforcement Learning with Emotion Feedback | Oct 5, 2021 | Deep Reinforcement Learning, intent-classification | Unverified | 0 |
| A Deep Reinforcement Learning Framework for Contention-Based Spectrum Sharing | Oct 5, 2021 | Deep Reinforcement Learning, Fairness | Unverified | 0 |