| Hard instance learning for quantum adiabatic prime factorization | Oct 10, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Multi-condition multi-objective optimization using deep reinforcement learning | Oct 10, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Theoretically Principled Deep RL Acceleration via Nearest Neighbor Function Approximation | Oct 9, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| ABCP: Automatic Block-wise and Channel-wise Network Pruning via Joint Search | Oct 8, 2021 | Deep Reinforcement LearningNetwork Pruning | CodeCode Available | 0 |
| CheerBots: Chatbots toward Empathy and Emotionusing Reinforcement Learning | Oct 8, 2021 | ChatbotDeep Reinforcement Learning | —Unverified | 0 |
| Robotic Lever Manipulation using Hindsight Experience Replay and Shapley Additive Explanations | Oct 7, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learning Pessimism for Robust and Efficient Off-Policy Reinforcement Learning | Oct 7, 2021 | Continuous ControlDeep Reinforcement Learning | —Unverified | 0 |
| Generalization in Deep RL for TSP Problems via Equivariance and Local Search | Oct 7, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Explaining Deep Reinforcement Learning Agents In The Atari Domain through a Surrogate Model | Oct 7, 2021 | Atari GamesDecision Making | —Unverified | 0 |
| How to Sense the World: Leveraging Hierarchy in Multimodal Perception for Robust Reinforcement Learning Agents | Oct 7, 2021 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Heterogeneous Attentions for Solving Pickup and Delivery Problem via Deep Reinforcement Learning | Oct 6, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learning Multi-Objective Curricula for Robotic Policy Learning | Oct 6, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| Adaptive control of a mechatronic system using constrained residual reinforcement learning | Oct 6, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Can an AI agent hit a moving target? | Oct 6, 2021 | AI AgentDecision Making | —Unverified | 0 |
| Pretraining & Reinforcement Learning: Sharpening the Axe Before Cutting the Tree | Oct 6, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| On The Transferability of Deep-Q Networks | Oct 6, 2021 | Deep Reinforcement LearningTransfer Learning | —Unverified | 0 |
| Improving Generalization of Deep Reinforcement Learning-based TSP Solvers | Oct 6, 2021 | Deep Reinforcement LearningGraph Neural Network | —Unverified | 0 |
| Optimized Recommender Systems with Deep Reinforcement Learning | Oct 6, 2021 | Deep Reinforcement LearningRecommendation Systems | CodeCode Available | 0 |
| Deep reinforcement learning for guidewire navigation in coronary artery phantom | Oct 5, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Mining for Potent Inhibitors through Artificial Intelligence and Physics: A Unified Methodology for Ligand Based and Structure Based Drug Design | Oct 5, 2021 | Deep Reinforcement LearningDrug Design | —Unverified | 0 |
| A Deep Reinforcement Learning Framework for Contention-Based Spectrum Sharing | Oct 5, 2021 | Deep Reinforcement LearningFairness | —Unverified | 0 |
| NaRLE: Natural Language Models using Reinforcement Learning with Emotion Feedback | Oct 5, 2021 | Deep Reinforcement Learningintent-classification | —Unverified | 0 |
| NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL | Oct 5, 2021 | Deep Reinforcement Learning | CodeCode Available | 0 |
| DeepEdge: A Deep Reinforcement Learning based Task Orchestrator for Edge Computing | Oct 5, 2021 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 |
| Hierarchical Primitive Composition: Simultaneous Activation of Skills with Inconsistent Action Dimensions in Multiple Hierarchies | Oct 5, 2021 | Deep Reinforcement LearningHierarchical Reinforcement Learning | —Unverified | 0 |