| Multi-IRS-assisted Multi-Cell Uplink MIMO Communications under Imperfect CSI: A Deep Reinforcement Learning Approach | Nov 2, 2020 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Causal Campbell-Goodhart's law and Reinforcement Learning | Nov 2, 2020 | Causal Inference, Decision Making | Code Available | 0 |
| Learning a Deep Reinforcement Learning Policy Over the Latent Space of a Pre-trained GAN for Semantic Age Manipulation | Nov 2, 2020 | Deep Reinforcement Learning | —Unverified | 0 |
| Actor-Double-Critic: Incorporating Model-Based Critic for Task-Oriented Dialogue Systems | Nov 1, 2020 | Deep Reinforcement Learning, Spoken Dialogue Systems | —Unverified | 0 |
| Can a Robot Trust You? A DRL-Based Approach to Trust-Driven Human-Guided Navigation | Nov 1, 2020 | Deep Reinforcement Learning, Navigate | —Unverified | 0 |
| Learning When to Switch: Composing Controllers to Traverse a Sequence of Terrain Artifacts | Nov 1, 2020 | Deep Reinforcement Learning | —Unverified | 0 |
| Efficient Learning of Control Policies for Robust Quadruped Bounding using Pretrained Neural Networks | Nov 1, 2020 | Deep Reinforcement Learning, Feature Engineering | —Unverified | 0 |
| Deep Reactive Planning in Dynamic Environments | Oct 31, 2020 | Deep Reinforcement Learning | —Unverified | 0 |
| Topic-Preserving Synthetic News Generation: An Adversarial Deep Reinforcement Learning Approach | Oct 30, 2020 | Deep Reinforcement Learning, Language Modeling | —Unverified | 0 |
| Machine versus Human Attention in Deep Reinforcement Learning Tasks | Oct 29, 2020 | Atari Games, Deep Reinforcement Learning | —Unverified | 0 |
| DeepFoldit -- A Deep Reinforcement Learning Neural Network Folding Proteins | Oct 28, 2020 | Deep Reinforcement Learning, Protein Structure Prediction | —Unverified | 0 |
| Designing Interpretable Approximations to Deep Reinforcement Learning | Oct 28, 2020 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 |
| Can Reinforcement Learning for Continuous Control Generalize Across Physics Engines? | Oct 27, 2020 | continuous-control, Continuous Control | —Unverified | 0 |
| Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning | Oct 27, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Learning Financial Asset-Specific Trading Rules via Deep Reinforcement Learning | Oct 27, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Pairwise heuristic sequence alignment algorithm based on deep reinforcement learning | Oct 26, 2020 | Deep Reinforcement Learning, Multiple Sequence Alignment | —Unverified | 0 |
| Behavioral decision-making for urban autonomous driving in the presence of pedestrians using Deep Recurrent Q-Network | Oct 26, 2020 | Autonomous Driving, Decision Making | —Unverified | 0 |
| Lyapunov-Based Reinforcement Learning State Estimator | Oct 26, 2020 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 |
| Personalised Meta-path Generation for Heterogeneous GNNs | Oct 26, 2020 | Deep Reinforcement Learning, Graph Representation Learning | Code Available | 1 |
| XLVIN: eXecuted Latent Value Iteration Nets | Oct 25, 2020 | Deep Reinforcement Learning, Graph Representation Learning | —Unverified | 0 |
| How to Make Deep RL Work in Practice | Oct 25, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Adaptive Federated Learning and Digital Twin for Industrial Internet of Things | Oct 25, 2020 | Clustering, Deep Reinforcement Learning | —Unverified | 0 |
| Improving the Exploration of Deep Reinforcement Learning in Continuous Domains using Planning for Policy Search | Oct 24, 2020 | Deep Reinforcement Learning, Model-based Reinforcement Learning | —Unverified | 0 |
| Learning Guidance Rewards with Trajectory-space Smoothing | Oct 23, 2020 | Attribute, Deep Reinforcement Learning | Code Available | 1 |
| Multi-UAV Path Planning for Wireless Data Harvesting with Deep Reinforcement Learning | Oct 23, 2020 | Collision Avoidance, Deep Reinforcement Learning | Code Available | 1 |