| Multi-IRS-assisted Multi-Cell Uplink MIMO Communications under Imperfect CSI: A Deep Reinforcement Learning Approach | Nov 2, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Causal Campbell-Goodhart's law and Reinforcement Learning | Nov 2, 2020 | Causal InferenceDecision Making | CodeCode Available | 0 |
| Learning a Deep Reinforcement Learning Policy Over the Latent Space of a Pre-trained GAN for Semantic Age Manipulation | Nov 2, 2020 | Deep Reinforcement Learning | —Unverified | 0 |
| Actor-Double-Critic: Incorporating Model-Based Critic for Task-Oriented Dialogue Systems | Nov 1, 2020 | Deep Reinforcement LearningSpoken Dialogue Systems | —Unverified | 0 |
| Can a Robot Trust You? A DRL-Based Approach to Trust-Driven Human-Guided Navigation | Nov 1, 2020 | Deep Reinforcement LearningNavigate | —Unverified | 0 |
| Learning When to Switch: Composing Controllers to Traverse a Sequence of Terrain Artifacts | Nov 1, 2020 | Deep Reinforcement Learning | —Unverified | 0 |
| Efficient Learning of Control Policies for Robust Quadruped Bounding using Pretrained Neural Networks | Nov 1, 2020 | Deep Reinforcement LearningFeature Engineering | —Unverified | 0 |
| Deep Reactive Planning in Dynamic Environments | Oct 31, 2020 | Deep Reinforcement Learning | —Unverified | 0 |
| Topic-Preserving Synthetic News Generation: An Adversarial Deep Reinforcement Learning Approach | Oct 30, 2020 | Deep Reinforcement LearningLanguage Modeling | —Unverified | 0 |
| Machine versus Human Attention in Deep Reinforcement Learning Tasks | Oct 29, 2020 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| DeepFoldit -- A Deep Reinforcement Learning Neural Network Folding Proteins | Oct 28, 2020 | Deep Reinforcement LearningProtein Structure Prediction | —Unverified | 0 |
| Designing Interpretable Approximations to Deep Reinforcement Learning | Oct 28, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Can Reinforcement Learning for Continuous Control Generalize Across Physics Engines? | Oct 27, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning | Oct 27, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Learning Financial Asset-Specific Trading Rules via Deep Reinforcement Learning | Oct 27, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Pairwise heuristic sequence alignment algorithm based on deep reinforcement learning | Oct 26, 2020 | Deep Reinforcement LearningMultiple Sequence Alignment | —Unverified | 0 |
| Behavioral decision-making for urban autonomous driving in the presence of pedestrians using Deep Recurrent Q-Network | Oct 26, 2020 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Lyapunov-Based Reinforcement Learning State Estimator | Oct 26, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Personalised Meta-path Generation for Heterogeneous GNNs | Oct 26, 2020 | Deep Reinforcement LearningGraph Representation Learning | CodeCode Available | 1 |
| XLVIN: eXecuted Latent Value Iteration Nets | Oct 25, 2020 | Deep Reinforcement LearningGraph Representation Learning | —Unverified | 0 |
| How to Make Deep RL Work in Practice | Oct 25, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Adaptive Federated Learning and Digital Twin for Industrial Internet of Things | Oct 25, 2020 | ClusteringDeep Reinforcement Learning | —Unverified | 0 |
| Improving the Exploration of Deep Reinforcement Learning in Continuous Domains using Planning for Policy Search | Oct 24, 2020 | Deep Reinforcement LearningModel-based Reinforcement Learning | —Unverified | 0 |
| Learning Guidance Rewards with Trajectory-space Smoothing | Oct 23, 2020 | AttributeDeep Reinforcement Learning | CodeCode Available | 1 |
| Multi-UAV Path Planning for Wireless Data Harvesting with Deep Reinforcement Learning | Oct 23, 2020 | Collision AvoidanceDeep Reinforcement Learning | CodeCode Available | 1 |
| Learning to Dispatch for Job Shop Scheduling via Deep Reinforcement Learning | Oct 23, 2020 | Deep Reinforcement LearningGraph Neural Network | CodeCode Available | 1 |
| Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning | Oct 23, 2020 | Deep Reinforcement LearningModel-based Reinforcement Learning | CodeCode Available | 1 |
| Reinforcement Learning with Combinatorial Actions: An Application to Vehicle Routing | Oct 22, 2020 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 1 |
| Deep Reinforcement Learning with Stacked Hierarchical Attention for Text-based Games | Oct 22, 2020 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Adversarial Attacks on Deep Algorithmic Trading Policies | Oct 22, 2020 | Algorithmic TradingDeep Reinforcement Learning | —Unverified | 0 |
| Motion Planner Augmented Reinforcement Learning for Robot Manipulation in Obstructed Environments | Oct 22, 2020 | Contact-rich ManipulationDeep Reinforcement Learning | —Unverified | 0 |
| Transferable Graph Optimizers for ML Compilers | Oct 21, 2020 | Deep Reinforcement LearningGraph Neural Network | —Unverified | 0 |
| Deep Surrogate Q-Learning for Autonomous Driving | Oct 21, 2020 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Visual Navigation in Real-World Indoor Environments Using End-to-End Deep Reinforcement Learning | Oct 21, 2020 | Deep Reinforcement LearningGPU | CodeCode Available | 1 |
| Correlation-aware Cooperative Multigroup Broadcast 360° Video Delivery Network: A Hierarchical Deep Reinforcement Learning Approach | Oct 21, 2020 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Improving Generalization in Reinforcement Learning with Mixture Regularization | Oct 21, 2020 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 1 |
| Learn to Navigate Maplessly with Varied LiDAR Configurations: A Support Point-Based Approach | Oct 20, 2020 | Deep Reinforcement LearningNavigate | —Unverified | 0 |
| Iterative Amortized Policy Optimization | Oct 20, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Deep Reinforcement Learning in Lane Merge Coordination for Connected Vehicles | Oct 20, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Negotiating Team Formation Using Deep Reinforcement Learning | Oct 20, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Integrating LEO Satellites and Multi-UAV Reinforcement Learning for Hybrid FSO/RF Non-Terrestrial Networks | Oct 20, 2020 | Deep Reinforcement LearningDimensionality Reduction | —Unverified | 0 |
| Quality of service based radar resource management using deep reinforcement learning | Oct 20, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Adaptive Network Slicing in 5G for Intelligent Vehicular Systems and Smart Cities | Oct 19, 2020 | Deep Reinforcement Learning | —Unverified | 0 |
| Evaluating the Safety of Deep Reinforcement Learning Models using Semi-Formal Verification | Oct 19, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning with Population-Coded Spiking Neural Network for Continuous Control | Oct 19, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Chance-Constrained Control with Lexicographic Deep Reinforcement Learning | Oct 19, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Scalable Evolution Strategies Pipeline for Solving the Vehicle Routing Problem | Oct 17, 2020 | Deep Reinforcement LearningGPU | —Unverified | 0 |
| Neural Large Neighborhood Search | Oct 17, 2020 | Combinatorial OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| Neural Algorithms for Graph Navigation | Oct 17, 2020 | Deep Reinforcement LearningGraph Learning | —Unverified | 0 |
| Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning | Oct 17, 2020 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |