| Deep Reinforcement Learning-based Anti-jamming Power Allocation in a Two-cell NOMA Network | Jan 1, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Curriculum-based Deep Reinforcement Learning for Quantum Control | Dec 31, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Autonomous Maintenance in IoT Networks via AoI-driven Deep Reinforcement Learning | Dec 31, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Relational Deep Reinforcement Learning for Routing in Wireless Networks | Dec 31, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Robotic Grasping of Fully-Occluded Objects using RF Perception | Dec 31, 2020 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |
| A Deep Reinforcement Learning Based Multi-Criteria Decision Support System for Textile Manufacturing Process Optimization | Dec 29, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Leveraging AI and Intelligent Reflecting Surface for Energy-Efficient Communication in 6G IoT | Dec 29, 2020 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Federated Multi-Agent Actor-Critic Learning for Age Sensitive Mobile Edge Computing | Dec 28, 2020 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 |
| Risk-Sensitive Deep RL: Variance-Constrained Actor-Critic Provably Finds Globally Optimal Policy | Dec 28, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Portfolio Optimization with 2D Relative-Attentional Gated Transformer | Dec 27, 2020 | Deep Reinforcement LearningPortfolio Optimization | —Unverified | 0 |
| Deep Reinforcement Learning for Long-Short Portfolio Optimization | Dec 26, 2020 | Deep Reinforcement LearningManagement | CodeCode Available | 0 |
| Towards sample-efficient episodic control with DAC-ML | Dec 26, 2020 | Deep Reinforcement LearningHippocampus | —Unverified | 0 |
| Learning Vehicle Routing Problems using Policy Optimisation | Dec 24, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A State Representation Dueling Network for Deep Reinforcement Learning | Dec 24, 2020 | Deep Reinforcement LearningGeneral Reinforcement Learning | —Unverified | 0 |
| Auto-Agent-Distiller: Towards Efficient Deep Reinforcement Learning Agents via Neural Architecture Search | Dec 24, 2020 | Deep Reinforcement LearningNeural Architecture Search | —Unverified | 0 |
| SCC: an efficient deep reinforcement learning agent mastering the game of StarCraft II | Dec 24, 2020 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Rethink AI-based Power Grid Control: Diving Into Algorithm Design | Dec 23, 2020 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Intelligent Resource Allocation in Dense LoRa Networks using Deep Reinforcement Learning | Dec 22, 2020 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Scalable Deep Reinforcement Learning for Routing and Spectrum Access in Physical Layer | Dec 22, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learning a Group-Aware Policy for Robot Navigation | Dec 22, 2020 | Deep Reinforcement LearningRobot Navigation | —Unverified | 0 |
| Mobile Robot Planner with Low-cost Cameras Using Deep Reinforcement Learning | Dec 21, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Forming Real-World Human-Robot Cooperation for Tasks With General Goal | Dec 19, 2020 | Bayesian InferenceDeep Reinforcement Learning | —Unverified | 0 |
| Minimax Strikes Back | Dec 19, 2020 | Deep Reinforcement LearningGPU | —Unverified | 0 |
| Embodied Visual Active Learning for Semantic Segmentation | Dec 17, 2020 | Active LearningDeep Reinforcement Learning | —Unverified | 0 |
| Model-free and Bayesian Ensembling Model-based Deep Reinforcement Learning for Particle Accelerator Control Demonstrated on the FERMI FEL | Dec 17, 2020 | Deep Reinforcement Learningmodel | CodeCode Available | 0 |
| Towards Optimal District Heating Temperature Control in China with Deep Reinforcement Learning | Dec 17, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Autotelic Agents with Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short Survey | Dec 17, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning | Dec 17, 2020 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| LiveMap: Real-Time Dynamic Map in Automotive Edge Computing | Dec 16, 2020 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation | Dec 16, 2020 | Deep Reinforcement LearningDistributional Reinforcement Learning | —Unverified | 0 |
| Super Reinforcement Bros: Playing Super Mario Bros with Reinforcement Learning | Dec 14, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Learning Mobile Robot Navigation in the Dense Crowd with Deep Reinforcement Learning | Dec 14, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Towards Understanding Deep Policy Gradients: A Case Study on PPO | Dec 14, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Mobile Robots Exploration via Deep Reinforcement Learning | Dec 14, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Smoothing Deep Reinforcement Learning for Power Control for Spectrum Sharing in Cognitive Radios | Dec 14, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| IPM Move Planner: AN EFFICIENT EXPLOITING DEEP REINFORCEMENT LEARNING WITH MONTE CARLO TREE SEARCH | Dec 14, 2020 | BlockingDeep Reinforcement Learning | —Unverified | 0 |
| Learn to Play Tetris with Deep Reinforcement Learning | Dec 14, 2020 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| A Reinforcement Learning Formulation of the Lyapunov Optimization: Application to Edge Computing Systems with Queue Stability | Dec 14, 2020 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 |
| Regularizing Action Policies for Smooth Control with Reinforcement Learning | Dec 11, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deep Reinforcement Learning for Stock Portfolio Optimization | Dec 9, 2020 | Deep Reinforcement LearningPortfolio Optimization | —Unverified | 0 |
| Interactive Search Based on Deep Reinforcement Learning | Dec 9, 2020 | ClusteringDecision Making | —Unverified | 0 |
| Deep Reinforcement Learning for Long Term Hydropower Production Scheduling | Dec 9, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A Deep Reinforcement Learning Approach for Ramp Metering Based on Traffic Video Data | Dec 9, 2020 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Resolving Implicit Coordination in Multi-Agent Deep Reinforcement Learning with Deep Q-Networks & Game Theory | Dec 8, 2020 | Deep Reinforcement LearningOpenAI Gym | CodeCode Available | 0 |
| Emergence of Different Modes of Tool Use in a Reaching and Dragging Task | Dec 8, 2020 | Deep Reinforcement LearningFriction | —Unverified | 0 |
| The Architectural Implications of Distributed Reinforcement Learning on CPU-GPU Systems | Dec 8, 2020 | CPUDeep Reinforcement Learning | —Unverified | 0 |
| Efficient Reservoir Management through Deep Reinforcement Learning | Dec 7, 2020 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Battery Model Calibration with Deep Reinforcement Learning | Dec 7, 2020 | BIG-bench Machine LearningDeep Reinforcement Learning | —Unverified | 0 |
| Deep Policy Networks for NPC Behaviors that Adapt to Changing Design Parameters in Roguelike Games | Dec 7, 2020 | Deep Reinforcement LearningGame Design | —Unverified | 0 |
| Fever Basketball: A Complex, Flexible, and Asynchronized Sports Game Environment for Multi-agent Reinforcement Learning | Dec 6, 2020 | Board GamesDeep Reinforcement Learning | —Unverified | 0 |