| Mitigating Off-Policy Bias in Actor-Critic Methods with One-Step Q-learning: A Novel Correction Approach | Aug 1, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Sampling, Communication, and Prediction Co-Design for Synchronizing the Real-World Device and Digital Model in Metaverse | Jul 31, 2022 | Deep Reinforcement LearningMixed Reality | —Unverified | 0 |
| DRL-M4MR: An Intelligent Multicast Routing Approach Based on DQN Deep Reinforcement Learning in SDN | Jul 31, 2022 | Deep Reinforcement Learning | CodeCode Available | 1 |
| Unified Automatic Control of Vehicular Systems with Reinforcement Learning | Jul 30, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Solving the vehicle routing problem with deep reinforcement learning | Jul 30, 2022 | Combinatorial OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning for System-on-Chip: Myths and Realities | Jul 29, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Cyclic Policy Distillation: Sample-Efficient Sim-to-Real Reinforcement Learning with Domain Randomization | Jul 29, 2022 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning | Jul 29, 2022 | Contrastive LearningDeep Reinforcement Learning | CodeCode Available | 1 |
| Multi-Objective Provisioning of Network Slices using Deep Reinforcement Learning | Jul 27, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| AADG: Automatic Augmentation for Domain Generalization on Retinal Image Segmentation | Jul 27, 2022 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 1 |
| Safe and Robust Experience Sharing for Deterministic Policy Gradient Algorithms | Jul 27, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Learning Bipedal Walking On Planned Footsteps For Humanoid Robots | Jul 26, 2022 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 3 |
| Vision-Aided Blockage Avoidance in UAV-assisted V2X Communications | Jul 26, 2022 | Deep Reinforcement Learning | —Unverified | 0 |
| REPNP: Plug-and-Play with Deep Reinforcement Learning Prior for Robust Image Restoration | Jul 25, 2022 | DeblurringDeep Reinforcement Learning | —Unverified | 0 |
| Learning Soccer Juggling Skills with Layer-wise Mixture-of-Experts | Jul 24, 2022 | Deep Reinforcement LearningHumanoid Control | CodeCode Available | 1 |
| Learning an Adaptive Forwarding Strategy for Mobile Wireless Networks: Resource Usage vs. Latency | Jul 23, 2022 | Deep Reinforcement Learning | —Unverified | 0 |
| Epersist: A Self Balancing Robot Using PID Controller And Deep Reinforcement Learning | Jul 23, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Halftoning with Multi-Agent Deep Reinforcement Learning | Jul 23, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Multi-Asset Closed-Loop Reservoir Management Using Deep Reinforcement Learning | Jul 21, 2022 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Reinforcement learning for Energies of the future and carbon neutrality: a Challenge Design | Jul 21, 2022 | Deep Reinforcement LearningManagement | CodeCode Available | 1 |
| Knowledge-enhanced Black-box Attacks for Recommendations | Jul 21, 2022 | AttributeDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Market Making Under a Hawkes Process-Based Limit Order Book Model | Jul 20, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Magpie: Automatically Tuning Static Parameters for Distributed File Systems using Deep Reinforcement Learning | Jul 19, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Abstract Demonstrations and Adaptive Exploration for Efficient and Stable Multi-step Sparse Reward Reinforcement Learning | Jul 19, 2022 | Deep Reinforcement Learning | CodeCode Available | 0 |
| A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games | Jul 18, 2022 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |