| Memory-efficient Reinforcement Learning with Value-based Knowledge Consolidation | May 22, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Co-design of Embodied Neural Intelligence via Constrained Evolution | May 21, 2022 | Deep Reinforcement LearningGPU | —Unverified | 0 |
| Long Run Incremental Cost (LRIC) Distribution Network Pricing in UK, advising China's Distribution Network | May 20, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Adversarial joint attacks on legged robots | May 20, 2022 | Deep Reinforcement LearningOpenAI Gym | —Unverified | 0 |
| Adversarial Body Shape Search for Legged Robots | May 20, 2022 | Adversarial AttackDeep Reinforcement Learning | —Unverified | 0 |
| Task Relabelling for Multi-task Transfer using Successor Features | May 20, 2022 | Deep Reinforcement Learning | CodeCode Available | 0 |
| On Jointly Optimizing Partial Offloading and SFC Mapping: A Cooperative Dual-agent Deep Reinforcement Learning Approach | May 20, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Time Allocation and Directional Transmission in Joint Radar-Communication | May 19, 2022 | Autonomous VehiclesDecision Making Under Uncertainty | CodeCode Available | 1 |
| Dexterous Robotic Manipulation using Deep Reinforcement Learning and Knowledge Transfer for Complex Sparse Reward-based Tasks | May 19, 2022 | Deep Reinforcement LearningTransfer Learning | CodeCode Available | 0 |
| Data Valuation for Offline Reinforcement Learning | May 19, 2022 | Data ValuationDeep Reinforcement Learning | —Unverified | 0 |
| Routing and Placement of Macros using Deep Reinforcement Learning | May 19, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Distributed Multi-Agent Deep Reinforcement Learning for Robust Coordination against Noise | May 19, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Generating Explanations from Deep Reinforcement Learning Using Episodic Memory | May 18, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A2C is a special case of PPO | May 18, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Deep Reinforcement Learning Based on Location-Aware Imitation Environment for RIS-Aided mmWave MIMO Systems | May 18, 2022 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Policy Distillation with Selective Input Gradient Regularization for Efficient Interpretability | May 18, 2022 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Neighborhood Mixup Experience Replay: Local Convex Interpolation for Improved Sample Efficiency in Continuous Control Tasks | May 18, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Multibit Tries Packet Classification with Deep Reinforcement Learning | May 17, 2022 | ClassificationDeep Reinforcement Learning | —Unverified | 0 |
| Attacking and Defending Deep Reinforcement Learning Policies | May 16, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A Deep Reinforcement Learning Blind AI in DareFightingICE | May 16, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Many Field Packet Classification with Decomposition and Reinforcement Learning | May 16, 2022 | ClassificationDeep Reinforcement Learning | —Unverified | 0 |
| The Primacy Bias in Deep Reinforcement Learning | May 16, 2022 | Atari Games 100kDeep Reinforcement Learning | CodeCode Available | 1 |
| RoMFAC: A robust mean-field actor-critic reinforcement learning against adversarial perturbations on states | May 15, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| PrefixRL: Optimization of Parallel Prefix Circuits using Deep Reinforcement Learning | May 14, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deep Reinforcement Learning in mmW-NOMA: Joint Power Allocation and Hybrid Beamforming | May 13, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |