| Language and Culture Internalisation for Human-Like Autotelic AI | Jun 2, 2022 | AttributeCultural Vocal Bursts Intensity Prediction | —Unverified | 0 |
| HEX: Human-in-the-loop Explainability via Deep Reinforcement Learning | Jun 2, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| RACA: Relation-Aware Credit Assignment for Ad-Hoc Cooperation in Multi-Agent Deep Reinforcement Learning | Jun 2, 2022 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| RLSS: A Deep Reinforcement Learning Algorithm for Sequential Scene Generation | Jun 1, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning | Jun 1, 2022 | Data AugmentationDeep Reinforcement Learning | —Unverified | 0 |
| Lessons Learned from Data-Driven Building Control Experiments: Contrasting Gaussian Process-based MPC, Bilevel DeePC, and Deep Reinforcement Learning | May 31, 2022 | Deep Reinforcement LearningGaussian Processes | —Unverified | 0 |
| Graph Backup: Data Efficient Backup Exploiting Markovian Transitions | May 31, 2022 | Atari Gamescounterfactual | CodeCode Available | 0 |
| Robust Longitudinal Control for Vehicular Autonomous Platoons Using Deep Reinforcement Learning | May 31, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning | May 30, 2022 | Data PoisoningDeep Reinforcement Learning | CodeCode Available | 0 |
| Frustratingly Easy Regularization on Representation Can Boost Deep Reinforcement Learning | May 29, 2022 | Continuous ControlDeep Reinforcement Learning | —Unverified | 0 |
| Survival Analysis on Structured Data using Deep Reinforcement Learning | May 28, 2022 | Deep LearningDeep Reinforcement Learning | —Unverified | 0 |
| GALOIS: Boosting Deep Reinforcement Learning via Generalizable Logic Synthesis | May 27, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Guided Exploration of Data Summaries | May 27, 2022 | Data SummarizationDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Distributed and Uncoordinated Cognitive Radios Resource Allocation | May 27, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Physics-Guided Hierarchical Reward Mechanism for Learning-Based Robotic Grasping | May 26, 2022 | Computational EfficiencyDeep Reinforcement Learning | —Unverified | 0 |
| Dynamic Network Reconfiguration for Entropy Maximization using Deep Reinforcement Learning | May 26, 2022 | Deep Reinforcement LearningNavigate | CodeCode Available | 0 |
| SFP: State-free Priors for Exploration in Off-Policy Reinforcement Learning | May 26, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Multi-Phase Multi-Objective Dexterous Manipulation with Adaptive Hierarchical Curriculum | May 26, 2022 | Deep Reinforcement Learning | —Unverified | 0 |
| Verifying Learning-Based Robotic Navigation Systems | May 26, 2022 | Deep Reinforcement LearningModel Selection | —Unverified | 0 |
| DRL-based Resource Allocation in Remote State Estimation | May 24, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Multi-class Imbalanced Training | May 24, 2022 | Deep Reinforcement Learningimbalanced classification | CodeCode Available | 0 |
| Emergent Communication through Metropolis-Hastings Naming Game with Deep Generative Models | May 24, 2022 | Bayesian InferenceDeep Reinforcement Learning | CodeCode Available | 0 |
| Spreading Factor assisted LoRa Localization with Deep Reinforcement Learning | May 23, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Generalization, Mayhems and Limits in Recurrent Proximal Policy Optimization | May 23, 2022 | BenchmarkingDeep Reinforcement Learning | —Unverified | 0 |
| Co-design of Embodied Neural Intelligence via Constrained Evolution | May 21, 2022 | Deep Reinforcement LearningGPU | —Unverified | 0 |
| Task Relabelling for Multi-task Transfer using Successor Features | May 20, 2022 | Deep Reinforcement Learning | CodeCode Available | 0 |
| Adversarial Body Shape Search for Legged Robots | May 20, 2022 | Adversarial AttackDeep Reinforcement Learning | —Unverified | 0 |
| Long Run Incremental Cost (LRIC) Distribution Network Pricing in UK, advising China's Distribution Network | May 20, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Adversarial joint attacks on legged robots | May 20, 2022 | Deep Reinforcement LearningOpenAI Gym | —Unverified | 0 |
| On Jointly Optimizing Partial Offloading and SFC Mapping: A Cooperative Dual-agent Deep Reinforcement Learning Approach | May 20, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Data Valuation for Offline Reinforcement Learning | May 19, 2022 | Data ValuationDeep Reinforcement Learning | —Unverified | 0 |
| Routing and Placement of Macros using Deep Reinforcement Learning | May 19, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Distributed Multi-Agent Deep Reinforcement Learning for Robust Coordination against Noise | May 19, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Dexterous Robotic Manipulation using Deep Reinforcement Learning and Knowledge Transfer for Complex Sparse Reward-based Tasks | May 19, 2022 | Deep Reinforcement LearningTransfer Learning | CodeCode Available | 0 |
| Neighborhood Mixup Experience Replay: Local Convex Interpolation for Improved Sample Efficiency in Continuous Control Tasks | May 18, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Deep Reinforcement Learning Based on Location-Aware Imitation Environment for RIS-Aided mmWave MIMO Systems | May 18, 2022 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Generating Explanations from Deep Reinforcement Learning Using Episodic Memory | May 18, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Policy Distillation with Selective Input Gradient Regularization for Efficient Interpretability | May 18, 2022 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Multibit Tries Packet Classification with Deep Reinforcement Learning | May 17, 2022 | ClassificationDeep Reinforcement Learning | —Unverified | 0 |
| Attacking and Defending Deep Reinforcement Learning Policies | May 16, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A Deep Reinforcement Learning Blind AI in DareFightingICE | May 16, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Many Field Packet Classification with Decomposition and Reinforcement Learning | May 16, 2022 | ClassificationDeep Reinforcement Learning | —Unverified | 0 |
| RoMFAC: A robust mean-field actor-critic reinforcement learning against adversarial perturbations on states | May 15, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| PrefixRL: Optimization of Parallel Prefix Circuits using Deep Reinforcement Learning | May 14, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Joint Power Allocation and Beamformer for mmW-NOMA Downlink Systems by Deep Reinforcement Learning | May 13, 2022 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Deep Reinforcement Learning in mmW-NOMA: Joint Power Allocation and Hybrid Beamforming | May 13, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Provably Safe Deep Reinforcement Learning for Robotic Manipulation in Human Environments | May 12, 2022 | Deep Reinforcement LearningMotion Planning | —Unverified | 0 |
| Learning to Guide Multiple Heterogeneous Actors from a Single Human Demonstration via Automatic Curriculum Learning in StarCraft II | May 11, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| On the Verge of Solving Rocket League using Deep Reinforcement Learning and Sim-to-sim Transfer | May 10, 2022 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Personalized QoE Enhancement for Adaptive Video Streaming: A Digital Twin-Assisted Scheme | May 9, 2022 | Deep Reinforcement LearningManagement | —Unverified | 0 |