| Enabling A Network AI Gym for Autonomous Cyber Agents | Apr 3, 2023 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| Solving Dynamic Traveling Salesman Problems With Deep Reinforcement Learning | Apr 1, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 2 |
| Multi-Microgrid Collaborative Optimization Scheduling Using an Improved Multi-Agent Soft Actor-Critic Algorithm | Apr 1, 2023 | AutoMLDeep Reinforcement Learning | —Unverified | 0 |
| Physical Deep Reinforcement Learning Towards Safety Guarantee | Mar 29, 2023 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Learning Complicated Manipulation Skills via Deterministic Policy with Limited Demonstrations | Mar 29, 2023 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Quantum Deep Hedging | Mar 29, 2023 | Deep Reinforcement LearningQuantum Machine Learning | —Unverified | 0 |
| On the Use of Reinforcement Learning for Attacking and Defending Load Frequency Control | Mar 28, 2023 | Deep Reinforcement Learning | —Unverified | 0 |
| Adaptive Background Music for a Fighting Game: A Multi-Instrument Volume Modulation Approach | Mar 28, 2023 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Bi-Manual Block Assembly via Sim-to-Real Reinforcement Learning | Mar 27, 2023 | Collision AvoidanceDeep Reinforcement Learning | —Unverified | 0 |
| Robust Path Following on Rivers Using Bootstrapped Reinforcement Learning | Mar 24, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Optimal Smoothing Distribution Exploration for Backdoor Neutralization in Deep Learning-based Traffic Systems | Mar 24, 2023 | Autonomous VehiclesDeep Reinforcement Learning | —Unverified | 0 |
| RLOR: A Flexible Framework of Deep Reinforcement Learning for Operation Research | Mar 23, 2023 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 1 |
| Connected Superlevel Set in (Deep) Reinforcement Learning and its Application to Minimax Theorems | Mar 23, 2023 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| HAPS-UAV-Enabled Heterogeneous Networks: A Deep Reinforcement Learning Approach | Mar 22, 2023 | Deep Reinforcement LearningFairness | —Unverified | 0 |
| Deep Reinforcement Learning for Localizability-Enhanced Navigation in Dynamic Human Environments | Mar 22, 2023 | Deep Reinforcement LearningNavigate | —Unverified | 0 |
| Distributed Two-tier DRL Framework for Cell-Free Network: Association, Beamforming and Power Allocation | Mar 22, 2023 | Deep Reinforcement Learning | CodeCode Available | 1 |
| P^3O: Transferring Visual Representations for Reinforcement Learning via Prompting | Mar 22, 2023 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Wasserstein Auto-encoded MDPs: Formal Verification of Efficiently Distilled RL Policies with Many-sided Guarantees | Mar 22, 2023 | Deep Reinforcement Learning | CodeCode Available | 0 |
| Large-Scale Traffic Signal Control Using Constrained Network Partition and Adaptive Deep Reinforcement Learning | Mar 21, 2023 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Bridging Transient and Steady-State Performance in Voltage Control: A Reinforcement Learning Approach with Safe Gradient Flow | Mar 20, 2023 | Deep Reinforcement Learning | —Unverified | 0 |
| Multi-modal reward for visual relationships-based image captioning | Mar 19, 2023 | Caption GenerationDeep Reinforcement Learning | —Unverified | 0 |
| Active hypothesis testing in unknown environments using recurrent neural networks and model free reinforcement learning | Mar 19, 2023 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Mobile Edge Adversarial Detection for Digital Twinning to the Metaverse with Deep Reinforcement Learning | Mar 18, 2023 | Deep Reinforcement Learning | —Unverified | 0 |
| Conversational Tree Search: A New Hybrid Dialog Task | Mar 17, 2023 | Deep Reinforcement LearningInformation Retrieval | CodeCode Available | 0 |
| Measurement Optimization under Uncertainty using Deep Reinforcement Learning | Mar 17, 2023 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Energy Management of Multi-mode Plug-in Hybrid Electric Vehicle using Multi-agent Deep Reinforcement Learning | Mar 16, 2023 | Deep Reinforcement Learningenergy management | —Unverified | 0 |
| Efficient Learning of High Level Plans from Play | Mar 16, 2023 | Deep Reinforcement LearningMotion Planning | —Unverified | 0 |
| Psychotherapy AI Companion with Reinforcement Learning Recommendations and Interpretable Policy Dynamics | Mar 16, 2023 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Residual Physics Learning and System Identification for Sim-to-real Transfer of Policies on Buoyancy Assisted Legged Robots | Mar 16, 2023 | Deep Reinforcement Learning | —Unverified | 0 |
| Learning Rewards to Optimize Global Performance Metrics in Deep Reinforcement Learning | Mar 16, 2023 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Latent-Conditioned Policy Gradient for Multi-Objective Deep Reinforcement Learning | Mar 15, 2023 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Learning to Transfer In-Hand Manipulations Using a Greedy Shape Curriculum | Mar 14, 2023 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Learning Model-Free Robust Precoding for Cooperative Multibeam Satellite Communications | Mar 13, 2023 | Deep Reinforcement Learning | CodeCode Available | 0 |
| Loss of Plasticity in Continual Deep Reinforcement Learning | Mar 13, 2023 | Atari GamesContinual Learning | —Unverified | 0 |
| Synthetic Experience Replay | Mar 12, 2023 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| AutoDenoise: Automatic Data Instance Denoising for Recommendations | Mar 12, 2023 | Deep Reinforcement LearningDenoising | —Unverified | 0 |
| Towards Practical Multi-Robot Hybrid Tasks Allocation for Autonomous Cleaning | Mar 12, 2023 | Deep Reinforcement Learning | CodeCode Available | 1 |
| Deep Reinforcement Learning Based Power Allocation for Minimizing AoI and Energy Consumption in MIMO-NOMA IoT Systems | Mar 11, 2023 | Deep Reinforcement Learning | —Unverified | 0 |
| Understanding the Synergies between Quality-Diversity and Deep Reinforcement Learning | Mar 10, 2023 | Deep Reinforcement LearningDiversity | —Unverified | 0 |
| Solving routing problems for multiple cooperative Unmanned Aerial Vehicles using Transformer networks, vol. 122, pp. 106085, 2023 | Mar 9, 2023 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 0 |
| Conceptual Reinforcement Learning for Language-Conditioned Tasks | Mar 9, 2023 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Quantum Power Electronics: From Theory to Implementation | Mar 8, 2023 | Deep Reinforcement Learning | —Unverified | 0 |
| Using Memory-Based Learning to Solve Tasks with State-Action Constraints | Mar 8, 2023 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Virtual Reality in Metaverse over Wireless Networks with User-centered Deep Reinforcement Learning | Mar 8, 2023 | Deep Reinforcement Learning | —Unverified | 0 |
| Learning Bipedal Walking for Humanoids with Current Feedback | Mar 7, 2023 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 3 |
| Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End Policy and Optimistic Smooth Fictitious Play | Mar 7, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| MAP-Elites with Descriptor-Conditioned Gradients and Archive Distillation into a Single Policy | Mar 7, 2023 | Deep Reinforcement Learning | —Unverified | 0 |
| Efficient Skill Acquisition for Complex Manipulation Tasks in Obstructed Environments | Mar 6, 2023 | Deep Reinforcement LearningMotion Planning | —Unverified | 0 |
| Deep symbolic regression for physics guided by units constraints: toward the automated discovery of physical laws | Mar 6, 2023 | Deep Reinforcement Learningregression | CodeCode Available | 3 |
| Swim: A General-Purpose, High-Performing, and Efficient Activation Function for Locomotion Control Tasks | Mar 5, 2023 | continuous-controlContinuous Control | CodeCode Available | 0 |