| DIRECT: Learning from Sparse and Shifting Rewards using Discriminative Reward Co-Training | Jan 18, 2023 | Deep Reinforcement Learning, Imitation Learning | Unverified | 0 |
| Learning to solve arithmetic problems with a virtual abacus | Jan 17, 2023 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Adversarial Robust Deep Reinforcement Learning Requires Redefining Robustness | Jan 17, 2023 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| HiFlash: Communication-Efficient Hierarchical Federated Learning with Adaptive Staleness Control and Heterogeneity-aware Client-Edge Association | Jan 16, 2023 | Deep Reinforcement Learning, Edge-computing | Unverified | 0 |
| CogReact: A Reinforced Framework to Model Human Cognitive Reaction Modulated by Dynamic Intervention | Jan 15, 2023 | Deep Reinforcement Learning, Logical Reasoning | Unverified | 0 |
| Deep-Reinforcement-Learning-based Path Planning for Industrial Robots using Distance Sensors as Observation | Jan 14, 2023 | Deep Reinforcement Learning, Industrial Robots | Code Available | 1 |
| Semantic and Effective Communication for Remote Control Tasks with Dynamic Feature Compression | Jan 14, 2023 | Deep Reinforcement Learning, Feature Compression | Unverified | 0 |
| Mutation Testing of Deep Reinforcement Learning Based on Real Faults | Jan 13, 2023 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| A Decentralized Pilot Assignment Algorithm for Scalable O-RAN Cell-Free Massive MIMO | Jan 12, 2023 | Deep Reinforcement Learning | Unverified | 0 |
| SoK: Adversarial Machine Learning Attacks and Defences in Multi-Agent Reinforcement Learning | Jan 11, 2023 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Unverified | 0 |
| Why People Skip Music? On Predicting Music Skips using Deep Reinforcement Learning | Jan 10, 2023 | Deep Reinforcement Learning, Recommendation Systems | Code Available | 0 |
| Actor-Director-Critic: A Novel Deep Reinforcement Learning Framework | Jan 10, 2023 | Action Classification, Decision Making | Unverified | 0 |
| schlably: A Python Framework for Deep Reinforcement Learning Based Scheduling Experiments | Jan 10, 2023 | Deep Reinforcement Learning, Job Shop Scheduling | Code Available | 1 |
| Multi-UAV Path Learning for Age and Power Optimization in IoT with UAV Battery Recharge | Jan 9, 2023 | Deep Reinforcement Learning | Unverified | 0 |
| Network Slicing via Transfer Learning aided Distributed Deep Reinforcement Learning | Jan 9, 2023 | Deep Reinforcement Learning, Management | Unverified | 0 |
| Enabling AI-Generated Content (AIGC) Services in Wireless Edge Networks | Jan 9, 2023 | Deep Reinforcement Learning | Unverified | 0 |
| XDQN: Inherently Interpretable DQN through Mimicking | Jan 8, 2023 | Deep Reinforcement Learning, Management | Unverified | 0 |
| Centralized Cooperative Exploration Policy for Continuous Control Tasks | Jan 6, 2023 | continuous-control, Continuous Control | Code Available | 0 |
| A Deep Reinforcement Learning-Based Controller for Magnetorheological-Damped Vehicle Suspension | Jan 6, 2023 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| DRL-GAN: A Hybrid Approach for Binary and Multiclass Network Intrusion Detection | Jan 5, 2023 | Deep Reinforcement Learning, Generative Adversarial Network | Unverified | 0 |
| Extreme Q-Learning: MaxEnt RL without Entropy | Jan 5, 2023 | D4RL, Deep Reinforcement Learning | Code Available | 1 |
| Chat2Map: Efficient Scene Mapping from Multi-Ego Conversations | Jan 4, 2023 | Deep Reinforcement Learning | Unverified | 0 |
| Machine Learning for Large-Scale Optimization in 6G Wireless Networks | Jan 3, 2023 | Computational Efficiency, Deep Reinforcement Learning | Unverified | 0 |
| Deep Reinforcement Learning for Asset Allocation: Reward Clipping | Jan 2, 2023 | Deep Reinforcement Learning, Portfolio Optimization | Unverified | 0 |
| Deep reinforcement learning for irrigation scheduling using high-dimensional sensor feedback | Jan 2, 2023 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Environment Agnostic Representation for Visual Reinforcement Learning | Jan 1, 2023 | Deep Reinforcement Learning, Domain Generalization | Code Available | 1 |
| GAIT: Generating Aesthetic Indoor Tours with Deep Reinforcement Learning | Jan 1, 2023 | Deep Reinforcement Learning, Mixed Reality | Unverified | 0 |
| Goal-Guided Transformer-Enabled Reinforcement Learning for Efficient Autonomous Navigation | Jan 1, 2023 | Autonomous Navigation, Decision Making | Code Available | 1 |
| Situation-Aware Deep Reinforcement Learning for Autonomous Nonlinear Mobility Control in Cyber-Physical Loitering Munition Systems | Dec 31, 2022 | Deep Reinforcement Learning, Unity | Unverified | 0 |
| Accuracy-Guaranteed Collaborative DNN Inference in Industrial IoT via Deep Reinforcement Learning | Dec 31, 2022 | Deep Reinforcement Learning, Edge-computing | Unverified | 0 |
| Transformer in Transformer as Backbone for Deep Reinforcement Learning | Dec 30, 2022 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| Hybrid Deep Reinforcement Learning and Planning for Safe and Comfortable Automated Driving | Dec 30, 2022 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Tuning Synaptic Connections instead of Weights by Genetic Algorithm in Spiking Policy Network | Dec 29, 2022 | Deep Reinforcement Learning | Code Available | 1 |
| Visual CPG-RL: Learning Central Pattern Generators for Visually-Guided Quadruped Locomotion | Dec 29, 2022 | Deep Reinforcement Learning | Unverified | 0 |
| Federated Multi-Agent Deep Reinforcement Learning Approach via Physics-Informed Reward for Multi-Microgrid Energy Management | Dec 29, 2022 | Deep Reinforcement Learning, energy management | Unverified | 0 |
| A Novel Experts Advice Aggregation Framework Using Deep Reinforcement Learning for Portfolio Management | Dec 29, 2022 | Deep Reinforcement Learning, Management | Unverified | 0 |
| Towards automating Codenames spymasters with deep reinforcement learning | Dec 28, 2022 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Improving a sequence-to-sequence nlp model using a reinforcement learning policy algorithm | Dec 28, 2022 | Chatbot, Deep Reinforcement Learning | Unverified | 0 |
| On Pathologies in KL-Regularized Reinforcement Learning from Expert Demonstrations | Dec 28, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Deep Reinforcement Learning for Wind and Energy Storage Coordination in Wholesale Energy and Ancillary Service Markets | Dec 27, 2022 | Deep Reinforcement Learning, energy trading | Unverified | 0 |
| Bayesian Optimization Enhanced Deep Reinforcement Learning for Trajectory Planning and Network Formation in Multi-UAV Networks | Dec 27, 2022 | Bayesian Optimization, Deep Reinforcement Learning | Unverified | 0 |
| Hierarchical Deep Reinforcement Learning for Age-of-Information Minimization in IRS-aided and Wireless-powered Wireless Networks | Dec 27, 2022 | Deep Reinforcement Learning, Fairness | Unverified | 0 |
| Optimal scheduling of island integrated energy systems considering multi-uncertainties and hydrothermal simultaneous transmission: A deep reinforcement learning approach | Dec 27, 2022 | Computational Efficiency, Decision Making | Unverified | 0 |
| Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error | Dec 26, 2022 | Deep Reinforcement Learning, OpenAI Gym | Unverified | 0 |
| Deep Reinforcement Learning for Heat Pump Control | Dec 24, 2022 | Deep Reinforcement Learning, Model Predictive Control | Unverified | 0 |
| Structure-Enhanced DRL for Optimal Transmission Scheduling | Dec 24, 2022 | Deep Reinforcement Learning, Scheduling | Unverified | 0 |
| Coordinated Multi-Agent Reinforcement Learning for Unmanned Aerial Vehicle Swarms in Autonomous Mobile Access Applications | Dec 23, 2022 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Unverified | 0 |
| Example-guided learning of stochastic human driving policies using deep reinforcement learning | Dec 23, 2022 | Autonomous Vehicles, Deep Reinforcement Learning | Code Available | 1 |
| a cognitive frequency allocation strategy for multi-carrier radar against communication interference | Dec 23, 2022 | Deep Reinforcement Learning | Unverified | 0 |
| Proximal Policy Optimization with Graph Neural Networks for Optimal Power Flow | Dec 23, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |