| The Impact of Task Underspecification in Evaluating Deep Reinforcement Learning | Oct 16, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| DyFEn: Agent-Based Fee Setting in Payment Channel Networks | Oct 15, 2022 | Benchmarking, Deep Reinforcement Learning | Unverified | 0 |
| CUP: Critic-Guided Policy Reuse | Oct 15, 2022 | Deep Reinforcement Learning | Code Available | 0 |
| WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based Environments | Oct 14, 2022 | Atari Games, Benchmarking | Code Available | 1 |
| Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algorithm | Oct 14, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| ToupleGDD: A Fine-Designed Solution of Influence Maximization by Deep Reinforcement Learning | Oct 14, 2022 | Combinatorial Optimization, Deep Reinforcement Learning | Code Available | 1 |
| Adaptive patch foraging in deep reinforcement learning agents | Oct 14, 2022 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Just Round: Quantized Observation Spaces Enable Memory Efficient Learning of Dynamic Locomotion | Oct 14, 2022 | Deep Reinforcement Learning, Quantization | Code Available | 0 |
| A Scalable Finite Difference Method for Deep Reinforcement Learning | Oct 14, 2022 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Monte Carlo Augmented Actor-Critic for Sparse Reward Deep Reinforcement Learning from Suboptimal Demonstrations | Oct 14, 2022 | continuous-control, Continuous Control | Unverified | 0 |
| Distributional Reward Estimation for Effective Multi-Agent Deep Reinforcement Learning | Oct 14, 2022 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Code Available | 0 |
| Deep reinforcement learning for automatic run-time adaptation of UWB PHY radio settings | Oct 13, 2022 | Deep Reinforcement Learning, Indoor Localization | Unverified | 0 |
| ProSky: NEAT Meets NOMA-mmWave in the Sky of 6G | Oct 13, 2022 | Deep Reinforcement Learning | Code Available | 0 |
| Harfang3D Dog-Fight Sandbox: A Reinforcement Learning Research Platform for the Customized Control Tasks of Fighter Aircrafts | Oct 13, 2022 | Atari Games, Decision Making | Code Available | 2 |
| Deep Reinforcement Learning-based Rebalancing Policies for Profit Maximization of Relay Nodes in Payment Channel Networks | Oct 13, 2022 | Deep Reinforcement Learning | Code Available | 1 |
| Towards Trustworthy Automatic Diagnosis Systems by Emulating Doctors' Reasoning with Deep Reinforcement Learning | Oct 13, 2022 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 1 |
| Towards Multi-Agent Reinforcement Learning driven Over-The-Counter Market Simulations | Oct 13, 2022 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Unverified | 0 |
| Transfer Deep Reinforcement Learning-based Large-scale V2G Continuous Charging Coordination with Renewable Energy Sources | Oct 13, 2022 | Deep Reinforcement Learning, Scheduling | Unverified | 0 |
| Observed Adversaries in Deep Reinforcement Learning | Oct 13, 2022 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning | Oct 12, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Smooth Trajectory Collision Avoidance through Deep Reinforcement Learning | Oct 12, 2022 | Autonomous Navigation, Collision Avoidance | Unverified | 0 |
| Point Cloud Scene Completion with Joint Color and Semantic Estimation from Single RGB-D Image | Oct 12, 2022 | Deep Reinforcement Learning, Image Inpainting | Unverified | 0 |
| Deep Reinforcement Learning Based Joint Downlink Beamforming and RIS Configuration in RIS-aided MU-MISO Systems Under Hardware Impairments and Imperfect CSI | Oct 10, 2022 | Deep Reinforcement Learning | Code Available | 2 |
| Multiagent Reinforcement Learning Based on Fusion-Multiactor-Attention-Critic for Multiple-Unmanned-Aerial-Vehicle Navigation Control | Oct 10, 2022 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Code Available | 1 |
| Reducing Action Space: Reference-Model-Assisted Deep Reinforcement Learning for Inverter-based Volt-Var Control | Oct 10, 2022 | Deep Reinforcement Learning | Unverified | 0 |
| Exploring Adaptive MCTS with TD Learning in miniXCOM | Oct 10, 2022 | Board Games, Deep Reinforcement Learning | Unverified | 0 |
| Long N-step Surrogate Stage Reward to Reduce Variances of Deep Reinforcement Learning in Complex Problems | Oct 10, 2022 | continuous-control, Continuous Control | Unverified | 0 |
| Simulating Coverage Path Planning with Roomba | Oct 10, 2022 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Benchmarking Reinforcement Learning Techniques for Autonomous Navigation | Oct 10, 2022 | Autonomous Navigation, Benchmarking | Code Available | 1 |
| High Performance on Atari Games Using Perceptual Control Architecture Without Training | Oct 8, 2022 | Deep Reinforcement Learning | Code Available | 1 |
| DIMES: A Differentiable Meta Solver for Combinatorial Optimization Problems | Oct 8, 2022 | Combinatorial Optimization, Deep Reinforcement Learning | Code Available | 1 |
| Flexible Attention-Based Multi-Policy Fusion for Efficient Deep Reinforcement Learning | Oct 7, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Algorithmic Trading Using Continuous Action Space Deep Reinforcement Learning | Oct 7, 2022 | Algorithmic Trading, Deep Reinforcement Learning | Unverified | 0 |
| How to Enable Uncertainty Estimation in Proximal Policy Optimization | Oct 7, 2022 | Deep Reinforcement Learning, Out of Distribution (OOD) Detection | Unverified | 0 |
| Deep Inventory Management | Oct 6, 2022 | Deep Reinforcement Learning, Management | Unverified | 0 |
| Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery | Oct 6, 2022 | Deep Reinforcement Learning, Diversity | Code Available | 1 |
| Training Diverse High-Dimensional Controllers by Scaling Covariance Matrix Adaptation MAP-Annealing | Oct 6, 2022 | Deep Reinforcement Learning, Diversity | Unverified | 0 |
| Scaling up Stochastic Gradient Descent for Non-convex Optimisation | Oct 6, 2022 | Deep Reinforcement Learning, Variational Inference | Unverified | 0 |
| Deep Reinforcement Learning based Evasion Generative Adversarial Network for Botnet Detection | Oct 6, 2022 | Deep Reinforcement Learning, Generative Adversarial Network | Code Available | 1 |
| Discovering faster matrix multiplication algorithms with reinforcement learning | Oct 5, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 4 |
| Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning | Oct 5, 2022 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Real-Time Reinforcement Learning for Vision-Based Robotics Utilizing Local and Remote Computers | Oct 5, 2022 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 1 |
| On Neural Consolidation for Transfer in Reinforcement Learning | Oct 5, 2022 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| DISCOVER: Deep identification of symbolically concise open-form PDEs via enhanced reinforcement-learning | Oct 4, 2022 | Deep Reinforcement Learning, Form | Code Available | 1 |
| Using Deep Reinforcement Learning for mmWave Real-Time Scheduling | Oct 4, 2022 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Hyperbolic Deep Reinforcement Learning | Oct 4, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Interpretable Option Discovery using Deep Q-Learning and Variational Autoencoders | Oct 3, 2022 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Learning Minimally-Violating Continuous Control for Infeasible Linear Temporal Logic Specifications | Oct 3, 2022 | continuous-control, Continuous Control | Unverified | 0 |
| Cooperative Multi-Agent Deep Reinforcement Learning for Reliable and Energy-Efficient Mobile Access via Multi-UAV Control | Oct 3, 2022 | Deep Reinforcement Learning | Unverified | 0 |
| Deep Learning for Wireless Networked Systems: a joint Estimation-Control-Scheduling Approach | Oct 3, 2022 | Deep Reinforcement Learning, Scheduling | Unverified | 0 |