| One-shot, Offline and Production-Scalable PID Optimisation with Deep Reinforcement Learning | Oct 25, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| AACHER: Assorted Actor-Critic Deep Reinforcement Learning with Hindsight Experience Replay | Oct 24, 2022 | Deep Reinforcement LearningFetchPush-v1 | CodeCode Available | 0 |
| Graph Reinforcement Learning-based CNN Inference Offloading in Dynamic Edge Computing | Oct 24, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| OSS Mentor A framework for improving developers contributions via deep reinforcement learning | Oct 24, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Understanding the Evolution of Linear Regions in Deep Reinforcement Learning | Oct 24, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Empirical analysis of PGA-MAP-Elites for Neuroevolution in Uncertain Domains | Oct 24, 2022 | Deep Reinforcement LearningDiversity | CodeCode Available | 0 |
| LEAGUE: Guided Skill Learning and Abstraction for Long-Horizon Manipulation | Oct 23, 2022 | Deep Reinforcement LearningMotion Planning | —Unverified | 0 |
| Learning General World Models in a Handful of Reward-Free Deployments | Oct 23, 2022 | Active LearningDeep Reinforcement Learning | —Unverified | 0 |
| Climate Change Policy Exploration using Reinforcement Learning | Oct 23, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Policy Optimization with Advantage Regularization for Long-Term Fairness in Decision Systems | Oct 22, 2022 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Cut-and-Approximate: 3D Shape Reconstruction from Planar Cross-sections with Deep Reinforcement Learning | Oct 22, 2022 | 3D Object Reconstruction3D Shape Reconstruction | —Unverified | 0 |
| Probing Transfer in Deep Reinforcement Learning without Task Engineering | Oct 22, 2022 | Deep Reinforcement LearningGame Design | —Unverified | 0 |
| Bridging the Gap Between Target Networks and Functional Regularization | Oct 21, 2022 | Deep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Stabilization of Large-scale Probabilistic Boolean Networks | Oct 21, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deep Reinforcement Learning for Inverse Inorganic Materials Design | Oct 21, 2022 | Deep Reinforcement LearningDiversity | —Unverified | 0 |
| Rate-Splitting for Intelligent Reflecting Surface-Aided Multiuser VR Streaming | Oct 21, 2022 | Continuous ControlDeep Reinforcement Learning | CodeCode Available | 0 |
| Towards Quantum-Enabled 6G Slicing | Oct 21, 2022 | Deep Reinforcement LearningFederated Learning | —Unverified | 0 |
| Fine-Grained Session Recommendations in E-commerce using Deep Reinforcement Learning | Oct 20, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning | Oct 20, 2022 | Deep Reinforcement LearningOffline RL | CodeCode Available | 0 |
| Simple Emergent Action Representations from Multi-Task Policy Training | Oct 18, 2022 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Entropy Regularized Reinforcement Learning with Cascading Networks | Oct 16, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| The Impact of Task Underspecification in Evaluating Deep Reinforcement Learning | Oct 16, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| CUP: Critic-Guided Policy Reuse | Oct 15, 2022 | Deep Reinforcement Learning | CodeCode Available | 0 |
| DyFEn: Agent-Based Fee Setting in Payment Channel Networks | Oct 15, 2022 | BenchmarkingDeep Reinforcement Learning | —Unverified | 0 |
| Adaptive patch foraging in deep reinforcement learning agents | Oct 14, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Monte Carlo Augmented Actor-Critic for Sparse Reward Deep Reinforcement Learning from Suboptimal Demonstrations | Oct 14, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| A Scalable Finite Difference Method for Deep Reinforcement Learning | Oct 14, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Just Round: Quantized Observation Spaces Enable Memory Efficient Learning of Dynamic Locomotion | Oct 14, 2022 | Deep Reinforcement LearningQuantization | CodeCode Available | 0 |
| Distributional Reward Estimation for Effective Multi-Agent Deep Reinforcement Learning | Oct 14, 2022 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Observed Adversaries in Deep Reinforcement Learning | Oct 13, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| ProSky: NEAT Meets NOMA-mmWave in the Sky of 6G | Oct 13, 2022 | Deep Reinforcement Learning | CodeCode Available | 0 |
| Deep reinforcement learning for automatic run-time adaptation of UWB PHY radio settings | Oct 13, 2022 | Deep Reinforcement LearningIndoor Localization | —Unverified | 0 |
| Towards Multi-Agent Reinforcement Learning driven Over-The-Counter Market Simulations | Oct 13, 2022 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Transfer Deep Reinforcement Learning-based Large-scale V2G Continuous Charging Coordination with Renewable Energy Sources | Oct 13, 2022 | Deep Reinforcement LearningScheduling | —Unverified | 0 |
| Point Cloud Scene Completion with Joint Color and Semantic Estimation from Single RGB-D Image | Oct 12, 2022 | Deep Reinforcement LearningImage Inpainting | —Unverified | 0 |
| Smooth Trajectory Collision Avoidance through Deep Reinforcement Learning | Oct 12, 2022 | Autonomous NavigationCollision Avoidance | —Unverified | 0 |
| Exploring Adaptive MCTS with TD Learning in miniXCOM | Oct 10, 2022 | Board GamesDeep Reinforcement Learning | —Unverified | 0 |
| Simulating Coverage Path Planning with Roomba | Oct 10, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Long N-step Surrogate Stage Reward to Reduce Variances of Deep Reinforcement Learning in Complex Problems | Oct 10, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Reducing Action Space: Reference-Model-Assisted Deep Reinforcement Learning for Inverter-based Volt-Var Control | Oct 10, 2022 | Deep Reinforcement Learning | —Unverified | 0 |
| Algorithmic Trading Using Continuous Action Space Deep Reinforcement Learning | Oct 7, 2022 | Algorithmic TradingDeep Reinforcement Learning | —Unverified | 0 |
| How to Enable Uncertainty Estimation in Proximal Policy Optimization | Oct 7, 2022 | Deep Reinforcement LearningOut of Distribution (OOD) Detection | —Unverified | 0 |
| Training Diverse High-Dimensional Controllers by Scaling Covariance Matrix Adaptation MAP-Annealing | Oct 6, 2022 | Deep Reinforcement LearningDiversity | —Unverified | 0 |
| Scaling up Stochastic Gradient Descent for Non-convex Optimisation | Oct 6, 2022 | Deep Reinforcement LearningVariational Inference | —Unverified | 0 |
| Deep Inventory Management | Oct 6, 2022 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| On Neural Consolidation for Transfer in Reinforcement Learning | Oct 5, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning | Oct 5, 2022 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Using Deep Reinforcement Learning for mmWave Real-Time Scheduling | Oct 4, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Hyperbolic Deep Reinforcement Learning | Oct 4, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Interpretable Option Discovery using Deep Q-Learning and Variational Autoencoders | Oct 3, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |