| Monte Carlo Augmented Actor-Critic for Sparse Reward Deep Reinforcement Learning from Suboptimal Demonstrations | Oct 14, 2022 | continuous-control, Continuous Control | Unverified | 0 |
| A Scalable Finite Difference Method for Deep Reinforcement Learning | Oct 14, 2022 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Just Round: Quantized Observation Spaces Enable Memory Efficient Learning of Dynamic Locomotion | Oct 14, 2022 | Deep Reinforcement Learning, Quantization | Code Available | 0 |
| Distributional Reward Estimation for Effective Multi-Agent Deep Reinforcement Learning | Oct 14, 2022 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Code Available | 0 |
| Observed Adversaries in Deep Reinforcement Learning | Oct 13, 2022 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| ProSky: NEAT Meets NOMA-mmWave in the Sky of 6G | Oct 13, 2022 | Deep Reinforcement Learning | Code Available | 0 |
| Deep reinforcement learning for automatic run-time adaptation of UWB PHY radio settings | Oct 13, 2022 | Deep Reinforcement Learning, Indoor Localization | Unverified | 0 |
| Towards Multi-Agent Reinforcement Learning driven Over-The-Counter Market Simulations | Oct 13, 2022 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Unverified | 0 |
| Transfer Deep Reinforcement Learning-based Large-scale V2G Continuous Charging Coordination with Renewable Energy Sources | Oct 13, 2022 | Deep Reinforcement Learning, Scheduling | Unverified | 0 |
| Point Cloud Scene Completion with Joint Color and Semantic Estimation from Single RGB-D Image | Oct 12, 2022 | Deep Reinforcement Learning, Image Inpainting | Unverified | 0 |
| Smooth Trajectory Collision Avoidance through Deep Reinforcement Learning | Oct 12, 2022 | Autonomous Navigation, Collision Avoidance | Unverified | 0 |
| Exploring Adaptive MCTS with TD Learning in miniXCOM | Oct 10, 2022 | Board Games, Deep Reinforcement Learning | Unverified | 0 |
| Simulating Coverage Path Planning with Roomba | Oct 10, 2022 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Long N-step Surrogate Stage Reward to Reduce Variances of Deep Reinforcement Learning in Complex Problems | Oct 10, 2022 | continuous-control, Continuous Control | Unverified | 0 |
| Reducing Action Space: Reference-Model-Assisted Deep Reinforcement Learning for Inverter-based Volt-Var Control | Oct 10, 2022 | Deep Reinforcement Learning | Unverified | 0 |
| Algorithmic Trading Using Continuous Action Space Deep Reinforcement Learning | Oct 7, 2022 | Algorithmic Trading, Deep Reinforcement Learning | Unverified | 0 |
| How to Enable Uncertainty Estimation in Proximal Policy Optimization | Oct 7, 2022 | Deep Reinforcement Learning, Out of Distribution (OOD) Detection | Unverified | 0 |
| Training Diverse High-Dimensional Controllers by Scaling Covariance Matrix Adaptation MAP-Annealing | Oct 6, 2022 | Deep Reinforcement Learning, Diversity | Unverified | 0 |
| Scaling up Stochastic Gradient Descent for Non-convex Optimisation | Oct 6, 2022 | Deep Reinforcement Learning, Variational Inference | Unverified | 0 |
| Deep Inventory Management | Oct 6, 2022 | Deep Reinforcement Learning, Management | Unverified | 0 |
| On Neural Consolidation for Transfer in Reinforcement Learning | Oct 5, 2022 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning | Oct 5, 2022 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Using Deep Reinforcement Learning for mmWave Real-Time Scheduling | Oct 4, 2022 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Hyperbolic Deep Reinforcement Learning | Oct 4, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Interpretable Option Discovery using Deep Q-Learning and Variational Autoencoders | Oct 3, 2022 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |