| Leveraging Sequentiality in Reinforcement Learning from a Single Demonstration | Nov 9, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Pretraining in Deep Reinforcement Learning: A Survey | Nov 8, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| FinRL-Meta: Market Environments and Benchmarks for Data-Driven Financial Reinforcement Learning | Nov 6, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 6 |
| ProtoX: Explaining a Reinforcement Learning Agent via Prototyping | Nov 6, 2022 | Contrastive LearningDeep Reinforcement Learning | CodeCode Available | 0 |
| Spatio-temporal Incentives Optimization for Ride-hailing Services with Offline Deep Reinforcement Learning | Nov 6, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A Survey on Influence Maximization: From an ML-Based Combinatorial Optimization | Nov 6, 2022 | Combinatorial OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| Diversity-based Deep Reinforcement Learning Towards Multidimensional Difficulty for Fighting Game AI | Nov 4, 2022 | Deep Reinforcement LearningDiversity | CodeCode Available | 0 |
| Fair and Efficient Distributed Edge Learning with Hybrid Multipath TCP | Nov 3, 2022 | AvgDeep Reinforcement Learning | —Unverified | 0 |
| Theta-Resonance: A Single-Step Reinforcement Learning Method for Design Space Exploration | Nov 3, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deep Reinforcement Learning for IRS Phase Shift Design in Spatiotemporally Correlated Environments | Nov 2, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Wind Power Forecasting Considering Data Privacy Protection: A Federated Deep Reinforcement Learning Approach | Nov 2, 2022 | Deep Reinforcement LearningFederated Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Power Control in Next-Generation WiFi Network Systems | Nov 2, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Spatial-temporal recurrent reinforcement learning for autonomous ships | Nov 2, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Learning Adaptive Evolutionary Computation for Solving Multi-Objective Optimization Problems | Nov 1, 2022 | Combinatorial OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| Event Tables for Efficient Experience Replay | Nov 1, 2022 | Car RacingDeep Reinforcement Learning | —Unverified | 0 |
| Online Control of Adaptive Large Neighborhood Search using Deep Reinforcement Learning | Nov 1, 2022 | Bayesian OptimizationCombinatorial Optimization | CodeCode Available | 1 |
| CPG-RL: Learning Central Pattern Generators for Quadruped Locomotion | Nov 1, 2022 | Deep Reinforcement Learning | —Unverified | 0 |
| Teacher-student curriculum learning for reinforcement learning | Oct 31, 2022 | Board GamesDecision Making | —Unverified | 0 |
| Imitating Opponent to Win: Adversarial Policy Imitation Learning in Two-player Competitive Games | Oct 30, 2022 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Network Aware Compute and Memory Allocation in Optically Composable Data Centres with Deep Reinforcement Learning and Graph Neural Networks | Oct 26, 2022 | Deep Reinforcement LearningGraph Neural Network | —Unverified | 0 |
| ERL-Re^2: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation | Oct 26, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Knowledge-Guided Exploration in Deep Reinforcement Learning | Oct 26, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Reachability Verification Based Reliability Assessment for Deep Reinforcement Learning Controlled Robotics and Autonomous Systems | Oct 26, 2022 | Deep Reinforcement Learning | —Unverified | 0 |
| DeXtreme: Transfer of Agile In-hand Manipulation from Simulation to Reality | Oct 25, 2022 | Deep Reinforcement LearningGPU | CodeCode Available | 4 |
| One-shot, Offline and Production-Scalable PID Optimisation with Deep Reinforcement Learning | Oct 25, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |