| Toward Safe and Accelerated Deep Reinforcement Learning for Next-Generation Wireless Networks | Sep 16, 2022 | Deep Reinforcement Learning, Management | Code Available | 1 |
| Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping | Sep 15, 2022 | continuous-control, Continuous Control | Code Available | 1 |
| Learning to Solve Multiple-TSP with Time Window and Rejections via Deep Reinforcement Learning | Sep 13, 2022 | Deep Reinforcement Learning | Code Available | 1 |
| Deep Reinforcement Learning for Cryptocurrency Trading: Practical Approach to Address Backtest Overfitting | Sep 12, 2022 | Algorithmic Trading, Deep Reinforcement Learning | Code Available | 1 |
| Actor Prioritized Experience Replay | Sep 1, 2022 | continuous-control, Continuous Control | Code Available | 1 |
| Rethinking Conversational Recommendations: Is Decision Tree All You Need? | Aug 31, 2022 | All, Deep Reinforcement Learning | Code Available | 1 |
| Effective Multi-User Delay-Constrained Scheduling with Deep Recurrent Reinforcement Learning | Aug 30, 2022 | Cloud Computing, Deep Reinforcement Learning | Code Available | 1 |
| AutoShard: Automated Embedding Table Sharding for Recommender Systems | Aug 12, 2022 | Deep Reinforcement Learning, Recommendation Systems | Code Available | 1 |
| Bayesian Soft Actor-Critic: A Directed Acyclic Strategy Graph Based Deep Reinforcement Learning | Aug 11, 2022 | continuous-control, Continuous Control | Code Available | 1 |
| Object Detection with Deep Reinforcement Learning | Aug 9, 2022 | Active Object Localization, Deep Reinforcement Learning | Code Available | 1 |
| Automating DBSCAN via Deep Reinforcement Learning | Aug 9, 2022 | Clustering, Computational Efficiency | Code Available | 1 |
| Mobility-Aware Cooperative Caching in Vehicular Edge Computing Based on Asynchronous Federated and Deep Reinforcement Learning | Aug 2, 2022 | Deep Reinforcement Learning, Edge-computing | Code Available | 1 |
| Performance Comparison of Deep RL Algorithms for Energy Systems Optimal Scheduling | Aug 1, 2022 | Deep Reinforcement Learning, energy management | Code Available | 1 |
| DRL-M4MR: An Intelligent Multicast Routing Approach Based on DQN Deep Reinforcement Learning in SDN | Jul 31, 2022 | Deep Reinforcement Learning | Code Available | 1 |
| Unified Automatic Control of Vehicular Systems with Reinforcement Learning | Jul 30, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning | Jul 29, 2022 | Contrastive Learning, Deep Reinforcement Learning | Code Available | 1 |
| AADG: Automatic Augmentation for Domain Generalization on Retinal Image Segmentation | Jul 27, 2022 | Data Augmentation, Deep Reinforcement Learning | Code Available | 1 |
| Learning Soccer Juggling Skills with Layer-wise Mixture-of-Experts | Jul 24, 2022 | Deep Reinforcement Learning, Humanoid Control | Code Available | 1 |
| Reinforcement learning for Energies of the future and carbon neutrality: a Challenge Design | Jul 21, 2022 | Deep Reinforcement Learning, Management | Code Available | 1 |
| Deep Reinforcement Learning for Market Making Under a Hawkes Process-Based Limit Order Book Model | Jul 20, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games | Jul 18, 2022 | Atari Games, Deep Reinforcement Learning | Code Available | 1 |
| Asset Allocation: From Markowitz to Deep Reinforcement Learning | Jul 14, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Solving the Traveling Salesperson Problem with Precedence Constraints by Deep Reinforcement Learning | Jul 4, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Stabilizing Off-Policy Deep Reinforcement Learning from Pixels | Jul 3, 2022 | Data Augmentation, Deep Reinforcement Learning | Code Available | 1 |
| Generalized Policy Improvement Algorithms with Theoretically Supported Sample Reuse | Jun 28, 2022 | Continuous Control, Decision Making | Code Available | 1 |