| Stability Constrained Reinforcement Learning for Decentralized Real-Time Voltage Control | Sep 16, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping | Sep 15, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Learning to Solve Multiple-TSP with Time Window and Rejections via Deep Reinforcement Learning | Sep 13, 2022 | Deep Reinforcement Learning | CodeCode Available | 1 |
| Deep Reinforcement Learning for Cryptocurrency Trading: Practical Approach to Address Backtest Overfitting | Sep 12, 2022 | Algorithmic TradingDeep Reinforcement Learning | CodeCode Available | 1 |
| Actor Prioritized Experience Replay | Sep 1, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Rethinking Conversational Recommendations: Is Decision Tree All You Need? | Aug 31, 2022 | AllDeep Reinforcement Learning | CodeCode Available | 1 |
| Effective Multi-User Delay-Constrained Scheduling with Deep Recurrent Reinforcement Learning | Aug 30, 2022 | Cloud ComputingDeep Reinforcement Learning | CodeCode Available | 1 |
| AutoShard: Automated Embedding Table Sharding for Recommender Systems | Aug 12, 2022 | Deep Reinforcement LearningRecommendation Systems | CodeCode Available | 1 |
| Bayesian Soft Actor-Critic: A Directed Acyclic Strategy Graph Based Deep Reinforcement Learning | Aug 11, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Automating DBSCAN via Deep Reinforcement Learning | Aug 9, 2022 | ClusteringComputational Efficiency | CodeCode Available | 1 |
| Object Detection with Deep Reinforcement Learning | Aug 9, 2022 | Active Object LocalizationDeep Reinforcement Learning | CodeCode Available | 1 |
| Mobility-Aware Cooperative Caching in Vehicular Edge Computing Based on Asynchronous Federated and Deep Reinforcement Learning | Aug 2, 2022 | Deep Reinforcement LearningEdge-computing | CodeCode Available | 1 |
| Performance Comparison of Deep RL Algorithms for Energy Systems Optimal Scheduling | Aug 1, 2022 | Deep Reinforcement Learningenergy management | CodeCode Available | 1 |
| DRL-M4MR: An Intelligent Multicast Routing Approach Based on DQN Deep Reinforcement Learning in SDN | Jul 31, 2022 | Deep Reinforcement Learning | CodeCode Available | 1 |
| Unified Automatic Control of Vehicular Systems with Reinforcement Learning | Jul 30, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning | Jul 29, 2022 | Contrastive LearningDeep Reinforcement Learning | CodeCode Available | 1 |
| AADG: Automatic Augmentation for Domain Generalization on Retinal Image Segmentation | Jul 27, 2022 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 1 |
| Learning Soccer Juggling Skills with Layer-wise Mixture-of-Experts | Jul 24, 2022 | Deep Reinforcement LearningHumanoid Control | CodeCode Available | 1 |
| Reinforcement learning for Energies of the future and carbon neutrality: a Challenge Design | Jul 21, 2022 | Deep Reinforcement LearningManagement | CodeCode Available | 1 |
| Deep Reinforcement Learning for Market Making Under a Hawkes Process-Based Limit Order Book Model | Jul 20, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games | Jul 18, 2022 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| Asset Allocation: From Markowitz to Deep Reinforcement Learning | Jul 14, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Solving the Traveling Salesperson Problem with Precedence Constraints by Deep Reinforcement Learning | Jul 4, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Stabilizing Off-Policy Deep Reinforcement Learning from Pixels | Jul 3, 2022 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 1 |
| Generalized Policy Improvement Algorithms with Theoretically Supported Sample Reuse | Jun 28, 2022 | Continuous ControlDecision Making | CodeCode Available | 1 |
| Toward multi-target self-organizing pursuit in a partially observable Markov game | Jun 24, 2022 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Finite Expression Method for Solving High-Dimensional Partial Differential Equations | Jun 21, 2022 | Deep Reinforcement LearningVocal Bursts Intensity Prediction | CodeCode Available | 1 |
| Deep Reinforcement Learning for Turbulence Modeling in Large Eddy Simulations | Jun 21, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum | Jun 21, 2022 | Adversarial RobustnessDeep Reinforcement Learning | CodeCode Available | 1 |
| Sampling Efficient Deep Reinforcement Learning through Preference-Guided Stochastic Exploration | Jun 20, 2022 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| SMPL: Simulated Industrial Manufacturing and Process Control Learning Environments | Jun 17, 2022 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 1 |
| A Search-Based Testing Approach for Deep Reinforcement Learning Agents | Jun 15, 2022 | Autonomous DrivingDecision Making | CodeCode Available | 1 |
| RoSGAS: Adaptive Social Bot Detection with Reinforced Self-Supervised GNN Architecture Search | Jun 14, 2022 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Reinforcement Learning-based Placement of Charging Stations in Urban Road Networks | Jun 13, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum Games | Jun 12, 2022 | Deep Reinforcement LearningMuJoCo Games | CodeCode Available | 1 |
| Towards Safe Reinforcement Learning via Constraining Conditional Value-at-Risk | Jun 9, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |
| RLx2: Training a Sparse Deep Reinforcement Learning Model from Scratch | May 30, 2022 | Continuous ControlDeep Reinforcement Learning | CodeCode Available | 1 |
| Sym-NCO: Leveraging Symmetricity for Neural Combinatorial Optimization | May 26, 2022 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 1 |
| DRLComplex: Reconstruction of protein quaternary structures using deep reinforcement learning | May 26, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| TreEnhance: A Tree Search Method For Low-Light Image Enhancement | May 25, 2022 | Deep Reinforcement LearningImage Enhancement | CodeCode Available | 1 |
| Memory-efficient Reinforcement Learning with Value-based Knowledge Consolidation | May 22, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Deep Reinforcement Learning for Time Allocation and Directional Transmission in Joint Radar-Communication | May 19, 2022 | Autonomous VehiclesDecision Making Under Uncertainty | CodeCode Available | 1 |
| A2C is a special case of PPO | May 18, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| The Primacy Bias in Deep Reinforcement Learning | May 16, 2022 | Atari Games 100kDeep Reinforcement Learning | CodeCode Available | 1 |
| Deep Reinforcement Learning for Computational Fluid Dynamics on HPC Systems | May 13, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Intelligent Reflecting Surface Configurations for Smart Radio Using Deep Reinforcement Learning | May 11, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Pervasive Machine Learning for Smart Radio Environments Enabled by Reconfigurable Intelligent Surfaces | May 8, 2022 | BIG-bench Machine LearningDeep Reinforcement Learning | CodeCode Available | 1 |
| Collaborative Target Search with a Visual Drone Swarm: An Adaptive Curriculum Embedded Multistage Reinforcement Learning Approach | Apr 26, 2022 | Deep Reinforcement LearningManagement | CodeCode Available | 1 |
| HyperNCA: Growing Developmental Networks with Neural Cellular Automata | Apr 25, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Comparing Deep Reinforcement Learning Algorithms in Two-Echelon Supply Chains | Apr 20, 2022 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |