| Meta-SAGE: Scale Meta-Learning Scheduled Adaptation with Guided Exploration for Mitigating Scale Shift on Combinatorial Optimization | Jun 5, 2023 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 1 |
| Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic | Jun 5, 2023 | continuous-controlContinuous Control | CodeCode Available | 1 |
| A Novel Multi-Agent Deep RL Approach for Traffic Signal Control | Jun 5, 2023 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| For SALE: State-Action Representation Learning for Deep Reinforcement Learning | Jun 4, 2023 | continuous-controlContinuous Control | CodeCode Available | 1 |
| DVFO: Learning-Based DVFS for Energy-Efficient Edge-Cloud Collaborative Inference | Jun 2, 2023 | Collaborative InferenceCPU | —Unverified | 0 |
| Improving the generalizability and robustness of large-scale traffic signal control | Jun 2, 2023 | Deep Reinforcement LearningDistributional Reinforcement Learning | —Unverified | 0 |
| Hyperparameters in Reinforcement Learning and How To Tune Them | Jun 2, 2023 | AutoMLDeep Reinforcement Learning | —Unverified | 0 |
| Average AoI Minimization for Energy Harvesting Relay-aided Status Update Network Using Deep Reinforcement Learning | Jun 2, 2023 | Deep Reinforcement Learning | CodeCode Available | 0 |
| ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages | Jun 2, 2023 | Bayesian Inferencecontinuous-control | CodeCode Available | 0 |
| Deep Q-Learning versus Proximal Policy Optimization: Performance Comparison in a Material Sorting Task | Jun 2, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |