| Soft Actor-Critic with Beta Policy via Implicit Reparameterization Gradients | Sep 8, 2024 | continuous-control, Continuous Control | Code Available | 0 |
| QuantFactor REINFORCE: Mining Steady Formulaic Alpha Factors with Variance-bounded REINFORCE | Sep 8, 2024 | Deep Reinforcement Learning | Unverified | 0 |
| Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn | Sep 7, 2024 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 0 |
| Simplex-enabled Safe Continual Learning Machine | Sep 5, 2024 | Continual Learning, Deep Reinforcement Learning | Unverified | 0 |
| Reinforcement-Learning-Enabled Beam Alignment for Water-Air Direct Optical Wireless Communications | Sep 5, 2024 | Deep Reinforcement Learning | Unverified | 0 |
| Sparsifying Parametric Models with L0 Regularization | Sep 5, 2024 | Deep Reinforcement Learning, Dictionary Learning | Code Available | 0 |
| Decision Transformer for Enhancing Neural Local Search on the Job Shop Scheduling Problem | Sep 4, 2024 | Deep Reinforcement Learning, Job Shop Scheduling | Unverified | 0 |
| A Deep Reinforcement Learning Framework For Financial Portfolio Management | Sep 3, 2024 | Deep Reinforcement Learning, Management | Code Available | 0 |
| AI Olympics challenge with Evolutionary Soft Actor Critic | Sep 2, 2024 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Two-Timescale Synchronization and Migration for Digital Twin Networks: A Multi-Agent Deep Reinforcement Learning Approach | Sep 2, 2024 | Deep Reinforcement Learning, Edge-computing | Unverified | 0 |