| Soft Actor-Critic with Beta Policy via Implicit Reparameterization Gradients | Sep 8, 2024 | continuous-controlContinuous Control | CodeCode Available | 0 |
| QuantFactor REINFORCE: Mining Steady Formulaic Alpha Factors with Variance-bounded REINFORCE | Sep 8, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn | Sep 7, 2024 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| Simplex-enabled Safe Continual Learning Machine | Sep 5, 2024 | Continual LearningDeep Reinforcement Learning | —Unverified | 0 |
| Sparsifying Parametric Models with L0 Regularization | Sep 5, 2024 | Deep Reinforcement LearningDictionary Learning | CodeCode Available | 0 |
| Reinforcement-Learning-Enabled Beam Alignment for Water-Air Direct Optical Wireless Communications | Sep 5, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Decision Transformer for Enhancing Neural Local Search on the Job Shop Scheduling Problem | Sep 4, 2024 | Deep Reinforcement LearningJob Shop Scheduling | —Unverified | 0 |
| A Deep Reinforcement Learning Framework For Financial Portfolio Management | Sep 3, 2024 | Deep Reinforcement LearningManagement | CodeCode Available | 0 |
| AI Olympics challenge with Evolutionary Soft Actor Critic | Sep 2, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Two-Timescale Synchronization and Migration for Digital Twin Networks: A Multi-Agent Deep Reinforcement Learning Approach | Sep 2, 2024 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 |
| Solving Integrated Process Planning and Scheduling Problem via Graph Neural Network Based Deep Reinforcement Learning | Sep 2, 2024 | Deep Reinforcement LearningGraph Neural Network | CodeCode Available | 1 |
| Generalized Multi-hop Traffic Pressure for Heterogeneous Traffic Perimeter Control | Sep 1, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| AgGym: An agricultural biotic stress simulation environment for ultra-precision management planning | Sep 1, 2024 | Deep Reinforcement LearningManagement | CodeCode Available | 0 |
| Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning | Aug 30, 2024 | CPUDeep Reinforcement Learning | CodeCode Available | 1 |
| MAPF-GPT: Imitation Learning for Multi-Agent Pathfinding at Scale | Aug 29, 2024 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 2 |
| Comparison of Model Predictive Control and Proximal Policy Optimization for a 1-DOF Helicopter System | Aug 28, 2024 | Deep Reinforcement LearningModel Predictive Control | —Unverified | 0 |
| Statistical QoS Provision in Business-Centric Networks | Aug 28, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| MA-CDMR: An Intelligent Cross-domain Multicast Routing Method based on Multiagent Deep Reinforcement Learning in Multi-domain SDWN | Aug 27, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Earth Observation Satellite Scheduling with Graph Neural Networks | Aug 27, 2024 | Deep Reinforcement LearningEarth Observation | —Unverified | 0 |
| Model-Based Reinforcement Learning for Control of Strongly-Disturbed Unsteady Aerodynamic Flows | Aug 26, 2024 | Deep Reinforcement LearningModel-based Reinforcement Learning | —Unverified | 0 |
| Robot Navigation with Entity-Based Collision Avoidance using Deep Reinforcement Learning | Aug 26, 2024 | Collision AvoidanceDeep Reinforcement Learning | —Unverified | 0 |
| Multi-Agent Target Assignment and Path Finding for Intelligent Warehouse: A Cooperative Multi-Agent Deep Reinforcement Learning Perspective | Aug 25, 2024 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Control-Informed Reinforcement Learning for Chemical Processes | Aug 24, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Synesthesia of Machines (SoM)-Enhanced ISAC Precoding for Vehicular Networks with Double Dynamics | Aug 24, 2024 | Deep Reinforcement LearningIntegrated sensing and communication | —Unverified | 0 |
| Localized Observation Abstraction Using Piecewise Linear Spatial Decay for Reinforcement Learning in Combat Simulations | Aug 23, 2024 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |