| POCE: Primal Policy Optimization with Conservative Estimation for Multi-constraint Offline Reinforcement Learning | Jan 1, 2024 | Offline RLReinforcement Learning (RL) | CodeCode Available | 0 |
| Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning | Jan 1, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Online Symbolic Music Alignment with Offline Reinforcement Learning | Dec 31, 2023 | Dynamic Time WarpingOffline RL | CodeCode Available | 1 |
| PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning | Dec 26, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Critic-Guided Decision Transformer for Offline Reinforcement Learning | Dec 21, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Neural Network Approximation for Pessimistic Offline Reinforcement Learning | Dec 19, 2023 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| CUDC: A Curiosity-Driven Unsupervised Data Collection Method with Adaptive Temporal Distances for Offline Reinforcement Learning | Dec 19, 2023 | NavigateOffline RL | —Unverified | 0 |
| Advancing RAN Slicing with Offline Reinforcement Learning | Dec 16, 2023 | ManagementOffline RL | —Unverified | 0 |
| Traffic Signal Control Using Lightweight Transformers: An Offline-to-Online RL Approach | Dec 12, 2023 | Knowledge DistillationOffline RL | CodeCode Available | 1 |
| A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning | Dec 12, 2023 | MuJoCoOffline RL | —Unverified | 0 |