| Constrained Latent Action Policies for Model-Based Offline Reinforcement Learning | Nov 7, 2024 | D4RLreinforcement-learning | CodeCode Available | 0 |
| Hypercube Policy Regularization Framework for Offline Reinforcement Learning | Nov 7, 2024 | D4RLGeneral Reinforcement Learning | CodeCode Available | 0 |
| Offline Behavior Distillation | Oct 30, 2024 | D4RLReinforcement Learning (RL) | CodeCode Available | 0 |
| Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning | Oct 30, 2024 | D4RLreinforcement-learning | —Unverified | 0 |
| NetworkGym: Reinforcement Learning Environments for Multi-Access Traffic Management in Network Simulation | Oct 30, 2024 | D4RLManagement | CodeCode Available | 0 |
| Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model | Oct 27, 2024 | D4RLQ-Learning | CodeCode Available | 0 |
| SAMG: State-Action-Aware Offline-to-Online Reinforcement Learning with Offline Model Guidance | Oct 24, 2024 | D4RLreinforcement-learning | —Unverified | 0 |
| RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space | Oct 21, 2024 | ClusteringD4RL | —Unverified | 0 |
| Rethinking Optimal Transport in Offline Reinforcement Learning | Oct 17, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning | Oct 15, 2024 | D4RLModel-based Reinforcement Learning | CodeCode Available | 0 |
| Diffusion Model Predictive Control | Oct 7, 2024 | D4RLmodel | —Unverified | 0 |
| KAN v.s. MLP for Offline Reinforcement Learning | Sep 15, 2024 | D4RLKolmogorov-Arnold Networks | —Unverified | 0 |
| Planning Transformer: Long-Horizon Offline Reinforcement Learning with Planning Tokens | Sep 14, 2024 | D4RLreinforcement-learning | —Unverified | 0 |
| Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning | Sep 12, 2024 | D4RLOffline RL | —Unverified | 0 |
| The Role of Deep Learning Regularizations on Actors in Offline RL | Sep 11, 2024 | D4RLOffline RL | CodeCode Available | 0 |
| Forward KL Regularized Preference Optimization for Aligning Diffusion Policies | Sep 9, 2024 | D4RLDecision Making | —Unverified | 0 |
| SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning | Aug 23, 2024 | D4RLOffline RL | —Unverified | 0 |
| Offline Model-Based Reinforcement Learning with Anti-Exploration | Aug 20, 2024 | D4RLmodel | —Unverified | 0 |
| SelfBC: Self Behavior Cloning for Offline Reinforcement Learning | Aug 4, 2024 | AttributeD4RL | —Unverified | 0 |
| Diffusion Models as Optimizers for Efficient Planning in Offline RL | Jul 23, 2024 | D4RLDecision Making | CodeCode Available | 0 |
| Offline Reinforcement Learning with Imputed Rewards | Jul 15, 2024 | D4RLreinforcement-learning | —Unverified | 0 |
| Model-based Offline Reinforcement Learning with Lower Expectile Q-Learning | Jun 30, 2024 | D4RLOffline RL | —Unverified | 0 |
| Binary Reward Labeling: Bridging Offline Preference and Reward-Based Reinforcement Learning | Jun 14, 2024 | D4RLOffline RL | —Unverified | 0 |
| SeMOPO: Learning High-quality Model and Policy from Low-quality Offline Visual Datasets | Jun 13, 2024 | D4RLOffline RL | —Unverified | 0 |
| DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for Offline Reinforcement Learning | Jun 13, 2024 | D4RLOffline RL | —Unverified | 0 |