| Offline Inverse Constrained Reinforcement Learning for Safe-Critical Decision Making in Healthcare | Oct 10, 2024 | Common Sense Reasoning, Data Augmentation | —Unverified | 0 |
| The Smart Buildings Control Suite: A Diverse Open Source Benchmark to Evaluate and Scale HVAC Control Policies for Sustainability | Oct 2, 2024 | Model Predictive Control, Offline RL | —Unverified | 0 |
| ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization | Oct 2, 2024 | MuJoCo, Multi-agent Reinforcement Learning | —Unverified | 0 |
| OffRIPP: Offline RL-based Informative Path Planning | Sep 25, 2024 | Offline RL, reinforcement-learning | —Unverified | 0 |
| Development and Validation of Heparin Dosing Policies Using an Offline Reinforcement Learning Algorithm | Sep 24, 2024 | Offline RL, Off-policy evaluation | —Unverified | 0 |
| KAN v.s. MLP for Offline Reinforcement Learning | Sep 15, 2024 | D4RL, Kolmogorov-Arnold Networks | —Unverified | 0 |
| Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning | Sep 12, 2024 | D4RL, Offline RL | —Unverified | 0 |
| Enhancing Cross-domain Pre-Trained Decision Transformers with Adaptive Attention | Sep 11, 2024 | Offline RL | —Unverified | 0 |
| The Role of Deep Learning Regularizations on Actors in Offline RL | Sep 11, 2024 | D4RL, Offline RL | Code Available | 0 |
| Tractable Offline Learning of Regular Decision Processes | Sep 4, 2024 | Offline RL, Reinforcement Learning (RL) | —Unverified | 0 |
| Skills Regularized Task Decomposition for Multi-task Offline Reinforcement Learning | Aug 28, 2024 | Drone navigation, Offline RL | —Unverified | 0 |
| Unsupervised-to-Online Reinforcement Learning | Aug 27, 2024 | Offline RL, reinforcement-learning | —Unverified | 0 |
| Optimization Solution Functions as Deterministic Policies for Offline Reinforcement Learning | Aug 27, 2024 | Offline RL, reinforcement-learning | —Unverified | 0 |
| SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning | Aug 23, 2024 | D4RL, Offline RL | —Unverified | 0 |
| Domain Adaptation for Offline Reinforcement Learning with Limited Samples | Aug 22, 2024 | Domain Adaptation, Offline RL | —Unverified | 0 |
| Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning | Aug 22, 2024 | Offline RL, Reinforcement Learning (RL) | Code Available | 0 |
| Preference-Guided Reflective Sampling for Aligning Language Models | Aug 22, 2024 | Document Summarization, Instruction Following | Code Available | 0 |
| Offline Model-Based Reinforcement Learning with Anti-Exploration | Aug 20, 2024 | D4RL, model | —Unverified | 0 |
| Integrating Multi-Modal Input Token Mixer Into Mamba-Based Decision Models: Decision MetaMamba | Aug 20, 2024 | Mamba, Offline RL | —Unverified | 0 |
| Enhancing Reinforcement Learning Through Guided Search | Aug 19, 2024 | Offline RL, reinforcement-learning | —Unverified | 0 |
| Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds | Aug 16, 2024 | Model-based Reinforcement Learning, Offline RL | —Unverified | 0 |
| Experimental evaluation of offline reinforcement learning for HVAC control in buildings | Aug 15, 2024 | Offline RL, Reinforcement Learning (RL) | Code Available | 0 |
| D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning | Aug 15, 2024 | Deep Reinforcement Learning, Offline RL | —Unverified | 0 |
| Hybrid Reinforcement Learning Breaks Sample Size Barriers in Linear MDPs | Aug 8, 2024 | Offline RL, reinforcement-learning | —Unverified | 0 |
| Consistent time travel for realistic interactions with historical data: reinforcement learning for market making | Aug 5, 2024 | Offline RL | —Unverified | 0 |
| Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning | Jul 29, 2024 | Offline RL, reinforcement-learning | —Unverified | 0 |
| Language-Conditioned Offline RL for Multi-Robot Navigation | Jul 29, 2024 | Offline RL, Robot Navigation | —Unverified | 0 |
| Diffusion Models as Optimizers for Efficient Planning in Offline RL | Jul 23, 2024 | D4RL, Decision Making | Code Available | 0 |
| ROLeR: Effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems | Jul 18, 2024 | Offline RL, Recommendation Systems | Code Available | 0 |
| Sparsity-based Safety Conservatism for Constrained Offline Reinforcement Learning | Jul 17, 2024 | Autonomous Driving, Decision Making | —Unverified | 0 |
| BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning | Jul 15, 2024 | Model-based Reinforcement Learning, Offline RL | —Unverified | 0 |
| Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning | Jul 10, 2024 | Decision Making, Offline RL | —Unverified | 0 |
| FOSP: Fine-tuning Offline Safe Policy through World Models | Jul 6, 2024 | Model-based Reinforcement Learning, Offline RL | —Unverified | 0 |
| Robust Decision Transformer: Tackling Data Corruption in Offline RL via Sequence Modeling | Jul 5, 2024 | Offline RL, Reinforcement Learning (RL) | —Unverified | 0 |
| To Switch or Not to Switch? Balanced Policy Switching in Offline Reinforcement Learning | Jul 1, 2024 | Offline RL, Reinforcement Learning (RL) | —Unverified | 0 |
| Model-based Offline Reinforcement Learning with Lower Expectile Q-Learning | Jun 30, 2024 | D4RL, Offline RL | —Unverified | 0 |
| Benchmarks for Reinforcement Learning with Biased Offline Data and Imperfect Simulators | Jun 30, 2024 | Autonomous Vehicles, Offline RL | —Unverified | 0 |
| Preference Elicitation for Offline Reinforcement Learning | Jun 26, 2024 | Offline RL, reinforcement-learning | —Unverified | 0 |
| Equivariant Offline Reinforcement Learning | Jun 20, 2024 | Offline RL, Q-Learning | —Unverified | 0 |
| Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing | Jun 20, 2024 | Autonomous Driving, Data Augmentation | —Unverified | 0 |
| Order-Optimal Instance-Dependent Bounds for Offline Reinforcement Learning with Preference Feedback | Jun 18, 2024 | Offline RL, Reinforcement Learning (RL) | —Unverified | 0 |
| The Role of Inherent Bellman Error in Offline Reinforcement Learning with Linear Function Approximation | Jun 17, 2024 | Offline RL | —Unverified | 0 |
| Binary Reward Labeling: Bridging Offline Preference and Reward-Based Reinforcement Learning | Jun 14, 2024 | D4RL, Offline RL | —Unverified | 0 |
| SeMOPO: Learning High-quality Model and Policy from Low-quality Offline Visual Datasets | Jun 13, 2024 | D4RL, Offline RL | —Unverified | 0 |
| A Dual Approach to Imitation Learning from Observations with Offline Datasets | Jun 13, 2024 | Imitation Learning, Offline RL | —Unverified | 0 |
| DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for Offline Reinforcement Learning | Jun 13, 2024 | D4RL, Offline RL | —Unverified | 0 |
| Augmenting Offline RL with Unlabeled Data | Jun 11, 2024 | Offline RL, Transfer Learning | —Unverified | 0 |
| CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning | Jun 11, 2024 | D4RL, Denoising | —Unverified | 0 |
| Integrating Domain Knowledge for handling Limited Data in Offline RL | Jun 11, 2024 | Offline RL, Reinforcement Learning (RL) | —Unverified | 0 |
| Discovering Multiple Solutions from a Single Task in Offline Reinforcement Learning | Jun 10, 2024 | Offline RL, Reinforcement Learning (RL) | —Unverified | 0 |