| Measurement Scheduling for ICU Patients with Offline Reinforcement Learning | Feb 12, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL | Feb 11, 2024 | Offline RL | CodeCode Available | 1 |
| More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning | Feb 11, 2024 | Distributional Reinforcement LearningMulti-Armed Bandits | —Unverified | 0 |
| Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices | Feb 8, 2024 | Federated LearningOffline RL | —Unverified | 0 |
| Real-World Fluid Directed Rigid Body Control via Deep Reinforcement Learning | Feb 8, 2024 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| Offline Actor-Critic Reinforcement Learning Scales to Large Models | Feb 8, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| A Primal-Dual Algorithm for Offline Constrained Reinforcement Learning with Linear MDPs | Feb 7, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning | Feb 6, 2024 | D4RLOffline RL | CodeCode Available | 1 |
| SEABO: A Simple Search-Based Method for Offline Imitation Learning | Feb 6, 2024 | D4RLImitation Learning | CodeCode Available | 1 |
| Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning | Feb 5, 2024 | Contrastive LearningD4RL | —Unverified | 0 |