| Uncertainty-aware Distributional Offline Reinforcement Learning | Mar 26, 2024 | Offline RL, reinforcement-learning | Unverified | 0 |
| Reinforcement Learning-based Recommender Systems with Large Language Models for State Reward and Action Modeling | Mar 25, 2024 | Offline RL, Recommendation Systems | Unverified | 0 |
| The Value of Reward Lookahead in Reinforcement Learning | Mar 18, 2024 | Offline RL, reinforcement-learning | Unverified | 0 |
| Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning | Mar 14, 2024 | Offline RL, Reinforcement Learning (RL) | Unverified | 0 |
| Towards Optimizing Human-Centric Objectives in AI-Assisted Decision-Making With Offline Reinforcement Learning | Mar 9, 2024 | Decision Making, Offline RL | Unverified | 0 |
| Why Online Reinforcement Learning is Causal | Mar 7, 2024 | counterfactual, Offline RL | Unverified | 0 |
| Offline Fictitious Self-Play for Competitive Games | Feb 29, 2024 | Offline RL, Reinforcement Learning (RL) | Unverified | 0 |
| Unsupervised Zero-Shot Reinforcement Learning via Functional Reward Encodings | Feb 27, 2024 | Diversity, Offline RL | Code Available | 2 |
| Trajectory-wise Iterative Reinforcement Learning Framework for Auto-bidding | Feb 23, 2024 | Offline RL, reinforcement-learning | Unverified | 0 |
| Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions | Feb 21, 2024 | Decision Making, Imitation Learning | Code Available | 2 |
| MORE-3S: Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces | Feb 20, 2024 | Decision Making, Offline RL | Code Available | 0 |
| Align Your Intents: Offline Imitation Learning via Optimal Transport | Feb 20, 2024 | D4RL, Decision Making | Unverified | 0 |
| Offline Multi-task Transfer RL with Representational Penalization | Feb 19, 2024 | Offline RL, Reinforcement Learning (RL) | Unverified | 0 |
| Learning Goal-Conditioned Policies from Sub-Optimal Offline Data via Metric Learning | Feb 16, 2024 | Metric Learning, Offline RL | Unverified | 0 |
| Universal Black-Box Reward Poisoning Attack against Offline Reinforcement Learning | Feb 15, 2024 | Offline RL, reinforcement-learning | Unverified | 0 |
| Measurement Scheduling for ICU Patients with Offline Reinforcement Learning | Feb 12, 2024 | Offline RL, reinforcement-learning | Unverified | 0 |
| Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL | Feb 11, 2024 | Offline RL | Code Available | 1 |
| More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning | Feb 11, 2024 | Distributional Reinforcement Learning, Multi-Armed Bandits | Unverified | 0 |
| Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices | Feb 8, 2024 | Federated Learning, Offline RL | Unverified | 0 |
| Real-World Fluid Directed Rigid Body Control via Deep Reinforcement Learning | Feb 8, 2024 | Deep Reinforcement Learning, Offline RL | Unverified | 0 |
| Offline Actor-Critic Reinforcement Learning Scales to Large Models | Feb 8, 2024 | continuous-control, Continuous Control | Unverified | 0 |
| A Primal-Dual Algorithm for Offline Constrained Reinforcement Learning with Linear MDPs | Feb 7, 2024 | Offline RL, Reinforcement Learning (RL) | Unverified | 0 |
| Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning | Feb 6, 2024 | D4RL, Offline RL | Code Available | 1 |
| SEABO: A Simple Search-Based Method for Offline Imitation Learning | Feb 6, 2024 | D4RL, Imitation Learning | Code Available | 1 |
| Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning | Feb 5, 2024 | Contrastive Learning, D4RL | Unverified | 0 |