| Why Online Reinforcement Learning is Causal | Mar 7, 2024 | counterfactualOffline RL | —Unverified | 0 |
| Offline Fictitious Self-Play for Competitive Games | Feb 29, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Trajectory-wise Iterative Reinforcement Learning Framework for Auto-bidding | Feb 23, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Align Your Intents: Offline Imitation Learning via Optimal Transport | Feb 20, 2024 | D4RLDecision Making | —Unverified | 0 |
| MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces | Feb 20, 2024 | Decision MakingOffline RL | CodeCode Available | 0 |
| Offline Multi-task Transfer RL with Representational Penalization | Feb 19, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Learning Goal-Conditioned Policies from Sub-Optimal Offline Data via Metric Learning | Feb 16, 2024 | Metric LearningOffline RL | —Unverified | 0 |
| Universal Black-Box Reward Poisoning Attack against Offline Reinforcement Learning | Feb 15, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Measurement Scheduling for ICU Patients with Offline Reinforcement Learning | Feb 12, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning | Feb 11, 2024 | Distributional Reinforcement LearningMulti-Armed Bandits | —Unverified | 0 |