| Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks | Aug 20, 2024 | Multi-agent Reinforcement LearningMulti-Task Learning | CodeCode Available | 2 |
| Offline Model-Based Reinforcement Learning with Anti-Exploration | Aug 20, 2024 | D4RLmodel | —Unverified | 0 |
| Integrating Multi-Modal Input Token Mixer Into Mamba-Based Decision Models: Decision MetaMamba | Aug 20, 2024 | MambaOffline RL | —Unverified | 0 |
| Enhancing Reinforcement Learning Through Guided Search | Aug 19, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds | Aug 16, 2024 | Model-based Reinforcement LearningOffline RL | —Unverified | 0 |
| D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning | Aug 15, 2024 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| Experimental evaluation of offline reinforcement learning for HVAC control in buildings | Aug 15, 2024 | Offline RLReinforcement Learning (RL) | CodeCode Available | 0 |
| Hybrid Reinforcement Learning Breaks Sample Size Barriers in Linear MDPs | Aug 8, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Consistent time travel for realistic interactions with historical data: reinforcement learning for market making | Aug 5, 2024 | Offline RL | —Unverified | 0 |
| Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning | Jul 29, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |