| Real-World Fluid Directed Rigid Body Control via Deep Reinforcement Learning | Feb 8, 2024 | Deep Reinforcement Learning, Offline RL | —Unverified | 0 |
| Real-World Offline Reinforcement Learning from Vision Language Model Feedback | Nov 8, 2024 | Language Modeling, Language Modelling | —Unverified | 0 |
| Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage | Feb 5, 2023 | Offline RL, Q-Learning | —Unverified | 0 |
| Regularized Behavior Value Estimation | Mar 17, 2021 | Offline RL | —Unverified | 0 |
| Reinforced Self-Training (ReST) for Language Modeling | Aug 17, 2023 | Language Modeling, Language Modelling | —Unverified | 0 |
| Reinforcement Learning: An Overview | Dec 6, 2024 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Reinforcement Learning-based Recommender Systems with Large Language Models for State Reward and Action Modeling | Mar 25, 2024 | Offline RL, Recommendation Systems | —Unverified | 0 |
| Reinforcement Learning for Individual Optimal Policy from Heterogeneous Data | May 14, 2025 | Offline RL, reinforcement-learning | —Unverified | 0 |
| Reinforcement Learning with Human Feedback: Learning Dynamic Choices via Pessimism | May 29, 2023 | Decision Making, Econometrics | —Unverified | 0 |
| Reliable validation of Reinforcement Learning Benchmarks | Mar 2, 2022 | Benchmarking, Data Compression | —Unverified | 0 |