| Targeted Environment Design from Offline Data | Sep 29, 2021 | Offline RL, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| The Challenges of Exploration for Offline Reinforcement Learning | Jan 27, 2022 | Model Predictive Control, Offline RL | —Unverified | 0 | 0 |
| The Essential Elements of Offline RL via Supervised Learning | Sep 29, 2021 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| The Least Restriction for Offline Reinforcement Learning | Jul 5, 2021 | Offline RL, Q-Learning | —Unverified | 0 | 0 |
| The Pitfalls of Imitation Learning when Actions are Continuous | Mar 12, 2025 | Chunking, Imitation Learning | —Unverified | 0 | 0 |
| The Provable Benefits of Unsupervised Data Sharing for Offline Reinforcement Learning | Feb 27, 2023 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| The reinforcement learning-based multi-agent cooperative approach for the adaptive speed regulation on a metallurgical pickling line | Aug 16, 2020 | Multi-agent Reinforcement Learning, Offline RL | —Unverified | 0 | 0 |
| The Role of Coverage in Online Reinforcement Learning | Oct 9, 2022 | Efficient Exploration, Offline RL | —Unverified | 0 | 0 |
| The Role of Inherent Bellman Error in Offline Reinforcement Learning with Linear Function Approximation | Jun 17, 2024 | Offline RL | —Unverified | 0 | 0 |
| The Value of Reward Lookahead in Reinforcement Learning | Mar 18, 2024 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| The Virtues of Pessimism in Inverse Reinforcement Learning | Feb 4, 2024 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| To Switch or Not to Switch? Balanced Policy Switching in Offline Reinforcement Learning | Jul 1, 2024 | Offline RL, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Toward Explainable Offline RL: Analyzing Representations in Intrinsically Motivated Decision Transformers | Jun 16, 2025 | Decision Making, Decision Making Under Uncertainty | —Unverified | 0 | 0 |
| Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers | Apr 28, 2022 | Decision Making, Offline RL | —Unverified | 0 | 0 |
| Towards Generalizable Reinforcement Learning for Trade Execution | May 12, 2023 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| Towards Instance-Optimal Offline Reinforcement Learning with Pessimism | Oct 17, 2021 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| Towards Optimal Differentially Private Regret Bounds in Linear MDPs | Apr 12, 2025 | Offline RL, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Towards Optimizing Human-Centric Objectives in AI-Assisted Decision-Making With Offline Reinforcement Learning | Mar 9, 2024 | Decision Making, Offline RL | —Unverified | 0 | 0 |
| Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses | May 18, 2024 | D4RL, Offline RL | —Unverified | 0 | 0 |
| Tractable Offline Learning of Regular Decision Processes | Sep 4, 2024 | Offline RL, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Trajectory Data Suffices for Statistically Efficient Learning in Offline RL with Linear q^π-Realizability and Concentrability | May 27, 2024 | Computational Efficiency, Offline RL | —Unverified | 0 | 0 |
| Trajectory-wise Iterative Reinforcement Learning Framework for Auto-bidding | Feb 23, 2024 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| Transferred Q-learning | Feb 9, 2022 | Offline RL, Q-Learning | —Unverified | 0 | 0 |
| UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning | Jun 5, 2024 | D4RL, Offline RL | —Unverified | 0 | 0 |
| UMBRELLA: Uncertainty-Aware Model-Based Offline Reinforcement Learning Leveraging Planning | Nov 22, 2021 | Decision Making, Offline RL | —Unverified | 0 | 0 |