| Model-based Offline Reinforcement Learning with Lower Expectile Q-Learning | Jun 30, 2024 | D4RLOffline RL | —Unverified | 0 |
| Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach | May 8, 2025 | D4RLDecision Making | —Unverified | 0 |
| Targeted Environment Design from Offline Data | Sep 29, 2021 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| The Challenges of Exploration for Offline Reinforcement Learning | Jan 27, 2022 | Model Predictive ControlOffline RL | —Unverified | 0 |
| The Essential Elements of Offline RL via Supervised Learning | Sep 29, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| The Least Restriction for Offline Reinforcement Learning | Jul 5, 2021 | Offline RLQ-Learning | —Unverified | 0 |
| The Pitfalls of Imitation Learning when Actions are Continuous | Mar 12, 2025 | ChunkingImitation Learning | —Unverified | 0 |
| The Provable Benefits of Unsupervised Data Sharing for Offline Reinforcement Learning | Feb 27, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| The reinforcement learning-based multi-agent cooperative approach for the adaptive speed regulation on a metallurgical pickling line | Aug 16, 2020 | Multi-agent Reinforcement LearningOffline RL | —Unverified | 0 |
| The Role of Coverage in Online Reinforcement Learning | Oct 9, 2022 | Efficient ExplorationOffline RL | —Unverified | 0 |
| The Role of Inherent Bellman Error in Offline Reinforcement Learning with Linear Function Approximation | Jun 17, 2024 | Offline RL | —Unverified | 0 |
| The Value of Reward Lookahead in Reinforcement Learning | Mar 18, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| The Virtues of Pessimism in Inverse Reinforcement Learning | Feb 4, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| To Switch or Not to Switch? Balanced Policy Switching in Offline Reinforcement Learning | Jul 1, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Toward Explainable Offline RL: Analyzing Representations in Intrinsically Motivated Decision Transformers | Jun 16, 2025 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers | Apr 28, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| Towards Generalizable Reinforcement Learning for Trade Execution | May 12, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Towards Instance-Optimal Offline Reinforcement Learning with Pessimism | Oct 17, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| Towards Optimal Differentially Private Regret Bounds in Linear MDPs | Apr 12, 2025 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Towards Optimizing Human-Centric Objectives in AI-Assisted Decision-Making With Offline Reinforcement Learning | Mar 9, 2024 | Decision MakingOffline RL | —Unverified | 0 |
| Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses | May 18, 2024 | D4RLOffline RL | —Unverified | 0 |
| Tractable Offline Learning of Regular Decision Processes | Sep 4, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Trajectory Data Suffices for Statistically Efficient Learning in Offline RL with Linear q^π-Realizability and Concentrability | May 27, 2024 | Computational EfficiencyOffline RL | —Unverified | 0 |
| Trajectory-wise Iterative Reinforcement Learning Framework for Auto-bidding | Feb 23, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Transferred Q-learning | Feb 9, 2022 | Offline RLQ-Learning | —Unverified | 0 |
| UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning | Jun 5, 2024 | D4RLOffline RL | —Unverified | 0 |
| UMBRELLA: Uncertainty-Aware Model-Based Offline Reinforcement Learning Leveraging Planning | Nov 22, 2021 | Decision MakingOffline RL | —Unverified | 0 |
| Uncertainty-Aware Decision Transformer for Stochastic Driving Environments | Sep 28, 2023 | Autonomous DrivingOffline RL | —Unverified | 0 |
| Uncertainty-aware Distributional Offline Reinforcement Learning | Mar 26, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Uncertainty Regularized Policy Learning for Offline Reinforcement Learning | Sep 29, 2021 | D4RLOffline RL | —Unverified | 0 |
| Uncertainty Weighted Offline Reinforcement Learning | Jan 1, 2021 | Offline RLQ-Learning | —Unverified | 0 |
| Understanding Reinforcement Learning Algorithms: The Progress from Basic Q-learning to Proximal Policy Optimization | Mar 31, 2023 | Offline RLQ-Learning | —Unverified | 0 |
| Unearthing Gems from Stones: Policy Optimization with Negative Sample Augmentation for LLM Reasoning | May 20, 2025 | MathOffline RL | —Unverified | 0 |
| Unified Emulation-Simulation Training Environment for Autonomous Cyber Agents | Apr 3, 2023 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| Unsupervised-to-Online Reinforcement Learning | Aug 27, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing | Jun 20, 2024 | Autonomous DrivingData Augmentation | —Unverified | 0 |
| User-Interactive Offline Reinforcement Learning | May 21, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Adaptive Q-Aid for Conditional Supervised Learning in Offline Reinforcement Learning | Feb 3, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Value Penalized Q-Learning for Recommender Systems | Oct 15, 2021 | Offline RLQ-Learning | —Unverified | 0 |
| Variational oracle guiding for reinforcement learning | Sep 29, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Video-Enhanced Offline Reinforcement Learning: A Model-Based Approach | May 10, 2025 | Autonomous DrivingOffline RL | —Unverified | 0 |
| VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning | Apr 16, 2025 | D4RLOffline RL | —Unverified | 0 |
| Wall Street Tree Search: Risk-Aware Planning for Offline Reinforcement Learning | Nov 6, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap | Jun 20, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| What are the Statistical Limits of Offline RL with Linear Function Approximation? | Oct 22, 2020 | Decision MakingOffline RL | —Unverified | 0 |
| What Matters for Batch Online Reinforcement Learning in Robotics? | May 12, 2025 | Imitation LearningOffline RL | —Unverified | 0 |
| When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning? | Apr 12, 2022 | Atari GamesDiagnostic | —Unverified | 0 |
| Which Features are Best for Successor Features? | Feb 15, 2025 | Offline RL | —Unverified | 0 |
| Why Online Reinforcement Learning is Causal | Mar 7, 2024 | counterfactualOffline RL | —Unverified | 0 |
| Why so pessimistic? Estimating uncertainties for offline RL through ensembles, and why their independence matters. | Sep 29, 2021 | continuous-controlContinuous Control | —Unverified | 0 |