| Neural Network Approximation for Pessimistic Offline Reinforcement Learning | Dec 19, 2023 | Deep Reinforcement Learning, Offline RL | Unverified | 0 |
| CUDC: A Curiosity-Driven Unsupervised Data Collection Method with Adaptive Temporal Distances for Offline Reinforcement Learning | Dec 19, 2023 | Navigate, Offline RL | Unverified | 0 |
| Advancing RAN Slicing with Offline Reinforcement Learning | Dec 16, 2023 | Management, Offline RL | Unverified | 0 |
| Traffic Signal Control Using Lightweight Transformers: An Offline-to-Online RL Approach | Dec 12, 2023 | Knowledge Distillation, Offline RL | Code Available | 1 |
| A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning | Dec 12, 2023 | MuJoCo, Offline RL | Unverified | 0 |
| The Generalization Gap in Offline Reinforcement Learning | Dec 10, 2023 | Offline RL, reinforcement-learning | Code Available | 1 |
| Model-Based Epistemic Variance of Values for Risk-Aware Policy Optimization | Dec 7, 2023 | Model-based Reinforcement Learning, Offline RL | Unverified | 0 |
| MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator | Dec 7, 2023 | Offline RL, reinforcement-learning | Code Available | 0 |
| Evaluation of Active Feature Acquisition Methods for Static Feature Settings | Dec 6, 2023 | Offline RL, reinforcement-learning | Unverified | 0 |
| Diffused Task-Agnostic Milestone Planner | Dec 6, 2023 | Decision Making, Offline RL | Unverified | 0 |
| H-GAP: Humanoid Control with a Generalist Planner | Dec 5, 2023 | Humanoid Control, Model Predictive Control | Unverified | 0 |
| SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation | Nov 30, 2023 | Offline RL, Off-policy evaluation | Code Available | 1 |
| Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning? A Theoretical Perspective | Nov 29, 2023 | Offline RL, reinforcement-learning | Unverified | 0 |
| Self-Driving Telescopes: Autonomous Scheduling of Astronomical Observation Campaigns with Offline Reinforcement Learning | Nov 29, 2023 | Astronomy, Offline RL | Unverified | 0 |
| A Fully Data-Driven Approach for Realistic Traffic Signal Control Using Offline Reinforcement Learning | Nov 27, 2023 | Offline RL, Reinforcement Learning (RL) | Unverified | 0 |
| Offline Reinforcement Learning for Wireless Network Optimization with Mixture Datasets | Nov 19, 2023 | Management, Offline RL | Unverified | 0 |
| Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees | Nov 14, 2023 | Offline RL | Code Available | 0 |
| Rethinking Decision Transformer via Hierarchical Reinforcement Learning | Nov 1, 2023 | Decision Making, Hierarchical Reinforcement Learning | Unverified | 0 |
| Safety-aware Causal Representation for Trustworthy Offline Reinforcement Learning in Autonomous Driving | Oct 31, 2023 | Autonomous Driving, Autonomous Vehicles | Unverified | 0 |
| A Tractable Inference Perspective of Offline RL | Oct 31, 2023 | MuJoCo, Offline RL | Unverified | 0 |
| Offline RL with Observation Histories: Analyzing and Improving Sample Complexity | Oct 31, 2023 | Autonomous Navigation, Offline RL | Unverified | 0 |
| Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning | Oct 31, 2023 | Few-Shot Learning, Offline RL | Code Available | 1 |
| Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning | Oct 30, 2023 | Decision Making, Offline RL | Code Available | 1 |
| Robust Offline Reinforcement learning with Heavy-Tailed Rewards | Oct 28, 2023 | Offline RL, Off-policy evaluation | Code Available | 0 |
| Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data Coverage | Oct 27, 2023 | Offline RL, Reinforcement Learning (RL) | Code Available | 0 |