| Advancing RAN Slicing with Offline Reinforcement Learning | Dec 16, 2023 | ManagementOffline RL | —Unverified | 0 |
| A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning | Dec 12, 2023 | MuJoCoOffline RL | —Unverified | 0 |
| Model-Based Epistemic Variance of Values for Risk-Aware Policy Optimization | Dec 7, 2023 | Model-based Reinforcement LearningOffline RL | —Unverified | 0 |
| MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator | Dec 7, 2023 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Diffused Task-Agnostic Milestone Planner | Dec 6, 2023 | Decision MakingOffline RL | —Unverified | 0 |
| Evaluation of Active Feature Acquisition Methods for Static Feature Settings | Dec 6, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| H-GAP: Humanoid Control with a Generalist Planner | Dec 5, 2023 | Humanoid ControlModel Predictive Control | —Unverified | 0 |
| Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning? A Theoretical Perspective | Nov 29, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Self-Driving Telescopes: Autonomous Scheduling of Astronomical Observation Campaigns with Offline Reinforcement Learning | Nov 29, 2023 | AstronomyOffline RL | —Unverified | 0 |
| A Fully Data-Driven Approach for Realistic Traffic Signal Control Using Offline Reinforcement Learning | Nov 27, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Offline Reinforcement Learning for Wireless Network Optimization with Mixture Datasets | Nov 19, 2023 | ManagementOffline RL | —Unverified | 0 |
| Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees | Nov 14, 2023 | Offline RL | CodeCode Available | 0 |
| Rethinking Decision Transformer via Hierarchical Reinforcement Learning | Nov 1, 2023 | Decision MakingHierarchical Reinforcement Learning | —Unverified | 0 |
| Offline RL with Observation Histories: Analyzing and Improving Sample Complexity | Oct 31, 2023 | Autonomous NavigationOffline RL | —Unverified | 0 |
| Safety-aware Causal Representation for Trustworthy Offline Reinforcement Learning in Autonomous Driving | Oct 31, 2023 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| A Tractable Inference Perspective of Offline RL | Oct 31, 2023 | MuJoCoOffline RL | —Unverified | 0 |
| Robust Offline Reinforcement learning with Heavy-Tailed Rewards | Oct 28, 2023 | Offline RLOff-policy evaluation | CodeCode Available | 0 |
| Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data Coverage | Oct 27, 2023 | Offline RLReinforcement Learning (RL) | CodeCode Available | 0 |
| Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning | Oct 27, 2023 | Autonomous DrivingD4RL | —Unverified | 0 |
| Model-enhanced Contrastive Reinforcement Learning for Sequential Recommendation | Oct 25, 2023 | Contrastive Learningmodel | —Unverified | 0 |
| Finetuning Offline World Models in the Real World | Oct 24, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Corruption-Robust Offline Reinforcement Learning with General Function Approximation | Oct 23, 2023 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning | Oct 18, 2023 | Offline RLQuantization | —Unverified | 0 |
| Building Persona Consistent Dialogue Agents with Offline Reinforcement Learning | Oct 16, 2023 | ChatbotOffline RL | CodeCode Available | 0 |
| End-to-end Offline Reinforcement Learning for Glycemia Control | Oct 16, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |