| Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting | Dec 5, 2024 | D4RL, Offline RL | Unverified | 0 |
| Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback | Dec 3, 2024 | Object, Offline RL | Unverified | 0 |
| Revisiting Generative Policies: A Simpler Reinforcement Learning Algorithmic Perspective | Dec 2, 2024 | Density Estimation, Offline RL | Code Available | 2 |
| Robust Offline Reinforcement Learning with Linearly Structured f-Divergence Regularization | Nov 27, 2024 | Computational Efficiency, Offline RL | Unverified | 0 |
| PROGRESSOR: A Perceptually Guided Reward Estimator with Self-Supervised Online Refinement | Nov 26, 2024 | Offline RL, Reinforcement Learning (RL) | Unverified | 0 |
| Pretrained LLM Adapted with LoRA as a Decision Transformer for Offline RL in Quantitative Trading | Nov 26, 2024 | Offline RL, parameter-efficient fine-tuning | Code Available | 2 |
| LLM-Based Offline Learning for Embodied Agents via Consistency-Guided Reward Ensemble | Nov 26, 2024 | Offline RL, Reinforcement Learning (RL) | Unverified | 0 |
| Preserving Expert-Level Privacy in Offline Reinforcement Learning | Nov 18, 2024 | Offline RL, reinforcement-learning | Unverified | 0 |
| Continual Task Learning through Adaptive Policy Self-Composition | Nov 18, 2024 | Continual Learning, Offline RL | Code Available | 0 |
| Doubly Mild Generalization for Offline Reinforcement Learning | Nov 12, 2024 | MuJoCo, Offline RL | Code Available | 1 |