| NeoRL-2: Near Real-World Benchmarks for Offline Reinforcement Learning with Extended Realistic Scenarios | Mar 25, 2025 | BenchmarkingOffline RL | CodeCode Available | 1 |
| Behaviour Discovery and Attribution for Explainable Reinforcement Learning | Mar 19, 2025 | Offline RLreinforcement-learning | —Unverified | 0 |
| Evaluation-Time Policy Switching for Offline Reinforcement Learning | Mar 15, 2025 | Behavioural cloningOffline RL | —Unverified | 0 |
| The Pitfalls of Imitation Learning when Actions are Continuous | Mar 12, 2025 | ChunkingImitation Learning | —Unverified | 0 |
| Policy Regularization on Globally Accessible States in Cross-Dynamics Reinforcement Learning | Mar 10, 2025 | Imitation LearningOffline RL | —Unverified | 0 |
| Policy Constraint by Only Support Constraint for Offline Reinforcement Learning | Mar 7, 2025 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Energy-Weighted Flow Matching for Offline Reinforcement Learning | Mar 6, 2025 | Offline RLreinforcement-learning | —Unverified | 0 |
| What Makes a Good Diffusion Planner for Decision Making? | Mar 1, 2025 | Action GenerationDecision Making | CodeCode Available | 2 |
| Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction | Feb 28, 2025 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Yes, Q-learning Helps Offline In-Context RL | Feb 24, 2025 | In-Context Reinforcement LearningMuJoCo | —Unverified | 0 |
| Enhancing Offline Model-Based RL via Active Model Selection: A Bayesian Optimization Perspective | Feb 17, 2025 | Bayesian Optimizationmodel | —Unverified | 0 |
| Which Features are Best for Successor Features? | Feb 15, 2025 | Offline RL | —Unverified | 0 |
| Diverse Transformer Decoding for Offline Reinforcement Learning Using Financial Algorithmic Approaches | Feb 13, 2025 | D4RLOffline RL | —Unverified | 0 |
| Active Advantage-Aligned Online Reinforcement Learning with Offline Data | Feb 11, 2025 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Enhancing Pre-Trained Decision Transformers with Prompt-Tuning Bandits | Feb 7, 2025 | InformativenessOffline RL | —Unverified | 0 |
| Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning | Feb 6, 2025 | Dataset GenerationMuJoCo | —Unverified | 0 |
| OmniRL: In-Context Reinforcement Learning by Large-Scale Meta-Training in Randomized Worlds | Feb 5, 2025 | Few-Shot LearningImitation Learning | —Unverified | 0 |
| Policy-Guided Causal State Representation for Offline Reinforcement Learning Recommendation | Feb 4, 2025 | feature selectionOffline RL | —Unverified | 0 |
| Flow Q-Learning | Feb 4, 2025 | Action GenerationD4RL | CodeCode Available | 3 |
| GNN-DT: Graph Neural Network Enhanced Decision Transformer for Efficient Optimization in Dynamic Environments | Feb 3, 2025 | Efficient ExplorationGraph Neural Network | CodeCode Available | 1 |
| Resilient UAV Trajectory Planning via Few-Shot Meta-Offline Reinforcement Learning | Feb 3, 2025 | Meta-LearningOffline RL | —Unverified | 0 |
| Flexible Blood Glucose Control: Offline Reinforcement Learning from Human Feedback | Jan 27, 2025 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Data Center Cooling System Optimization Using Offline Reinforcement Learning | Jan 25, 2025 | Graph Neural NetworkOffline RL | —Unverified | 0 |
| Fat-to-Thin Policy Optimization: Offline RL with Sparse Policies | Jan 24, 2025 | MuJoCoOffline RL | CodeCode Available | 0 |
| Large Language Model driven Policy Exploration for Recommender Systems | Jan 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |