| BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning | Jul 15, 2024 | Model-based Reinforcement LearningOffline RL | —Unverified | 0 | 0 |
| Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning | Feb 6, 2025 | Dataset GenerationMuJoCo | —Unverified | 0 | 0 |
| Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL | Jun 16, 2021 | D4RLDomain Generalization | —Unverified | 0 | 0 |
| Behavior Regularized Offline Reinforcement Learning | Nov 26, 2019 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Behaviour Discovery and Attribution for Explainable Reinforcement Learning | Mar 19, 2025 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Bellman Residual Orthogonalization for Offline Reinforcement Learning | Mar 24, 2022 | Offline RLOff-policy evaluation | —Unverified | 0 | 0 |
| Benchmarking Offline Reinforcement Learning Algorithms for E-Commerce Order Fraud Evaluation | Dec 5, 2022 | BenchmarkingBinary Classification | —Unverified | 0 | 0 |
| Benchmarks and Algorithms for Offline Preference-Based Reward Learning | Jan 3, 2023 | Active LearningOffline RL | —Unverified | 0 | 0 |
| Benchmarks for Reinforcement Learning with Biased Offline Data and Imperfect Simulators | Jun 30, 2024 | Autonomous VehiclesOffline RL | —Unverified | 0 | 0 |
| Bi-Level Offline Policy Optimization with Limited Exploration | Oct 10, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 | 0 |