Offline Pre-trained Multi-Agent Decision Transformer Sep 29, 2021 Multi-agent Reinforcement Learning reinforcement-learning
Offline Primal-Dual Reinforcement Learning for Linear MDPs May 22, 2023 Offline RL reinforcement-learning
Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes Nov 28, 2022 Offline RL Q-Learning
Offline Reinforcement Learning and Sequence Modeling for Downlink Link Adaptation Oct 30, 2024 Offline RL Q-Learning
Offline Reinforcement Learning as Anti-Exploration Jun 11, 2021 continuous-control Continuous Control
Offline Reinforcement Learning at Multiple Frequencies Jul 26, 2022 Offline RL reinforcement-learning
Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information Dec 23, 2022 Decision Making Off-policy evaluation
Offline reinforcement learning for job-shop scheduling problems Oct 21, 2024 Combinatorial Optimization Deep Learning
Offline Reinforcement Learning for Large Scale Language Action Spaces Sep 29, 2021 Language Modeling Language Modelling
Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management Feb 21, 2023 Dialogue Management Diversity
Offline Reinforcement Learning for Mobile Notifications Feb 4, 2022 Attribute Recommendation Systems
Offline Reinforcement Learning for Road Traffic Control Jan 7, 2022 Offline RL reinforcement-learning
Offline Reinforcement Learning for Wireless Network Optimization with Mixture Datasets Nov 19, 2023 Management Offline RL
Offline Reinforcement Learning: Fundamental Barriers for Value Function Approximation Nov 21, 2021 Decision Making Offline RL
Offline Reinforcement Learning Hands-On Nov 29, 2020 Behavioural cloning Decision Making
Offline Reinforcement Learning Under Value and Density-Ratio Realizability: The Power of Gaps Mar 25, 2022 Offline RL Reinforcement Learning (RL)
Offline Reinforcement Learning with Pseudometric Learning Mar 2, 2021 reinforcement-learning Reinforcement Learning
Offline reinforcement learning with uncertainty for treatment strategies in sepsis Jul 9, 2021 reinforcement-learning Reinforcement Learning
Offline Reinforcement Learning with Realizability and Single-policy Concentrability Feb 9, 2022 Offline RL reinforcement-learning
Offline Reinforcement Learning with Differential Privacy Jun 2, 2022 Offline RL reinforcement-learning
Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes Sep 18, 2022 Offline RL reinforcement-learning
Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient Oct 3, 2022 Decision Making Offline RL
Offline Reinforcement Learning with Imbalanced Datasets Jul 6, 2023 D4RL Offline RL
Offline Reinforcement Learning with Behavioral Supervisor Tuning Apr 25, 2024 Offline RL reinforcement-learning
Offline Reinforcement Learning with Adaptive Behavior Regularization Nov 15, 2022 D4RL Offline RL
Offline Reinforcement Learning with Causal Structured World Models Jun 3, 2022 Model-based Reinforcement Learning Offline RL
Offline Reinforcement Learning with Closed-Form Policy Improvement Operators Nov 29, 2022 D4RL Form
Offline Reinforcement Learning with Discrete Diffusion Skills Mar 26, 2025 Decoder Offline RL
Offline Reinforcement Learning with Fisher Divergence Critic Regularization Mar 14, 2021 Offline RL reinforcement-learning
Offline Reinforcement Learning with On-Policy Q-Function Regularization Jul 25, 2023 D4RL reinforcement-learning
Offline Reinforcement Learning with Resource Constrained Online Deployment Sep 29, 2021 D4RL Offline RL
Offline Reinforcement Learning with Soft Behavior Regularization Oct 14, 2021 continuous-control Continuous Control
Offline RL with Observation Histories: Analyzing and Improving Sample Complexity Oct 31, 2023 Autonomous Navigation Offline RL
Offline RL With Realistic Datasets: Heteroskedasticity and Support Constraints Nov 2, 2022 Atari Games Offline RL
Offline Robotic World Model: Learning Robotic Policies without a Physics Simulator Apr 23, 2025 Offline RL Reinforcement Learning (RL)
Offline Robot Reinforcement Learning with Uncertainty-Guided Human Expert Sampling Dec 16, 2022 MuJoCo Q-Learning
Offline Trajectory Generalization for Offline Reinforcement Learning Apr 16, 2024 D4RL Data Augmentation
Off-Policy Deep Reinforcement Learning Algorithms for Handling Various Robotic Manipulator Tasks Dec 11, 2022 Deep Reinforcement Learning MuJoCo
Off-Policy Deep Reinforcement Learning by Bootstrapping the Covariate Shift Jan 27, 2019 Deep Reinforcement Learning reinforcement-learning
Off-Policy Evaluation for Human Feedback Oct 11, 2023 Off-policy evaluation Reinforcement Learning (RL)
Off-policy Evaluation in Infinite-Horizon Reinforcement Learning with Latent Confounders Jul 27, 2020 Off-policy evaluation reinforcement-learning
Off-Policy Evaluation in Partially Observable Environments Sep 9, 2019 Off-policy evaluation Reinforcement Learning
Off-Policy Evaluation via Off-Policy Classification Jun 4, 2019 Classification Deep Reinforcement Learning
Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory Feb 10, 2022 Off-policy evaluation Reinforcement Learning (RL)
Off-Policy Meta-Reinforcement Learning Based on Feature Embedding Spaces Jan 6, 2021 Meta Reinforcement Learning reinforcement-learning
Off-Policy Policy Gradient Algorithms by Constraining the State Distribution Shift Nov 16, 2019 continuous-control Continuous Control
Off-policy reinforcement learning for H∞ control design Nov 24, 2013 reinforcement-learning Reinforcement Learning
Off-Policy Reinforcement Learning with Delayed Rewards Jun 22, 2021 Deep Reinforcement Learning reinforcement-learning
Off-policy Reinforcement Learning with Optimistic Exploration and Distribution Correction Oct 22, 2021 continuous-control Continuous Control
Off-Policy Reinforcement Learning with High Dimensional Reward Aug 14, 2024 reinforcement-learning Reinforcement Learning