Off-Policy Evaluation for Large Action Spaces via Embeddings Feb 13, 2022 Multi-Armed Bandits Off-policy evaluation
Code Code Available 2Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model Feb 3, 2022 Multi-Armed Bandits Off-policy evaluation
Code Code Available 2Evaluating the Robustness of Off-Policy Evaluation Aug 31, 2021 Off-policy evaluation Recommendation Systems
Code Code Available 1COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation Apr 19, 2022 Offline RL Off-policy evaluation
Code Code Available 1Anytime-valid off-policy inference for contextual bandits Oct 19, 2022 counterfactual Multi-Armed Bandits
Code Code Available 1BCORLE(): An Offline Reinforcement Learning and Evaluation Framework for Coupons Allocation in E-commerce Market Dec 1, 2021 Off-policy evaluation reinforcement-learning
Code Code Available 1Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation Nov 30, 2023 Benchmarking counterfactual
Code Code Available 1Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning Feb 19, 2022 Off-policy evaluation
Code Code Available 1Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation Jun 24, 2021 Meta Reinforcement Learning Off-policy evaluation
Code Code Available 1Off-Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits Jun 3, 2021 Multi-Armed Bandits Off-policy evaluation
Code Code Available 1Benchmarks for Deep Off-Policy Evaluation Mar 30, 2021 Benchmarking continuous-control
Code Code Available 1A Policy-Guided Imitation Approach for Offline Reinforcement Learning Oct 15, 2022 D4RL Offline RL
Code Code Available 1Offline RL Without Off-Policy Evaluation Jun 16, 2021 D4RL Offline RL
Code Code Available 1A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation Jun 12, 2021 Deep Reinforcement Learning MuJoCo
Code Code Available 1Trajectory World Models for Heterogeneous Environments Feb 3, 2025 Diversity Model Predictive Control
Code Code Available 1Model Selection for Offline Reinforcement Learning: Practical Considerations for Healthcare Settings Jul 23, 2021 Computational Efficiency Decision Making
Code Code Available 1Off-Policy Evaluation of Ranking Policies under Diverse User Behavior Jun 26, 2023 Off-policy evaluation
Code Code Available 1SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation Nov 30, 2023 Offline RL Off-policy evaluation
Code Code Available 1Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation Aug 17, 2020 Off-policy evaluation
Code Code Available 1Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions Jul 25, 2020 counterfactual News Recommendation
Code Code Available 1Active Offline Policy Selection Jun 18, 2021 Bayesian Optimization Off-policy evaluation
Code Code Available 1Optimal Off-Policy Evaluation from Multiple Logging Policies Oct 21, 2020 Off-policy evaluation
Code Code Available 1Bayesian Off-Policy Evaluation and Learning for Large Action Spaces Feb 22, 2024 Computational Efficiency Off-policy evaluation
— Unverified 0Adaptive Trade-Offs in Off-Policy Learning Oct 16, 2019 Off-policy evaluation reinforcement-learning
— Unverified 0Debiasing Samples from Online Learning Using Bootstrap Jul 31, 2021 Off-policy evaluation Thompson Sampling
— Unverified 0Balancing Immediate Revenue and Future Off-Policy Evaluation in Coupon Allocation Jul 6, 2024 Off-policy evaluation
— Unverified 0An Instrumental Variable Approach to Confounded Off-Policy Evaluation Dec 29, 2022 Decision Making Off-policy evaluation
— Unverified 0Bayesian Counterfactual Mean Embeddings and Off-Policy Evaluation Nov 2, 2022 counterfactual Off-policy evaluation
— Unverified 0Data-Driven Off-Policy Estimator Selection: An Application in User Marketing on An Online Content Delivery Service Sep 17, 2021 Decision Making Marketing
— Unverified 0Accountable Off-Policy Evaluation via a Kernelized Bellman Statistics Jan 1, 2020 Off-policy evaluation
— Unverified 0Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization Apr 28, 2021 continuous-control Continuous Control
— Unverified 0Counterfactual Learning with General Data-generating Policies Dec 4, 2022 counterfactual Decision Making
— Unverified 0Data Poisoning Attacks on Off-Policy Policy Evaluation Methods Apr 6, 2024 Data Poisoning Off-policy evaluation
— Unverified 0Deep Jump Q-Evaluation for Offline Policy Evaluation in Continuous Action Space Sep 28, 2020 Off-policy evaluation Q-Learning
— Unverified 0A maximum-entropy approach to off-policy evaluation in average-reward MDPs Jun 17, 2020 Off-policy evaluation
— Unverified 0A Unified Off-Policy Evaluation Approach for General Value Function Jul 6, 2021 Anomaly Detection Off-policy evaluation
— Unverified 0Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation Sep 17, 2021 Decision Making Offline RL
— Unverified 0Combining Parametric and Nonparametric Models for Off-Policy Evaluation May 14, 2019 Mixture-of-Experts Off-policy evaluation
— Unverified 0A Fast Convergence Theory for Offline Decision Making Jun 3, 2024 Decision Making Offline RL
— Unverified 0CoinDICE: Off-Policy Confidence Interval Estimation Oct 22, 2020 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 0Characterization of Efficient Influence Function for Off-Policy Evaluation Under Optimal Policies May 20, 2025 counterfactual Off-policy evaluation
— Unverified 0Concept-driven Off Policy Evaluation Nov 28, 2024 Off-policy evaluation
— Unverified 0Confidence Interval for Off-Policy Evaluation from Dependent Samples via Bandit Algorithm: Approach from Standardized Martingales Jun 12, 2020 Off-policy evaluation
— Unverified 0Confident Natural Policy Gradient for Local Planning in q_π-realizable Constrained MDPs Jun 26, 2024 Off-policy evaluation
— Unverified 0Automated Off-Policy Estimator Selection via Supervised Learning Jun 26, 2024 counterfactual Off-policy evaluation
— Unverified 0Conformal Off-Policy Evaluation in Markov Decision Processes Apr 5, 2023 Conformal Prediction Off-policy evaluation
— Unverified 0Asymptotically Efficient Off-Policy Evaluation for Tabular Reinforcement Learning Jan 29, 2020 Off-policy evaluation reinforcement-learning
— Unverified 0Conformal Off-Policy Prediction in Contextual Bandits Jun 9, 2022 Conformal Prediction Multi-Armed Bandits
— Unverified 0Confounding-Robust Policy Evaluation in Infinite-Horizon Reinforcement Learning Feb 11, 2020 Off-policy evaluation reinforcement-learning
— Unverified 0Causality and Batch Reinforcement Learning: Complementary Approaches To Planning In Unknown Domains Jun 3, 2020 Autonomous Driving Causal Inference
— Unverified 0