Control Variates for Slate Off-Policy Evaluation Jun 15, 2021 Off-policy evaluation Recommendation Systems
Code Code Available 05 Off-Policy Evaluation for Action-Dependent Non-Stationary Environments Jan 24, 2023 counterfactual Counterfactual Reasoning
Code Code Available 05 Adaptive Estimator Selection for Off-Policy Evaluation Feb 18, 2020 Multi-Armed Bandits Off-policy evaluation
Code Code Available 05 Counterfactual-Augmented Importance Sampling for Semi-Offline Policy Evaluation Oct 26, 2023 counterfactual Off-policy evaluation
Code Code Available 05 More Robust Doubly Robust Off-policy Evaluation Feb 10, 2018 Multi-Armed Bandits Off-policy evaluation
Code Code Available 05 Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes Aug 22, 2019 Off-policy evaluation reinforcement-learning
Code Code Available 05 Model-Free and Model-Based Policy Evaluation when Causality is Uncertain Apr 2, 2022 model Off-policy evaluation
Code Code Available 05 Counterfactual Learning with Multioutput Deep Kernels Nov 20, 2022 counterfactual Counterfactual Inference
Code Code Available 05 Off-Policy Evaluation with Out-of-Sample Guarantees Jan 20, 2023 Off-policy evaluation valid
Code Code Available 05 Predictive Performance Comparison of Decision Policies Under Confounding Apr 1, 2024 Causal Inference Decision Making
Code Code Available 05 Long-term Off-Policy Evaluation and Learning Apr 24, 2024 Off-policy evaluation
Code Code Available 05 Cross-Validated Off-Policy Evaluation May 24, 2024 Model Selection Off-policy evaluation
Code Code Available 05 Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions Oct 24, 2022 Metric Learning Multi-Armed Bandits
Code Code Available 05 Low Variance Off-policy Evaluation with State-based Importance Sampling Dec 7, 2022 Density Ratio Estimation Off-policy evaluation
Code Code Available 05 Learning Action Embeddings for Off-Policy Evaluation May 6, 2023 Off-policy evaluation
Code Code Available 05 Kernel Metric Learning for In-Sample Off-Policy Evaluation of Deterministic RL Policies May 29, 2024 Metric Learning Off-policy evaluation
Code Code Available 05 Leveraging Factored Action Spaces for Off-Policy Evaluation Jul 13, 2023 counterfactual Off-policy evaluation
Code Code Available 05 Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits Dec 3, 2023 Causal Inference Multi-Armed Bandits
Code Code Available 05 Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity Nov 5, 2020 Diversity Off-policy evaluation
Code Code Available 05 Deep Proxy Causal Learning and its Application to Confounded Bandit Policy Evaluation Jun 7, 2021 Off-policy evaluation
Code Code Available 05 Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning Jul 21, 2023 Decision Making Deep Reinforcement Learning
Code Code Available 05 Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for Reinforcement Learning Jun 9, 2019 Multi-Armed Bandits Off-policy evaluation
Code Code Available 05 K-Nearest-Neighbor Resampling for Off-Policy Evaluation in Stochastic Control Jun 7, 2023 counterfactual Off-policy evaluation
Code Code Available 05 Distributional Off-Policy Evaluation for Slate Recommendations Aug 27, 2023 Fairness Off-policy evaluation
Code Code Available 05 Distributional Off-policy Evaluation with Bellman Residual Minimization Feb 2, 2024 Distributional Reinforcement Learning Off-policy evaluation
Code Code Available 05 Causal Deepsets for Off-policy Evaluation under Spatial or Spatio-temporal Interferences Jul 25, 2024 Off-policy evaluation
Code Code Available 05 DOLCE: Decomposing Off-Policy Evaluation/Learning into Lagged and Current Effects May 2, 2025 Imputation Off-policy evaluation
Code Code Available 05 Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and Learning May 23, 2024 Off-policy evaluation
Code Code Available 05 Deeply-Debiased Off-Policy Interval Estimation May 10, 2021 Off-policy evaluation
Code Code Available 05 Deep Jump Learning for Off-Policy Evaluation in Continuous Treatment Settings Oct 29, 2020 Change Point Detection Off-policy evaluation
Code Code Available 05 Abstract Reward Processes: Leveraging State Abstraction for Consistent Off-Policy Evaluation Oct 3, 2024 Autonomous Driving Off-policy evaluation
Code Code Available 05 Human Choice Prediction in Language-based Persuasion Games: Simulation-based Off-Policy Evaluation May 17, 2023 Decision Making Off-policy evaluation
Code Code Available 05 From Importance Sampling to Doubly Robust Policy Gradient Oct 20, 2019 Off-policy evaluation
Code Code Available 05 Doubly Robust Kernel Statistics for Testing Distributional Treatment Effects Dec 9, 2022 Causal Inference counterfactual
Code Code Available 05 Future-Dependent Value-Based Off-Policy Evaluation in POMDPs Jul 26, 2022 Off-policy evaluation
Code Code Available 05 Doubly robust off-policy evaluation with shrinkage Jul 22, 2019 Model Selection Multi-Armed Bandits
Code Code Available 05 A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes Nov 12, 2021 Off-policy evaluation
Code Code Available 05 Off-policy evaluation for slate recommendation May 16, 2016 Learning-To-Rank Off-policy evaluation
Code Code Available 05 Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision Processes Mar 29, 2024 Off-policy evaluation
Code Code Available 05 RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions Dec 11, 2023 Multi-Armed Bandits Off-policy evaluation
Code Code Available 05 Confident Off-Policy Evaluation and Selection through Self-Normalized Importance Weighting Jun 18, 2020 Multi-Armed Bandits Off-policy evaluation
Code Code Available 05 Off-policy Evaluation with Deeply-abstracted States Jun 27, 2024 Off-policy evaluation
Code Code Available 05 Hallucinated Adversarial Control for Conservative Offline Policy Evaluation Mar 2, 2023 continuous-control Continuous Control
Code Code Available 05 Optimal and Adaptive Off-policy Evaluation in Contextual Bandits Dec 4, 2016 Multi-Armed Bandits Off-policy evaluation
Code Code Available 05 Importance Sampling Policy Evaluation with an Estimated Behavior Policy Jun 4, 2018 Off-policy evaluation
Code Code Available 05 Minimum Empirical Divergence for Sub-Gaussian Linear Bandits Oct 31, 2024 Multi-Armed Bandits Off-policy evaluation
Code Code Available 05 Proximal Reinforcement Learning: Efficient Off-Policy Evaluation in Partially Observed Markov Decision Processes Oct 28, 2021 Causal Inference Management
Code Code Available 05 A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation in Two-sided Markets Feb 21, 2022 Management Multi-agent Reinforcement Learning
Code Code Available 05 Balanced Off-Policy Evaluation for Personalized Pricing Feb 24, 2023 Off-policy evaluation
Code Code Available 05 Universal Off-Policy Evaluation Apr 26, 2021 counterfactual Decision Making
Code Code Available 05