Distributional Off-Policy Evaluation for Slate Recommendations Aug 27, 2023 Fairness Off-policy evaluation
Code Code Available 0Distributional Off-policy Evaluation with Bellman Residual Minimization Feb 2, 2024 Distributional Reinforcement Learning Off-policy evaluation
Code Code Available 0Robust Generalization despite Distribution Shift via Minimum Discriminating Information Jun 8, 2021 Generalization Bounds Off-policy evaluation
Code Code Available 0DOLCE: Decomposing Off-Policy Evaluation/Learning into Lagged and Current Effects May 2, 2025 Imputation Off-policy evaluation
Code Code Available 0Robust Offline Reinforcement learning with Heavy-Tailed Rewards Oct 28, 2023 Offline RL Off-policy evaluation
Code Code Available 0Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes Aug 22, 2019 Off-policy evaluation reinforcement-learning
Code Code Available 0Variational Latent Branching Model for Off-Policy Evaluation Jan 28, 2023 model Off-policy evaluation
Code Code Available 0Off-Policy Evaluation with Out-of-Sample Guarantees Jan 20, 2023 Off-policy evaluation valid
Code Code Available 0Counterfactual Learning with Multioutput Deep Kernels Nov 20, 2022 counterfactual Counterfactual Inference
Code Code Available 0Doubly Robust Estimator for Off-Policy Evaluation with Large Action Spaces Aug 7, 2023 Off-policy evaluation
Code Code Available 0Doubly Robust Kernel Statistics for Testing Distributional Treatment Effects Dec 9, 2022 Causal Inference counterfactual
Code Code Available 0Counterfactual Evaluation of Peer-Review Assignment Policies May 27, 2023 counterfactual Off-policy evaluation
Code Code Available 0Doubly robust off-policy evaluation with shrinkage Jul 22, 2019 Model Selection Multi-Armed Bandits
Code Code Available 0Towards Automatic Evaluation of Dialog Systems: A Model-Free Off-Policy Evaluation Approach Feb 20, 2021 Model-based Reinforcement Learning Off-policy evaluation
Code Code Available 0Conformal Off-policy Prediction Jun 14, 2022 Conformal Prediction Off-policy evaluation
Code Code Available 0Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision Processes Mar 29, 2024 Off-policy evaluation
Code Code Available 0Safe Exploration for Optimizing Contextual Bandits Feb 2, 2020 counterfactual Information Retrieval
Code Code Available 0Towards Hyperparameter-free Policy Selection for Offline Reinforcement Learning Oct 26, 2021 Off-policy evaluation Open-Ended Question Answering
Code Code Available 0Adaptive Estimator Selection for Off-Policy Evaluation Feb 18, 2020 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Off-Policy Evaluation and Learning for External Validity under a Covariate Shift Feb 26, 2020 Off-policy evaluation
Code Code Available 0On (Normalised) Discounted Cumulative Gain as an Off-Policy Evaluation Metric for Top-n Recommendation Jul 27, 2023 Information Retrieval Off-policy evaluation
Code Code Available 0Strictly Batch Imitation Learning by Energy-based Distribution Matching Jun 25, 2020 Imitation Learning Off-policy evaluation
Code Code Available 0Off-Policy Evaluation for Action-Dependent Non-Stationary Environments Jan 24, 2023 counterfactual Counterfactual Reasoning
Code Code Available 0Counterfactual-Augmented Importance Sampling for Semi-Offline Policy Evaluation Oct 26, 2023 counterfactual Off-policy evaluation
Code Code Available 0On the Reuse Bias in Off-Policy Reinforcement Learning Sep 15, 2022 continuous-control Continuous Control
Code Code Available 0Subgaussian and Differentiable Importance Sampling for Off-Policy Evaluation and Learning Dec 1, 2021 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Off-policy Evaluation with Deeply-abstracted States Jun 27, 2024 Off-policy evaluation
Code Code Available 0From Importance Sampling to Doubly Robust Policy Gradient Oct 20, 2019 Off-policy evaluation
Code Code Available 0Future-Dependent Value-Based Off-Policy Evaluation in POMDPs Jul 26, 2022 Off-policy evaluation
Code Code Available 0Confident Off-Policy Evaluation and Selection through Self-Normalized Importance Weighting Jun 18, 2020 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Abstract Reward Processes: Leveraging State Abstraction for Consistent Off-Policy Evaluation Oct 3, 2024 Autonomous Driving Off-policy evaluation
Code Code Available 0Hallucinated Adversarial Control for Conservative Offline Policy Evaluation Mar 2, 2023 continuous-control Continuous Control
Code Code Available 0Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity Nov 5, 2020 Diversity Off-policy evaluation
Code Code Available 0Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning Jul 21, 2023 Decision Making Deep Reinforcement Learning
Code Code Available 0Supervised Off-Policy Ranking Jul 3, 2021 Off-policy evaluation
Code Code Available 0Human Choice Prediction in Language-based Persuasion Games: Simulation-based Off-Policy Evaluation May 17, 2023 Decision Making Off-policy evaluation
Code Code Available 0Optimal and Adaptive Off-policy Evaluation in Contextual Bandits Dec 4, 2016 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Balanced Off-Policy Evaluation for Personalized Pricing Feb 24, 2023 Off-policy evaluation
Code Code Available 0Importance Sampling Policy Evaluation with an Estimated Behavior Policy Jun 4, 2018 Off-policy evaluation
Code Code Available 0A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes Nov 12, 2021 Off-policy evaluation
Code Code Available 0Semi-Parametric Efficient Policy Learning with Continuous Actions May 24, 2019 Off-policy evaluation
Code Code Available 0Balanced off-policy evaluation in general action spaces Jun 9, 2019 Binary Classification counterfactual
Code Code Available 0Off-policy evaluation for slate recommendation May 16, 2016 Learning-To-Rank Off-policy evaluation
Code Code Available 0Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for Reinforcement Learning Jun 9, 2019 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Kernel Metric Learning for In-Sample Off-Policy Evaluation of Deterministic RL Policies May 29, 2024 Metric Learning Off-policy evaluation
Code Code Available 0K-Nearest-Neighbor Resampling for Off-Policy Evaluation in Stochastic Control Jun 7, 2023 counterfactual Off-policy evaluation
Code Code Available 0Policy-Adaptive Estimator Selection for Off-Policy Evaluation Nov 25, 2022 counterfactual Off-policy evaluation
Code Code Available 0Learning Action Embeddings for Off-Policy Evaluation May 6, 2023 Off-policy evaluation
Code Code Available 0Off-policy Evaluation in Doubly Inhomogeneous Environments Jun 14, 2023 Offline RL Off-policy evaluation
Code Code Available 0Leveraging Factored Action Spaces for Off-Policy Evaluation Jul 13, 2023 counterfactual Off-policy evaluation
Code Code Available 0