Balanced off-policy evaluation in general action spaces Jun 9, 2019 Binary Classification counterfactual
Code Code Available 05 Deep Jump Learning for Off-Policy Evaluation in Continuous Treatment Settings Oct 29, 2020 Change Point Detection Off-policy evaluation
Code Code Available 05 Off-Policy Evaluation for Sequential Persuasion Process with Unobserved Confounding Apr 1, 2025 Decision Making Off-policy evaluation
— Unverified 00 Off-Policy Evaluation from Logged Human Feedback Jun 14, 2024 Off-policy evaluation
— Unverified 00 Off-Policy Evaluation in Embedded Spaces Mar 5, 2022 Density Ratio Estimation Off-policy evaluation
— Unverified 00 Off-policy Evaluation in Infinite-Horizon Reinforcement Learning with Latent Confounders Jul 27, 2020 Off-policy evaluation reinforcement-learning
— Unverified 00 Off-Policy Evaluation in Markov Decision Processes under Weak Distributional Overlap Feb 13, 2024 Off-policy evaluation
— Unverified 00 Off-Policy Evaluation in Partially Observable Environments Sep 9, 2019 Off-policy evaluation Reinforcement Learning
— Unverified 00 Off-Policy Evaluation in Partially Observed Markov Decision Processes under Sequential Ignorability Oct 24, 2021 Off-policy evaluation
— Unverified 00 Off-Policy Evaluation of Bandit Algorithm from Dependent Samples under Batch Update Policy Oct 23, 2020 Off-policy evaluation
— Unverified 00 Off-Policy Evaluation of Probabilistic Identity Data in Lookalike Modeling Jan 4, 2019 Marketing Off-policy evaluation
— Unverified 00 Off-Policy Evaluation of Slate Policies under Bayes Risk Jan 5, 2021 Off-policy evaluation
— Unverified 00 Off-Policy Evaluation via Off-Policy Classification Jun 4, 2019 Classification Deep Reinforcement Learning
— Unverified 00 Off-Policy Evaluation via the Regularized Lagrangian Jul 7, 2020 Off-policy evaluation
— Unverified 00 Off-Policy Evaluation with Policy-Dependent Optimization Response Feb 25, 2022 Causal Inference Decision Making
— Unverified 00 Off-Policy Exploitability-Evaluation in Two-Player Zero-Sum Markov Games Jul 4, 2020 Off-policy evaluation Vocal Bursts Valence Prediction
— Unverified 00 Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory Feb 10, 2022 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 00 Off-Policy Interval Estimation with Lipschitz Value Iteration Oct 29, 2020 Decision Making Medical Diagnosis
— Unverified 00 Off-Policy Risk Assessment in Contextual Bandits Apr 18, 2021 Multi-Armed Bandits Off-policy evaluation
— Unverified 00 Online Learning for Recommendations at Grubhub Jul 15, 2021 Incremental Learning Off-policy evaluation
— Unverified 00 On Minimax Optimal Offline Policy Evaluation Sep 12, 2014 Multi-Armed Bandits Off-policy evaluation
— Unverified 00 On the Curses of Future and History in Future-dependent Value Functions for Off-policy Evaluation Feb 22, 2024 Off-policy evaluation
— Unverified 00 On the Sample Complexity of Vanilla Model-Based Offline Reinforcement Learning with Dependent Samples Mar 7, 2023 Offline RL Off-policy evaluation
— Unverified 00 On Well-posedness and Minimax Optimal Rates of Nonparametric Q-function Estimation in Off-policy Evaluation Jan 17, 2022 Off-policy evaluation
— Unverified 00 OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators May 27, 2024 Decision Making Offline RL
— Unverified 00 Optimal discharge of patients from intensive care via a data-driven policy learning framework Dec 17, 2021 Management Off-policy evaluation
— Unverified 00 Optimal Mixture Weights for Off-Policy Evaluation with Multiple Behavior Policies Nov 29, 2020 Off-policy evaluation Recommendation Systems
— Unverified 00 Towards Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling Jun 8, 2019 Off-policy evaluation reinforcement-learning
— Unverified 00 Practical Marginalized Importance Sampling with the Successor Representation Jan 1, 2021 Deep Reinforcement Learning MuJoCo
— Unverified 00 Primal-Dual Spectral Representation for Off-policy Evaluation Oct 23, 2024 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 00 Privacy Preserving Off-Policy Evaluation Feb 1, 2019 Off-policy evaluation Privacy Preserving
— Unverified 00 Probabilistic Offline Policy Ranking with Approximate Bayesian Computation Dec 17, 2023 Off-policy evaluation
— Unverified 00 Quantile Off-Policy Evaluation via Deep Conditional Generative Learning Dec 29, 2022 Decision Making Off-policy evaluation
— Unverified 00 Reliable Off-policy Evaluation for Reinforcement Learning Nov 8, 2020 Decision Making Off-policy evaluation
— Unverified 00 RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation Jun 3, 2024 LEMMA Off-policy evaluation
— Unverified 00 Debiased Off-Policy Evaluation for Recommendation Systems Feb 20, 2020 counterfactual Off-policy evaluation
— Unverified 00 Safe Evaluation For Offline Learning: Are We Ready To Deploy? Dec 16, 2022 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 00 Sample Complexity of Nonparametric Off-Policy Evaluation on Low-Dimensional Manifolds using Deep Networks Jun 6, 2022 Off-policy evaluation
— Unverified 00 Sample Complexity of Preference-Based Nonparametric Off-Policy Evaluation with Deep Networks Oct 16, 2023 Off-policy evaluation reinforcement-learning
— Unverified 00 Scalable and Robust Self-Learning for Skill Routing in Large-Scale Conversational AI Systems Apr 14, 2022 Off-policy evaluation Self-Learning
— Unverified 00 Scalable and Safe Remediation of Defective Actions in Self-Learning Conversational Systems May 17, 2023 Off-policy evaluation regression
— Unverified 00 Scaling Marginalized Importance Sampling to High-Dimensional State-Spaces via State Abstraction Dec 14, 2022 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 00 Semi-gradient DICE for Offline Constrained Reinforcement Learning Jun 10, 2025 Offline RL Off-policy evaluation
— Unverified 00 STEEL: Singularity-aware Reinforcement Learning Jan 30, 2023 Off-policy evaluation reinforcement-learning
— Unverified 00 Smoothed functional-based gradient algorithms for off-policy reinforcement learning: A non-asymptotic viewpoint Jan 6, 2021 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 00 Stabilizing Temporal Difference Learning via Implicit Stochastic Recursion May 2, 2025 Computational Efficiency Off-policy evaluation
— Unverified 00 Stateful Offline Contextual Policy Evaluation and Learning Oct 19, 2021 Management Multi-Armed Bandits
— Unverified 00 Statistical Bootstrapping for Uncertainty Estimation in Off-Policy Evaluation Jul 27, 2020 continuous-control Continuous Control
— Unverified 00 Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach Sep 12, 2022 Off-policy evaluation
— Unverified 00 Statistically Efficient Variance Reduction with Double Policy Estimation for Off-Policy Evaluation in Sequence-Modeled Reinforcement Learning Aug 28, 2023 D4RL Off-policy evaluation
— Unverified 00