Data-Driven Off-Policy Estimator Selection: An Application in User Marketing on An Online Content Delivery Service Sep 17, 2021 Decision Making Marketing
— Unverified 00 Data Poisoning Attacks on Off-Policy Policy Evaluation Methods Apr 6, 2024 Data Poisoning Off-policy evaluation
— Unverified 00 Debiasing Samples from Online Learning Using Bootstrap Jul 31, 2021 Off-policy evaluation Thompson Sampling
— Unverified 00 Deep Jump Q-Evaluation for Offline Policy Evaluation in Continuous Action Space Sep 28, 2020 Off-policy evaluation Q-Learning
— Unverified 00 Defining Admissible Rewards for High Confidence Policy Evaluation May 30, 2019 Off-policy evaluation Reinforcement Learning
— Unverified 00 Designing an Interpretable Interface for Contextual Bandits Sep 23, 2024 Multi-Armed Bandits Off-policy evaluation
— Unverified 00 Development and Validation of Heparin Dosing Policies Using an Offline Reinforcement Learning Algorithm Sep 24, 2024 Offline RL Off-policy evaluation
— Unverified 00 Discovering an Aid Policy to Minimize Student Evasion Using Offline Reinforcement Learning Apr 20, 2021 Clustering Decision Making
— Unverified 00 Distributional Shift-Aware Off-Policy Interval Estimation: A Unified Error Quantification Framework Sep 23, 2023 Off-policy evaluation
— Unverified 00 Double/Debiased Machine Learning for Dynamic Treatment Effects via g-Estimation Feb 17, 2020 BIG-bench Machine Learning Model Selection
— Unverified 00 Double Reinforcement Learning for Efficient and Robust Off-Policy Evaluation Jan 1, 2020 Off-policy evaluation reinforcement-learning
— Unverified 00 Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation Oct 16, 2019 Density Ratio Estimation Off-policy evaluation
— Unverified 00 Doubly-Robust Off-Policy Evaluation with Estimated Logging Policy Apr 2, 2024 Multi-Armed Bandits Off-policy evaluation
— Unverified 00 Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits Aug 20, 2024 Off-policy evaluation Recommendation Systems
— Unverified 00 Efficient Counterfactual Learning from Bandit Feedback Sep 10, 2018 Causal Inference counterfactual
— Unverified 00 Efficient Evaluation of Natural Stochastic Policies in Offline Reinforcement Learning Jun 6, 2020 Off-policy evaluation reinforcement-learning
— Unverified 00 Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning Sep 12, 2019 Off-policy evaluation reinforcement-learning
— Unverified 00 Efron-Stein PAC-Bayesian Inequalities Sep 4, 2019 Generalization Bounds Off-policy evaluation
— Unverified 00 Emphatic TD Bellman Operator is a Contraction Aug 14, 2015 Off-policy evaluation
— Unverified 00 Empowering Clinicians with Medical Decision Transformers: A Framework for Sepsis Treatment Jul 28, 2024 Off-policy evaluation reinforcement-learning
— Unverified 00 Enhancing Offline Model-Based RL via Active Model Selection: A Bayesian Optimization Perspective Feb 17, 2025 Bayesian Optimization model
— Unverified 00 Expected Sarsa(λ) with Control Variate for Variance Reduction Jun 25, 2019 Off-policy evaluation Reinforcement Learning
— Unverified 00 Finite Sample Analysis of Minimax Offline Reinforcement Learning: Completeness, Fast Rates and First-Order Efficiency Feb 5, 2021 Off-policy evaluation reinforcement-learning
— Unverified 00 Generalized Emphatic Temporal Difference Learning: Bias-Variance Analysis Sep 17, 2015 Off-policy evaluation
— Unverified 00 Generalizing Off-Policy Evaluation From a Causal Perspective For Sequential Decision-Making Jan 20, 2022 counterfactual Decision Making
— Unverified 00 HOPE: Human-Centric Off-Policy Evaluation for E-Learning and Healthcare Feb 18, 2023 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 00 Hybrid Value Estimation for Off-policy Evaluation and Offline Reinforcement Learning Jun 4, 2022 MuJoCo Off-policy evaluation
— Unverified 00 Hyperparameter Optimization Can Even be Harmful in Off-Policy Learning and How to Deal with It Apr 23, 2024 counterfactual Decision Making
— Unverified 00 Inference on Time Series Nonparametric Conditional Moment Restrictions Using General Sieves Dec 31, 2022 Off-policy evaluation Time Series
— Unverified 00 Infinite-Horizon Offline Reinforcement Learning with Linear Function Approximation: Curse of Dimensionality and Algorithm Mar 17, 2021 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 00 Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions Feb 10, 2020 Off-policy evaluation reinforcement-learning
— Unverified 00 IntOPE: Off-Policy Evaluation in the Presence of Interference Aug 24, 2024 Off-policy evaluation Recommendation Systems
— Unverified 00 Large-scale Validation of Counterfactual Learning Methods: A Test-Bed Dec 1, 2016 counterfactual Off-policy evaluation
— Unverified 00 Off-Policy Evaluation with Online Adaptation for Robot Exploration in Challenging Environments Apr 7, 2022 Off-policy evaluation
— Unverified 00 Limit Order Book Simulation and Trade Evaluation with K-Nearest-Neighbor Resampling Sep 10, 2024 Off-policy evaluation
— Unverified 00 Logarithmic Neyman Regret for Adaptive Estimation of the Average Treatment Effect Nov 21, 2024 Causal Inference Off-policy evaluation
— Unverified 00 Loss Functions for Discrete Contextual Pricing with Observational Data Nov 18, 2021 Management Off-policy evaluation
— Unverified 00 Marginalized Operators for Off-policy Reinforcement Learning Mar 30, 2022 Off-policy evaluation reinforcement-learning
— Unverified 00 Markovian Interference in Experiments Jun 6, 2022 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 00 Methodology for Interpretable Reinforcement Learning for Optimizing Mechanical Ventilation Apr 3, 2024 Off-policy evaluation reinforcement-learning
— Unverified 00 Minimax Value Interval for Off-Policy Evaluation and Policy Optimization Feb 6, 2020 Efficient Exploration Off-policy evaluation
— Unverified 00 Minimax Model Learning Mar 2, 2021 model Model-based Reinforcement Learning
— Unverified 00 Minimax Off-Policy Evaluation for Multi-Armed Bandits Jan 19, 2021 Multi-Armed Bandits Off-policy evaluation
— Unverified 00 Minimax-Optimal Off-Policy Evaluation with Linear Function Approximation Feb 21, 2020 Off-policy evaluation Reinforcement Learning
— Unverified 00 Minimax Weight and Q-Function Learning for Off-Policy Evaluation Oct 28, 2019 Off-policy evaluation Reinforcement Learning
— Unverified 00 Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization Oct 15, 2023 Multi-agent Reinforcement Learning Off-policy evaluation
— Unverified 00 Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol Feb 11, 2025 Model Selection Off-policy evaluation
— Unverified 00 More Efficient Off-Policy Evaluation through Regularized Targeted Learning Dec 13, 2019 Causal Inference Off-policy evaluation
— Unverified 00 Non-asymptotic Confidence Intervals of Off-policy Evaluation: Primal and Dual Bounds Mar 9, 2021 Off-policy evaluation Open-Ended Question Answering
— Unverified 00 Offline Comparison of Ranking Functions using Randomized Data Oct 11, 2018 Off-policy evaluation
— Unverified 00