Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation Oct 16, 2019 Density Ratio Estimation Off-policy evaluation
— Unverified 0Doubly-Robust Off-Policy Evaluation with Estimated Logging Policy Apr 2, 2024 Multi-Armed Bandits Off-policy evaluation
— Unverified 0Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits Aug 20, 2024 Off-policy evaluation Recommendation Systems
— Unverified 0Efficient Counterfactual Learning from Bandit Feedback Sep 10, 2018 Causal Inference counterfactual
— Unverified 0Efficient Evaluation of Natural Stochastic Policies in Offline Reinforcement Learning Jun 6, 2020 Off-policy evaluation reinforcement-learning
— Unverified 0Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning Sep 12, 2019 Off-policy evaluation reinforcement-learning
— Unverified 0Efron-Stein PAC-Bayesian Inequalities Sep 4, 2019 Generalization Bounds Off-policy evaluation
— Unverified 0Emphatic TD Bellman Operator is a Contraction Aug 14, 2015 Off-policy evaluation
— Unverified 0Empowering Clinicians with Medical Decision Transformers: A Framework for Sepsis Treatment Jul 28, 2024 Off-policy evaluation reinforcement-learning
— Unverified 0Enhancing Offline Model-Based RL via Active Model Selection: A Bayesian Optimization Perspective Feb 17, 2025 Bayesian Optimization model
— Unverified 0Expected Sarsa(λ) with Control Variate for Variance Reduction Jun 25, 2019 Off-policy evaluation Reinforcement Learning
— Unverified 0Finite Sample Analysis of Minimax Offline Reinforcement Learning: Completeness, Fast Rates and First-Order Efficiency Feb 5, 2021 Off-policy evaluation reinforcement-learning
— Unverified 0Generalized Emphatic Temporal Difference Learning: Bias-Variance Analysis Sep 17, 2015 Off-policy evaluation
— Unverified 0Generalizing Off-Policy Evaluation From a Causal Perspective For Sequential Decision-Making Jan 20, 2022 counterfactual Decision Making
— Unverified 0HOPE: Human-Centric Off-Policy Evaluation for E-Learning and Healthcare Feb 18, 2023 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 0Hybrid Value Estimation for Off-policy Evaluation and Offline Reinforcement Learning Jun 4, 2022 MuJoCo Off-policy evaluation
— Unverified 0Hyperparameter Optimization Can Even be Harmful in Off-Policy Learning and How to Deal with It Apr 23, 2024 counterfactual Decision Making
— Unverified 0Inference on Time Series Nonparametric Conditional Moment Restrictions Using General Sieves Dec 31, 2022 Off-policy evaluation Time Series
— Unverified 0Infinite-Horizon Offline Reinforcement Learning with Linear Function Approximation: Curse of Dimensionality and Algorithm Mar 17, 2021 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 0Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions Feb 10, 2020 Off-policy evaluation reinforcement-learning
— Unverified 0IntOPE: Off-Policy Evaluation in the Presence of Interference Aug 24, 2024 Off-policy evaluation Recommendation Systems
— Unverified 0Large-scale Validation of Counterfactual Learning Methods: A Test-Bed Dec 1, 2016 counterfactual Off-policy evaluation
— Unverified 0Off-Policy Evaluation with Online Adaptation for Robot Exploration in Challenging Environments Apr 7, 2022 Off-policy evaluation
— Unverified 0Limit Order Book Simulation and Trade Evaluation with K-Nearest-Neighbor Resampling Sep 10, 2024 Off-policy evaluation
— Unverified 0Logarithmic Neyman Regret for Adaptive Estimation of the Average Treatment Effect Nov 21, 2024 Causal Inference Off-policy evaluation
— Unverified 0Loss Functions for Discrete Contextual Pricing with Observational Data Nov 18, 2021 Management Off-policy evaluation
— Unverified 0Marginalized Operators for Off-policy Reinforcement Learning Mar 30, 2022 Off-policy evaluation reinforcement-learning
— Unverified 0Markovian Interference in Experiments Jun 6, 2022 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 0Methodology for Interpretable Reinforcement Learning for Optimizing Mechanical Ventilation Apr 3, 2024 Off-policy evaluation reinforcement-learning
— Unverified 0Minimax Value Interval for Off-Policy Evaluation and Policy Optimization Feb 6, 2020 Efficient Exploration Off-policy evaluation
— Unverified 0Minimax Model Learning Mar 2, 2021 model Model-based Reinforcement Learning
— Unverified 0Minimax Off-Policy Evaluation for Multi-Armed Bandits Jan 19, 2021 Multi-Armed Bandits Off-policy evaluation
— Unverified 0Minimax-Optimal Off-Policy Evaluation with Linear Function Approximation Feb 21, 2020 Off-policy evaluation Reinforcement Learning
— Unverified 0Minimax Weight and Q-Function Learning for Off-Policy Evaluation Oct 28, 2019 Off-policy evaluation Reinforcement Learning
— Unverified 0Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization Oct 15, 2023 Multi-agent Reinforcement Learning Off-policy evaluation
— Unverified 0Counterfactual Mean Embeddings May 22, 2018 Causal Inference counterfactual
Code Code Available 0Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models May 14, 2019 counterfactual Management
Code Code Available 0Cross-Validated Off-Policy Evaluation May 24, 2024 Model Selection Off-policy evaluation
Code Code Available 0Off-Policy Evaluation Using Information Borrowing and Context-Based Switching Dec 18, 2021 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Minimum Empirical Divergence for Sub-Gaussian Linear Bandits Oct 31, 2024 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions Dec 11, 2023 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Batch Stationary Distribution Estimation Mar 2, 2020 Off-policy evaluation
Code Code Available 0Model-Free and Model-Based Policy Evaluation when Causality is Uncertain Apr 2, 2022 model Off-policy evaluation
Code Code Available 0Deep Jump Learning for Off-Policy Evaluation in Continuous Treatment Settings Oct 29, 2020 Change Point Detection Off-policy evaluation
Code Code Available 0Deeply-Debiased Off-Policy Interval Estimation May 10, 2021 Off-policy evaluation
Code Code Available 0Deep Proxy Causal Learning and its Application to Confounded Bandit Policy Evaluation Jun 7, 2021 Off-policy evaluation
Code Code Available 0Control Variates for Slate Off-Policy Evaluation Jun 15, 2021 Off-policy evaluation Recommendation Systems
Code Code Available 0A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation in Two-sided Markets Feb 21, 2022 Management Multi-agent Reinforcement Learning
Code Code Available 0State Relevance for Off-Policy Evaluation Sep 13, 2021 Off-policy evaluation
Code Code Available 0More Robust Doubly Robust Off-policy Evaluation Feb 10, 2018 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0