STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation May 27, 2025 D4RL Denoising
— Unverified 00 Task Selection Policies for Multitask Learning Jul 14, 2019 counterfactual Natural Language Understanding
— Unverified 00 Taylor Expansion Policy Optimization Mar 13, 2020 Off-policy evaluation reinforcement-learning
— Unverified 00 Cramming Contextual Bandits for On-policy Statistical Evaluation Mar 11, 2024 Multi-Armed Bandits Off-policy evaluation
— Unverified 00 The Optimal Approximation Factors in Misspecified Off-Policy Value Function Estimation Jul 25, 2023 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 00 Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes Sep 16, 2022 Decision Making Metric Learning
— Unverified 00 Towards Robust Off-Policy Evaluation via Human Inputs Sep 18, 2022 Multi-Armed Bandits Off-policy evaluation
— Unverified 00 Triply Robust Off-Policy Evaluation Nov 13, 2019 Multi-Armed Bandits Off-policy evaluation
— Unverified 00 Unbiased Offline Evaluation for Learning to Rank with Business Rules Nov 3, 2023 Learning-To-Rank Off-policy evaluation
— Unverified 00 Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling Oct 15, 2019 Off-policy evaluation Reinforcement Learning
— Unverified 00 Variance-Aware Off-Policy Evaluation with Linear Function Approximation Jun 22, 2021 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 00 Wasserstein Distributionally Robust Policy Evaluation and Learning for Contextual Bandits Sep 15, 2023 Multi-Armed Bandits Off-policy evaluation
— Unverified 00 Weighted model estimation for offline model-based reinforcement learning Dec 1, 2021 Density Ratio Estimation model
— Unverified 00 Why Should I Trust You, Bellman? Evaluating the Bellman Objective with Off-Policy Data Sep 29, 2021 Deep Reinforcement Learning Off-policy evaluation
— Unverified 00 Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation Sep 17, 2021 Decision Making Offline RL
— Unverified 00 Accountable Off-Policy Evaluation via a Kernelized Bellman Statistics Jan 1, 2020 Off-policy evaluation
— Unverified 00 Accountable Off-Policy Evaluation With Kernel Bellman Statistics Aug 15, 2020 Medical Diagnosis Off-policy evaluation
— Unverified 00 Adaptive Trade-Offs in Off-Policy Learning Oct 16, 2019 Off-policy evaluation reinforcement-learning
— Unverified 00 A maximum-entropy approach to off-policy evaluation in average-reward MDPs Jun 17, 2020 Off-policy evaluation
— Unverified 00 An Instrumental Variable Approach to Confounded Off-Policy Evaluation Dec 29, 2022 Decision Making Off-policy evaluation
— Unverified 00 A Practical Guide of Off-Policy Evaluation for Bandit Problems Oct 23, 2020 Off-policy evaluation
— Unverified 00 A Principled Path to Fitted Distributional Evaluation Jun 24, 2025 Atari Games Off-policy evaluation
— Unverified 00 A Review of Off-Policy Evaluation in Reinforcement Learning Dec 13, 2022 Off-policy evaluation reinforcement-learning
— Unverified 00 A Spectral Approach to Off-Policy Evaluation for POMDPs Sep 22, 2021 Causal Identification Off-policy evaluation
— Unverified 00 Asymptotically Efficient Off-Policy Evaluation for Tabular Reinforcement Learning Jan 29, 2020 Off-policy evaluation reinforcement-learning
— Unverified 00 A Fast Convergence Theory for Offline Decision Making Jun 3, 2024 Decision Making Offline RL
— Unverified 00 A Unified Off-Policy Evaluation Approach for General Value Function Jul 6, 2021 Anomaly Detection Off-policy evaluation
— Unverified 00 Automated Off-Policy Estimator Selection via Supervised Learning Jun 26, 2024 counterfactual Off-policy evaluation
— Unverified 00 Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization Apr 28, 2021 continuous-control Continuous Control
— Unverified 00 Balancing Immediate Revenue and Future Off-Policy Evaluation in Coupon Allocation Jul 6, 2024 Off-policy evaluation
— Unverified 00 Bayesian Counterfactual Mean Embeddings and Off-Policy Evaluation Nov 2, 2022 counterfactual Off-policy evaluation
— Unverified 00 Bayesian Off-Policy Evaluation and Learning for Large Action Spaces Feb 22, 2024 Computational Efficiency Off-policy evaluation
— Unverified 00 Bellman Residual Orthogonalization for Offline Reinforcement Learning Mar 24, 2022 Offline RL Off-policy evaluation
— Unverified 00 Beyond the Return: Off-policy Function Estimation under User-specified Error-measuring Distributions Oct 27, 2022 Off-policy evaluation
— Unverified 00 Bootstrapping Fitted Q-Evaluation for Off-Policy Inference Feb 6, 2021 Off-policy evaluation
— Unverified 00 Bootstrapping with Models: Confidence Intervals for Off-Policy Evaluation Jun 20, 2016 Off-policy evaluation
— Unverified 00 CANDOR: Counterfactual ANnotated DOubly Robust Off-Policy Evaluation Dec 11, 2024 counterfactual Off-policy evaluation
— Unverified 00 Causality and Batch Reinforcement Learning: Complementary Approaches To Planning In Unknown Domains Jun 3, 2020 Autonomous Driving Causal Inference
— Unverified 00 Characterization of Efficient Influence Function for Off-Policy Evaluation Under Optimal Policies May 20, 2025 counterfactual Off-policy evaluation
— Unverified 00 CoinDICE: Off-Policy Confidence Interval Estimation Oct 22, 2020 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 00 Combining Parametric and Nonparametric Models for Off-Policy Evaluation May 14, 2019 Mixture-of-Experts Off-policy evaluation
— Unverified 00 Concept-driven Off Policy Evaluation Nov 28, 2024 Off-policy evaluation
— Unverified 00 Confidence Interval for Off-Policy Evaluation from Dependent Samples via Bandit Algorithm: Approach from Standardized Martingales Jun 12, 2020 Off-policy evaluation
— Unverified 00 Confident Natural Policy Gradient for Local Planning in q_π-realizable Constrained MDPs Jun 26, 2024 Off-policy evaluation
— Unverified 00 Conformal Off-Policy Evaluation in Markov Decision Processes Apr 5, 2023 Conformal Prediction Off-policy evaluation
— Unverified 00 Conformal Off-Policy Prediction in Contextual Bandits Jun 9, 2022 Conformal Prediction Multi-Armed Bandits
— Unverified 00 Confounding-Robust Policy Evaluation in Infinite-Horizon Reinforcement Learning Feb 11, 2020 Off-policy evaluation reinforcement-learning
— Unverified 00 Consistent On-Line Off-Policy Evaluation Feb 23, 2017 Off-policy evaluation
— Unverified 00 Counterfactual Analysis in Dynamic Latent State Models May 27, 2022 counterfactual Epidemiology
— Unverified 00 Counterfactual Learning with General Data-generating Policies Dec 4, 2022 counterfactual Decision Making
— Unverified 00