Deep Jump Q-Evaluation for Offline Policy Evaluation in Continuous Action Space Sep 28, 2020 Off-policy evaluation Q-Learning
— Unverified 0Accountable Off-Policy Evaluation With Kernel Bellman Statistics Aug 15, 2020 Medical Diagnosis Off-policy evaluation
— Unverified 0Statistical Bootstrapping for Uncertainty Estimation in Off-Policy Evaluation Jul 27, 2020 continuous-control Continuous Control
— Unverified 0Off-policy Evaluation in Infinite-Horizon Reinforcement Learning with Latent Confounders Jul 27, 2020 Off-policy evaluation reinforcement-learning
— Unverified 0Off-Policy Evaluation via the Regularized Lagrangian Jul 7, 2020 Off-policy evaluation
— Unverified 0Off-Policy Exploitability-Evaluation in Two-Player Zero-Sum Markov Games Jul 4, 2020 Off-policy evaluation Vocal Bursts Valence Prediction
— Unverified 0Strictly Batch Imitation Learning by Energy-based Distribution Matching Jun 25, 2020 Imitation Learning Off-policy evaluation
Code Code Available 0Confident Off-Policy Evaluation and Selection through Self-Normalized Importance Weighting Jun 18, 2020 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0A maximum-entropy approach to off-policy evaluation in average-reward MDPs Jun 17, 2020 Off-policy evaluation
— Unverified 0Confidence Interval for Off-Policy Evaluation from Dependent Samples via Bandit Algorithm: Approach from Standardized Martingales Jun 12, 2020 Off-policy evaluation
— Unverified 0Efficient Evaluation of Natural Stochastic Policies in Offline Reinforcement Learning Jun 6, 2020 Off-policy evaluation reinforcement-learning
— Unverified 0Causality and Batch Reinforcement Learning: Complementary Approaches To Planning In Unknown Domains Jun 3, 2020 Autonomous Driving Causal Inference
— Unverified 0Taylor Expansion Policy Optimization Mar 13, 2020 Off-policy evaluation reinforcement-learning
— Unverified 0Batch Stationary Distribution Estimation Mar 2, 2020 Off-policy evaluation
Code Code Available 0Off-Policy Evaluation and Learning for External Validity under a Covariate Shift Feb 26, 2020 Off-policy evaluation
Code Code Available 0Minimax-Optimal Off-Policy Evaluation with Linear Function Approximation Feb 21, 2020 Off-policy evaluation Reinforcement Learning
— Unverified 0Debiased Off-Policy Evaluation for Recommendation Systems Feb 20, 2020 counterfactual Off-policy evaluation
— Unverified 0Adaptive Estimator Selection for Off-Policy Evaluation Feb 18, 2020 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Double/Debiased Machine Learning for Dynamic Treatment Effects via g-Estimation Feb 17, 2020 BIG-bench Machine Learning Model Selection
— Unverified 0Confounding-Robust Policy Evaluation in Infinite-Horizon Reinforcement Learning Feb 11, 2020 Off-policy evaluation reinforcement-learning
— Unverified 0Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions Feb 10, 2020 Off-policy evaluation reinforcement-learning
— Unverified 0Minimax Value Interval for Off-Policy Evaluation and Policy Optimization Feb 6, 2020 Efficient Exploration Off-policy evaluation
— Unverified 0Safe Exploration for Optimizing Contextual Bandits Feb 2, 2020 counterfactual Information Retrieval
Code Code Available 0Asymptotically Efficient Off-Policy Evaluation for Tabular Reinforcement Learning Jan 29, 2020 Off-policy evaluation reinforcement-learning
— Unverified 0Double Reinforcement Learning for Efficient and Robust Off-Policy Evaluation Jan 1, 2020 Off-policy evaluation reinforcement-learning
— Unverified 0Accountable Off-Policy Evaluation via a Kernelized Bellman Statistics Jan 1, 2020 Off-policy evaluation
— Unverified 0More Efficient Off-Policy Evaluation through Regularized Targeted Learning Dec 13, 2019 Causal Inference Off-policy evaluation
— Unverified 0Triply Robust Off-Policy Evaluation Nov 13, 2019 Multi-Armed Bandits Off-policy evaluation
— Unverified 0Minimax Weight and Q-Function Learning for Off-Policy Evaluation Oct 28, 2019 Off-policy evaluation Reinforcement Learning
— Unverified 0From Importance Sampling to Doubly Robust Policy Gradient Oct 20, 2019 Off-policy evaluation
Code Code Available 0Adaptive Trade-Offs in Off-Policy Learning Oct 16, 2019 Off-policy evaluation reinforcement-learning
— Unverified 0Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation Oct 16, 2019 Density Ratio Estimation Off-policy evaluation
— Unverified 0Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling Oct 15, 2019 Off-policy evaluation Reinforcement Learning
— Unverified 0Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning Sep 12, 2019 Off-policy evaluation reinforcement-learning
— Unverified 0Off-Policy Evaluation in Partially Observable Environments Sep 9, 2019 Off-policy evaluation Reinforcement Learning
— Unverified 0Efron-Stein PAC-Bayesian Inequalities Sep 4, 2019 Generalization Bounds Off-policy evaluation
— Unverified 0Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes Aug 22, 2019 Off-policy evaluation reinforcement-learning
Code Code Available 0Doubly robust off-policy evaluation with shrinkage Jul 22, 2019 Model Selection Multi-Armed Bandits
Code Code Available 0Task Selection Policies for Multitask Learning Jul 14, 2019 counterfactual Natural Language Understanding
— Unverified 0Expected Sarsa(λ) with Control Variate for Variance Reduction Jun 25, 2019 Off-policy evaluation Reinforcement Learning
— Unverified 0Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for Reinforcement Learning Jun 9, 2019 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Balanced off-policy evaluation in general action spaces Jun 9, 2019 Binary Classification counterfactual
Code Code Available 0Towards Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling Jun 8, 2019 Off-policy evaluation reinforcement-learning
— Unverified 0Off-Policy Evaluation via Off-Policy Classification Jun 4, 2019 Classification Deep Reinforcement Learning
— Unverified 0Defining Admissible Rewards for High Confidence Policy Evaluation May 30, 2019 Off-policy evaluation Reinforcement Learning
— Unverified 0Semi-Parametric Efficient Policy Learning with Continuous Actions May 24, 2019 Off-policy evaluation
Code Code Available 0Combining Parametric and Nonparametric Models for Off-Policy Evaluation May 14, 2019 Mixture-of-Experts Off-policy evaluation
— Unverified 0Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models May 14, 2019 counterfactual Management
Code Code Available 0Privacy Preserving Off-Policy Evaluation Feb 1, 2019 Off-policy evaluation Privacy Preserving
— Unverified 0Off-Policy Evaluation of Probabilistic Identity Data in Lookalike Modeling Jan 4, 2019 Marketing Off-policy evaluation
— Unverified 0