A Review of Off-Policy Evaluation in Reinforcement Learning | Dec 13, 2022 | Off-policy evaluation, reinforcement-learning | Unverified, 0
Doubly Robust Kernel Statistics for Testing Distributional Treatment Effects | Dec 9, 2022 | Causal Inference, counterfactual | Code Available, 0
Low Variance Off-policy Evaluation with State-based Importance Sampling | Dec 7, 2022 | Density Ratio Estimation, Off-policy evaluation | Code Available, 0
Counterfactual Learning with General Data-generating Policies | Dec 4, 2022 | counterfactual, Decision Making | Unverified, 0
Offline Policy Evaluation and Optimization under Confounding | Nov 29, 2022 | Offline RL, Off-policy evaluation | Unverified, 0
Policy-Adaptive Estimator Selection for Off-Policy Evaluation | Nov 25, 2022 | counterfactual, Off-policy evaluation | Code Available, 0
Counterfactual Learning with Multioutput Deep Kernels | Nov 20, 2022 | counterfactual, Counterfactual Inference | Code Available, 0
Bayesian Counterfactual Mean Embeddings and Off-Policy Evaluation | Nov 2, 2022 | counterfactual, Off-policy evaluation | Unverified, 0
Beyond the Return: Off-policy Function Estimation under User-specified Error-measuring Distributions | Oct 27, 2022 | Off-policy evaluation | Unverified, 0
Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions | Oct 24, 2022 | Metric Learning, Multi-Armed Bandits | Code Available, 0
Anytime-valid off-policy inference for contextual bandits | Oct 19, 2022 | counterfactual, Multi-Armed Bandits | Code Available, 1
Off-policy evaluation for learning-to-rank via interpolating the item-position model and the position-based model | Oct 15, 2022 | Learning-To-Rank, model | Unverified, 0
A Policy-Guided Imitation Approach for Offline Reinforcement Learning | Oct 15, 2022 | D4RL, Offline RL | Code Available, 1
Off-Policy Evaluation for Episodic Partially Observable Markov Decision Processes under Non-Parametric Models | Sep 21, 2022 | Causal Inference, Off-policy evaluation | Unverified, 0
Towards Robust Off-Policy Evaluation via Human Inputs | Sep 18, 2022 | Multi-Armed Bandits, Off-policy evaluation | Unverified, 0
Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes | Sep 16, 2022 | Decision Making, Metric Learning | Unverified, 0
On the Reuse Bias in Off-Policy Reinforcement Learning | Sep 15, 2022 | continuous-control, Continuous Control | Code Available, 0
Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach | Sep 12, 2022 | Off-policy evaluation | Unverified, 0
Future-Dependent Value-Based Off-Policy Evaluation in POMDPs | Jul 26, 2022 | Off-policy evaluation | Code Available, 0
Conformal Off-policy Prediction | Jun 14, 2022 | Conformal Prediction, Off-policy evaluation | Code Available, 0
Conformal Off-Policy Prediction in Contextual Bandits | Jun 9, 2022 | Conformal Prediction, Multi-Armed Bandits | Unverified, 0
Sample Complexity of Nonparametric Off-Policy Evaluation on Low-Dimensional Manifolds using Deep Networks | Jun 6, 2022 | Off-policy evaluation | Unverified, 0
Markovian Interference in Experiments | Jun 6, 2022 | Off-policy evaluation, Reinforcement Learning (RL) | Unverified, 0
Hybrid Value Estimation for Off-policy Evaluation and Offline Reinforcement Learning | Jun 4, 2022 | MuJoCo, Off-policy evaluation | Unverified, 0
Counterfactual Analysis in Dynamic Latent State Models | May 27, 2022 | counterfactual, Epidemiology | Unverified, 0
COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation | Apr 19, 2022 | Offline RL, Off-policy evaluation | Code Available, 1
Scalable and Robust Self-Learning for Skill Routing in Large-Scale Conversational AI Systems | Apr 14, 2022 | Off-policy evaluation, Self-Learning | Unverified, 0
Off-Policy Evaluation with Online Adaptation for Robot Exploration in Challenging Environments | Apr 7, 2022 | Off-policy evaluation | Unverified, 0
Model-Free and Model-Based Policy Evaluation when Causality is Uncertain | Apr 2, 2022 | model, Off-policy evaluation | Code Available, 0
Marginalized Operators for Off-policy Reinforcement Learning | Mar 30, 2022 | Off-policy evaluation, reinforcement-learning | Unverified, 0
Bellman Residual Orthogonalization for Offline Reinforcement Learning | Mar 24, 2022 | Offline RL, Off-policy evaluation | Unverified, 0
Off-Policy Evaluation in Embedded Spaces | Mar 5, 2022 | Density Ratio Estimation, Off-policy evaluation | Unverified, 0
Off-Policy Evaluation with Policy-Dependent Optimization Response | Feb 25, 2022 | Causal Inference, Decision Making | Unverified, 0
A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation in Two-sided Markets | Feb 21, 2022 | Management, Multi-agent Reinforcement Learning | Code Available, 0
Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning | Feb 19, 2022 | Off-policy evaluation | Code Available, 1
Off-Policy Evaluation for Large Action Spaces via Embeddings | Feb 13, 2022 | Multi-Armed Bandits, Off-policy evaluation | Code Available, 2
Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory | Feb 10, 2022 | Off-policy evaluation, Reinforcement Learning (RL) | Unverified, 0
Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model | Feb 3, 2022 | Multi-Armed Bandits, Off-policy evaluation | Code Available, 2
Generalizing Off-Policy Evaluation From a Causal Perspective For Sequential Decision-Making | Jan 20, 2022 | counterfactual, Decision Making | Unverified, 0
On Well-posedness and Minimax Optimal Rates of Nonparametric Q-function Estimation in Off-policy Evaluation | Jan 17, 2022 | Off-policy evaluation | Unverified, 0
Off-Policy Evaluation Using Information Borrowing and Context-Based Switching | Dec 18, 2021 | Multi-Armed Bandits, Off-policy evaluation | Code Available, 0
Optimal discharge of patients from intensive care via a data-driven policy learning framework | Dec 17, 2021 | Management, Off-policy evaluation | Unverified, 0
BCORLE(λ): An Offline Reinforcement Learning and Evaluation Framework for Coupons Allocation in E-commerce Market | Dec 1, 2021 | Off-policy evaluation, reinforcement-learning | Code Available, 1
Weighted model estimation for offline model-based reinforcement learning | Dec 1, 2021 | Density Ratio Estimation, model | Unverified, 0
Subgaussian and Differentiable Importance Sampling for Off-Policy Evaluation and Learning | Dec 1, 2021 | Multi-Armed Bandits, Off-policy evaluation | Code Available, 0
Loss Functions for Discrete Contextual Pricing with Observational Data | Nov 18, 2021 | Management, Off-policy evaluation | Unverified, 0
A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes | Nov 12, 2021 | Off-policy evaluation | Code Available, 0
SOPE: Spectrum of Off-Policy Estimators | Nov 6, 2021 | Decision Making, Off-policy evaluation | Code Available, 0
Proximal Reinforcement Learning: Efficient Off-Policy Evaluation in Partially Observed Markov Decision Processes | Oct 28, 2021 | Causal Inference, Management | Code Available, 0
Towards Hyperparameter-free Policy Selection for Offline Reinforcement Learning | Oct 26, 2021 | Off-policy evaluation, Open-Ended Question Answering | Code Available, 0