Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling May 14, 2023 Off-policy evaluation
— Unverified 0Learning Action Embeddings for Off-Policy Evaluation May 6, 2023 Off-policy evaluation
Code Code Available 0Conformal Off-Policy Evaluation in Markov Decision Processes Apr 5, 2023 Conformal Prediction Off-policy evaluation
— Unverified 0On the Sample Complexity of Vanilla Model-Based Offline Reinforcement Learning with Dependent Samples Mar 7, 2023 Offline RL Off-policy evaluation
— Unverified 0Hallucinated Adversarial Control for Conservative Offline Policy Evaluation Mar 2, 2023 continuous-control Continuous Control
Code Code Available 0Balanced Off-Policy Evaluation for Personalized Pricing Feb 24, 2023 Off-policy evaluation
Code Code Available 0HOPE: Human-Centric Off-Policy Evaluation for E-Learning and Healthcare Feb 18, 2023 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 0Post Reinforcement Learning Inference Feb 17, 2023 counterfactual Off-policy evaluation
Code Code Available 0STEEL: Singularity-aware Reinforcement Learning Jan 30, 2023 Off-policy evaluation reinforcement-learning
— Unverified 0Variational Latent Branching Model for Off-Policy Evaluation Jan 28, 2023 model Off-policy evaluation
Code Code Available 0Off-Policy Evaluation for Action-Dependent Non-Stationary Environments Jan 24, 2023 counterfactual Counterfactual Reasoning
Code Code Available 0Off-Policy Evaluation with Out-of-Sample Guarantees Jan 20, 2023 Off-policy evaluation valid
Code Code Available 0Inference on Time Series Nonparametric Conditional Moment Restrictions Using General Sieves Dec 31, 2022 Off-policy evaluation Time Series
— Unverified 0An Instrumental Variable Approach to Confounded Off-Policy Evaluation Dec 29, 2022 Decision Making Off-policy evaluation
— Unverified 0Quantile Off-Policy Evaluation via Deep Conditional Generative Learning Dec 29, 2022 Decision Making Off-policy evaluation
— Unverified 0Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information Dec 23, 2022 Decision Making Off-policy evaluation
— Unverified 0Safe Evaluation For Offline Learning: Are We Ready To Deploy? Dec 16, 2022 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 0Scaling Marginalized Importance Sampling to High-Dimensional State-Spaces via State Abstraction Dec 14, 2022 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 0A Review of Off-Policy Evaluation in Reinforcement Learning Dec 13, 2022 Off-policy evaluation reinforcement-learning
— Unverified 0Doubly Robust Kernel Statistics for Testing Distributional Treatment Effects Dec 9, 2022 Causal Inference counterfactual
Code Code Available 0Low Variance Off-policy Evaluation with State-based Importance Sampling Dec 7, 2022 Density Ratio Estimation Off-policy evaluation
Code Code Available 0Counterfactual Learning with General Data-generating Policies Dec 4, 2022 counterfactual Decision Making
— Unverified 0Offline Policy Evaluation and Optimization under Confounding Nov 29, 2022 Offline RL Off-policy evaluation
— Unverified 0Policy-Adaptive Estimator Selection for Off-Policy Evaluation Nov 25, 2022 counterfactual Off-policy evaluation
Code Code Available 0Counterfactual Learning with Multioutput Deep Kernels Nov 20, 2022 counterfactual Counterfactual Inference
Code Code Available 0Bayesian Counterfactual Mean Embeddings and Off-Policy Evaluation Nov 2, 2022 counterfactual Off-policy evaluation
— Unverified 0Beyond the Return: Off-policy Function Estimation under User-specified Error-measuring Distributions Oct 27, 2022 Off-policy evaluation
— Unverified 0Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions Oct 24, 2022 Metric Learning Multi-Armed Bandits
Code Code Available 0Off-policy evaluation for learning-to-rank via interpolating the item-position model and the position-based model Oct 15, 2022 Learning-To-Rank model
— Unverified 0Off-Policy Evaluation for Episodic Partially Observable Markov Decision Processes under Non-Parametric Models Sep 21, 2022 Causal Inference Off-policy evaluation
— Unverified 0Towards Robust Off-Policy Evaluation via Human Inputs Sep 18, 2022 Multi-Armed Bandits Off-policy evaluation
— Unverified 0Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes Sep 16, 2022 Decision Making Metric Learning
— Unverified 0On the Reuse Bias in Off-Policy Reinforcement Learning Sep 15, 2022 continuous-control Continuous Control
Code Code Available 0Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach Sep 12, 2022 Off-policy evaluation
— Unverified 0Future-Dependent Value-Based Off-Policy Evaluation in POMDPs Jul 26, 2022 Off-policy evaluation
Code Code Available 0Conformal Off-policy Prediction Jun 14, 2022 Conformal Prediction Off-policy evaluation
Code Code Available 0Conformal Off-Policy Prediction in Contextual Bandits Jun 9, 2022 Conformal Prediction Multi-Armed Bandits
— Unverified 0Sample Complexity of Nonparametric Off-Policy Evaluation on Low-Dimensional Manifolds using Deep Networks Jun 6, 2022 Off-policy evaluation
— Unverified 0Markovian Interference in Experiments Jun 6, 2022 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 0Hybrid Value Estimation for Off-policy Evaluation and Offline Reinforcement Learning Jun 4, 2022 MuJoCo Off-policy evaluation
— Unverified 0Counterfactual Analysis in Dynamic Latent State Models May 27, 2022 counterfactual Epidemiology
— Unverified 0Scalable and Robust Self-Learning for Skill Routing in Large-Scale Conversational AI Systems Apr 14, 2022 Off-policy evaluation Self-Learning
— Unverified 0Off-Policy Evaluation with Online Adaptation for Robot Exploration in Challenging Environments Apr 7, 2022 Off-policy evaluation
— Unverified 0Model-Free and Model-Based Policy Evaluation when Causality is Uncertain Apr 2, 2022 model Off-policy evaluation
Code Code Available 0Marginalized Operators for Off-policy Reinforcement Learning Mar 30, 2022 Off-policy evaluation reinforcement-learning
— Unverified 0Bellman Residual Orthogonalization for Offline Reinforcement Learning Mar 24, 2022 Offline RL Off-policy evaluation
— Unverified 0Off-Policy Evaluation in Embedded Spaces Mar 5, 2022 Density Ratio Estimation Off-policy evaluation
— Unverified 0Off-Policy Evaluation with Policy-Dependent Optimization Response Feb 25, 2022 Causal Inference Decision Making
— Unverified 0A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation in Two-sided Markets Feb 21, 2022 Management Multi-agent Reinforcement Learning
Code Code Available 0Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory Feb 10, 2022 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 0