Off-Policy Evaluation in Markov Decision Processes under Weak Distributional Overlap Feb 13, 2024 Off-policy evaluation
— Unverified 0Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction Feb 3, 2024 Marketing Multi-Armed Bandits
Code Code Available 0Distributional Off-policy Evaluation with Bellman Residual Minimization Feb 2, 2024 Distributional Reinforcement Learning Off-policy evaluation
Code Code Available 0Probabilistic Offline Policy Ranking with Approximate Bayesian Computation Dec 17, 2023 Off-policy evaluation
— Unverified 0RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions Dec 11, 2023 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits Dec 3, 2023 Causal Inference Multi-Armed Bandits
Code Code Available 0Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation Nov 30, 2023 Benchmarking counterfactual
Code Code Available 1SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation Nov 30, 2023 Offline RL Off-policy evaluation
Code Code Available 1When is Off-Policy Evaluation (Reward Modeling) Useful in Contextual Bandits? A Data-Centric Perspective Nov 23, 2023 Large Language Model Multi-Armed Bandits
Code Code Available 0Unbiased Offline Evaluation for Learning to Rank with Business Rules Nov 3, 2023 Learning-To-Rank Off-policy evaluation
— Unverified 0Robust Offline Reinforcement learning with Heavy-Tailed Rewards Oct 28, 2023 Offline RL Off-policy evaluation
Code Code Available 0State-Action Similarity-Based Representations for Off-Policy Evaluation Oct 27, 2023 Off-policy evaluation Representation Learning
Code Code Available 0Counterfactual-Augmented Importance Sampling for Semi-Offline Policy Evaluation Oct 26, 2023 counterfactual Off-policy evaluation
Code Code Available 0Off-Policy Evaluation for Large Action Spaces via Policy Convolution Oct 24, 2023 Multi-Armed Bandits Off-policy evaluation
— Unverified 0Sample Complexity of Preference-Based Nonparametric Off-Policy Evaluation with Deep Networks Oct 16, 2023 Off-policy evaluation reinforcement-learning
— Unverified 0Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization Oct 15, 2023 Multi-agent Reinforcement Learning Off-policy evaluation
— Unverified 0Off-Policy Evaluation for Human Feedback Oct 11, 2023 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 0Distributional Shift-Aware Off-Policy Interval Estimation: A Unified Error Quantification Framework Sep 23, 2023 Off-policy evaluation
— Unverified 0Wasserstein Distributionally Robust Policy Evaluation and Learning for Contextual Bandits Sep 15, 2023 Multi-Armed Bandits Off-policy evaluation
— Unverified 0Statistically Efficient Variance Reduction with Double Policy Estimation for Off-Policy Evaluation in Sequence-Modeled Reinforcement Learning Aug 28, 2023 D4RL Off-policy evaluation
— Unverified 0Distributional Off-Policy Evaluation for Slate Recommendations Aug 27, 2023 Fairness Off-policy evaluation
Code Code Available 0Doubly Robust Estimator for Off-Policy Evaluation with Large Action Spaces Aug 7, 2023 Off-policy evaluation
Code Code Available 0On (Normalised) Discounted Cumulative Gain as an Off-Policy Evaluation Metric for Top-n Recommendation Jul 27, 2023 Information Retrieval Off-policy evaluation
Code Code Available 0The Optimal Approximation Factors in Misspecified Off-Policy Value Function Estimation Jul 25, 2023 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 0Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning Jul 21, 2023 Decision Making Deep Reinforcement Learning
Code Code Available 0Leveraging Factored Action Spaces for Off-Policy Evaluation Jul 13, 2023 counterfactual Off-policy evaluation
Code Code Available 0Off-Policy Evaluation of Ranking Policies under Diverse User Behavior Jun 26, 2023 Off-policy evaluation
Code Code Available 1Off-policy Evaluation in Doubly Inhomogeneous Environments Jun 14, 2023 Offline RL Off-policy evaluation
Code Code Available 0K-Nearest-Neighbor Resampling for Off-Policy Evaluation in Stochastic Control Jun 7, 2023 counterfactual Off-policy evaluation
Code Code Available 0Counterfactual Evaluation of Peer-Review Assignment Policies May 27, 2023 counterfactual Off-policy evaluation
Code Code Available 0Scalable and Safe Remediation of Defective Actions in Self-Learning Conversational Systems May 17, 2023 Off-policy evaluation regression
— Unverified 0Human Choice Prediction in Language-based Persuasion Games: Simulation-based Off-Policy Evaluation May 17, 2023 Decision Making Off-policy evaluation
Code Code Available 0Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling May 14, 2023 Off-policy evaluation
— Unverified 0Learning Action Embeddings for Off-Policy Evaluation May 6, 2023 Off-policy evaluation
Code Code Available 0Conformal Off-Policy Evaluation in Markov Decision Processes Apr 5, 2023 Conformal Prediction Off-policy evaluation
— Unverified 0On the Sample Complexity of Vanilla Model-Based Offline Reinforcement Learning with Dependent Samples Mar 7, 2023 Offline RL Off-policy evaluation
— Unverified 0Hallucinated Adversarial Control for Conservative Offline Policy Evaluation Mar 2, 2023 continuous-control Continuous Control
Code Code Available 0Balanced Off-Policy Evaluation for Personalized Pricing Feb 24, 2023 Off-policy evaluation
Code Code Available 0HOPE: Human-Centric Off-Policy Evaluation for E-Learning and Healthcare Feb 18, 2023 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 0Post Reinforcement Learning Inference Feb 17, 2023 counterfactual Off-policy evaluation
Code Code Available 0STEEL: Singularity-aware Reinforcement Learning Jan 30, 2023 Off-policy evaluation reinforcement-learning
— Unverified 0Variational Latent Branching Model for Off-Policy Evaluation Jan 28, 2023 model Off-policy evaluation
Code Code Available 0Off-Policy Evaluation for Action-Dependent Non-Stationary Environments Jan 24, 2023 counterfactual Counterfactual Reasoning
Code Code Available 0Off-Policy Evaluation with Out-of-Sample Guarantees Jan 20, 2023 Off-policy evaluation valid
Code Code Available 0Inference on Time Series Nonparametric Conditional Moment Restrictions Using General Sieves Dec 31, 2022 Off-policy evaluation Time Series
— Unverified 0An Instrumental Variable Approach to Confounded Off-Policy Evaluation Dec 29, 2022 Decision Making Off-policy evaluation
— Unverified 0Quantile Off-Policy Evaluation via Deep Conditional Generative Learning Dec 29, 2022 Decision Making Off-policy evaluation
— Unverified 0Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information Dec 23, 2022 Decision Making Off-policy evaluation
— Unverified 0Safe Evaluation For Offline Learning: Are We Ready To Deploy? Dec 16, 2022 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 0Scaling Marginalized Importance Sampling to High-Dimensional State-Spaces via State Abstraction Dec 14, 2022 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 0