Off-Policy Evaluation for Large Action Spaces via Embeddings Feb 13, 2022 Multi-Armed Bandits Off-policy evaluation
Code Code Available 2Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model Feb 3, 2022 Multi-Armed Bandits Off-policy evaluation
Code Code Available 2Trajectory World Models for Heterogeneous Environments Feb 3, 2025 Diversity Model Predictive Control
Code Code Available 1SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation Nov 30, 2023 Offline RL Off-policy evaluation
Code Code Available 1Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation Nov 30, 2023 Benchmarking counterfactual
Code Code Available 1Off-Policy Evaluation of Ranking Policies under Diverse User Behavior Jun 26, 2023 Off-policy evaluation
Code Code Available 1Anytime-valid off-policy inference for contextual bandits Oct 19, 2022 counterfactual Multi-Armed Bandits
Code Code Available 1A Policy-Guided Imitation Approach for Offline Reinforcement Learning Oct 15, 2022 D4RL Offline RL
Code Code Available 1COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation Apr 19, 2022 Offline RL Off-policy evaluation
Code Code Available 1Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning Feb 19, 2022 Off-policy evaluation
Code Code Available 1BCORLE(): An Offline Reinforcement Learning and Evaluation Framework for Coupons Allocation in E-commerce Market Dec 1, 2021 Off-policy evaluation reinforcement-learning
Code Code Available 1Evaluating the Robustness of Off-Policy Evaluation Aug 31, 2021 Off-policy evaluation Recommendation Systems
Code Code Available 1Model Selection for Offline Reinforcement Learning: Practical Considerations for Healthcare Settings Jul 23, 2021 Computational Efficiency Decision Making
Code Code Available 1Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation Jun 24, 2021 Meta Reinforcement Learning Off-policy evaluation
Code Code Available 1Active Offline Policy Selection Jun 18, 2021 Bayesian Optimization Off-policy evaluation
Code Code Available 1Offline RL Without Off-Policy Evaluation Jun 16, 2021 D4RL Offline RL
Code Code Available 1A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation Jun 12, 2021 Deep Reinforcement Learning MuJoCo
Code Code Available 1Off-Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits Jun 3, 2021 Multi-Armed Bandits Off-policy evaluation
Code Code Available 1Benchmarks for Deep Off-Policy Evaluation Mar 30, 2021 Benchmarking continuous-control
Code Code Available 1Optimal Off-Policy Evaluation from Multiple Logging Policies Oct 21, 2020 Off-policy evaluation
Code Code Available 1Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation Aug 17, 2020 Off-policy evaluation
Code Code Available 1Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions Jul 25, 2020 counterfactual News Recommendation
Code Code Available 1Off-Policy Evaluation and Learning for the Future under Non-Stationarity Jun 25, 2025 Off-policy evaluation
— Unverified 0A Principled Path to Fitted Distributional Evaluation Jun 24, 2025 Atari Games Off-policy evaluation
— Unverified 0Semi-gradient DICE for Offline Constrained Reinforcement Learning Jun 10, 2025 Offline RL Off-policy evaluation
— Unverified 0STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation May 27, 2025 D4RL Denoising
— Unverified 0Characterization of Efficient Influence Function for Off-Policy Evaluation Under Optimal Policies May 20, 2025 counterfactual Off-policy evaluation
— Unverified 0Stabilizing Temporal Difference Learning via Implicit Stochastic Recursion May 2, 2025 Computational Efficiency Off-policy evaluation
— Unverified 0DOLCE: Decomposing Off-Policy Evaluation/Learning into Lagged and Current Effects May 2, 2025 Imputation Off-policy evaluation
Code Code Available 0Off-Policy Evaluation for Sequential Persuasion Process with Unobserved Confounding Apr 1, 2025 Decision Making Off-policy evaluation
— Unverified 0Enhancing Offline Model-Based RL via Active Model Selection: A Bayesian Optimization Perspective Feb 17, 2025 Bayesian Optimization model
— Unverified 0Off-Policy Evaluation for Recommendations with Missing-Not-At-Random Rewards Feb 13, 2025 Off-policy evaluation Position
— Unverified 0Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol Feb 11, 2025 Model Selection Off-policy evaluation
— Unverified 0Off-policy Evaluation for Payments at Adyen Jan 15, 2025 Benchmarking Decision Making
— Unverified 0Off-Policy Evaluation and Counterfactual Methods in Dynamic Auction Environments Jan 9, 2025 counterfactual Decision Making
— Unverified 0CANDOR: Counterfactual ANnotated DOubly Robust Off-Policy Evaluation Dec 11, 2024 counterfactual Off-policy evaluation
— Unverified 0Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning Dec 8, 2024 Off-policy evaluation
Code Code Available 0Concept-driven Off Policy Evaluation Nov 28, 2024 Off-policy evaluation
— Unverified 0Logarithmic Neyman Regret for Adaptive Estimation of the Average Treatment Effect Nov 21, 2024 Causal Inference Off-policy evaluation
— Unverified 0Off-policy estimation with adaptively collected data: the power of online learning Nov 19, 2024 Causal Inference Multi-Armed Bandits
— Unverified 0Minimum Empirical Divergence for Sub-Gaussian Linear Bandits Oct 31, 2024 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Primal-Dual Spectral Representation for Off-policy Evaluation Oct 23, 2024 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 0Abstract Reward Processes: Leveraging State Abstraction for Consistent Off-Policy Evaluation Oct 3, 2024 Autonomous Driving Off-policy evaluation
Code Code Available 0Development and Validation of Heparin Dosing Policies Using an Offline Reinforcement Learning Algorithm Sep 24, 2024 Offline RL Off-policy evaluation
— Unverified 0Designing an Interpretable Interface for Contextual Bandits Sep 23, 2024 Multi-Armed Bandits Off-policy evaluation
— Unverified 0Limit Order Book Simulation and Trade Evaluation with K-Nearest-Neighbor Resampling Sep 10, 2024 Off-policy evaluation
— Unverified 0IntOPE: Off-Policy Evaluation in the Presence of Interference Aug 24, 2024 Off-policy evaluation Recommendation Systems
— Unverified 0Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits Aug 20, 2024 Off-policy evaluation Recommendation Systems
— Unverified 0Empowering Clinicians with Medical Decision Transformers: A Framework for Sepsis Treatment Jul 28, 2024 Off-policy evaluation reinforcement-learning
— Unverified 0Causal Deepsets for Off-policy Evaluation under Spatial or Spatio-temporal Interferences Jul 25, 2024 Off-policy evaluation
Code Code Available 0