Off-Policy Evaluation and Learning for the Future under Non-Stationarity Jun 25, 2025 Off-policy evaluation
— Unverified 0A Principled Path to Fitted Distributional Evaluation Jun 24, 2025 Atari Games Off-policy evaluation
— Unverified 0Semi-gradient DICE for Offline Constrained Reinforcement Learning Jun 10, 2025 Offline RL Off-policy evaluation
— Unverified 0STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation May 27, 2025 D4RL Denoising
— Unverified 0Characterization of Efficient Influence Function for Off-Policy Evaluation Under Optimal Policies May 20, 2025 counterfactual Off-policy evaluation
— Unverified 0DOLCE: Decomposing Off-Policy Evaluation/Learning into Lagged and Current Effects May 2, 2025 Imputation Off-policy evaluation
Code Code Available 0Stabilizing Temporal Difference Learning via Implicit Stochastic Recursion May 2, 2025 Computational Efficiency Off-policy evaluation
— Unverified 0Off-Policy Evaluation for Sequential Persuasion Process with Unobserved Confounding Apr 1, 2025 Decision Making Off-policy evaluation
— Unverified 0Enhancing Offline Model-Based RL via Active Model Selection: A Bayesian Optimization Perspective Feb 17, 2025 Bayesian Optimization model
— Unverified 0Off-Policy Evaluation for Recommendations with Missing-Not-At-Random Rewards Feb 13, 2025 Off-policy evaluation Position
— Unverified 0Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol Feb 11, 2025 Model Selection Off-policy evaluation
— Unverified 0Trajectory World Models for Heterogeneous Environments Feb 3, 2025 Diversity Model Predictive Control
Code Code Available 1Off-policy Evaluation for Payments at Adyen Jan 15, 2025 Benchmarking Decision Making
— Unverified 0Off-Policy Evaluation and Counterfactual Methods in Dynamic Auction Environments Jan 9, 2025 counterfactual Decision Making
— Unverified 0CANDOR: Counterfactual ANnotated DOubly Robust Off-Policy Evaluation Dec 11, 2024 counterfactual Off-policy evaluation
— Unverified 0Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning Dec 8, 2024 Off-policy evaluation
Code Code Available 0Concept-driven Off Policy Evaluation Nov 28, 2024 Off-policy evaluation
— Unverified 0Logarithmic Neyman Regret for Adaptive Estimation of the Average Treatment Effect Nov 21, 2024 Causal Inference Off-policy evaluation
— Unverified 0Off-policy estimation with adaptively collected data: the power of online learning Nov 19, 2024 Causal Inference Multi-Armed Bandits
— Unverified 0Minimum Empirical Divergence for Sub-Gaussian Linear Bandits Oct 31, 2024 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Primal-Dual Spectral Representation for Off-policy Evaluation Oct 23, 2024 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 0Abstract Reward Processes: Leveraging State Abstraction for Consistent Off-Policy Evaluation Oct 3, 2024 Autonomous Driving Off-policy evaluation
Code Code Available 0Development and Validation of Heparin Dosing Policies Using an Offline Reinforcement Learning Algorithm Sep 24, 2024 Offline RL Off-policy evaluation
— Unverified 0Designing an Interpretable Interface for Contextual Bandits Sep 23, 2024 Multi-Armed Bandits Off-policy evaluation
— Unverified 0Limit Order Book Simulation and Trade Evaluation with K-Nearest-Neighbor Resampling Sep 10, 2024 Off-policy evaluation
— Unverified 0IntOPE: Off-Policy Evaluation in the Presence of Interference Aug 24, 2024 Off-policy evaluation Recommendation Systems
— Unverified 0Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits Aug 20, 2024 Off-policy evaluation Recommendation Systems
— Unverified 0Empowering Clinicians with Medical Decision Transformers: A Framework for Sepsis Treatment Jul 28, 2024 Off-policy evaluation reinforcement-learning
— Unverified 0Causal Deepsets for Off-policy Evaluation under Spatial or Spatio-temporal Interferences Jul 25, 2024 Off-policy evaluation
Code Code Available 0Balancing Immediate Revenue and Future Off-Policy Evaluation in Coupon Allocation Jul 6, 2024 Off-policy evaluation
— Unverified 0Off-policy Evaluation with Deeply-abstracted States Jun 27, 2024 Off-policy evaluation
Code Code Available 0Confident Natural Policy Gradient for Local Planning in q_π-realizable Constrained MDPs Jun 26, 2024 Off-policy evaluation
— Unverified 0Automated Off-Policy Estimator Selection via Supervised Learning Jun 26, 2024 counterfactual Off-policy evaluation
— Unverified 0Off-Policy Evaluation from Logged Human Feedback Jun 14, 2024 Off-policy evaluation
— Unverified 0RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation Jun 3, 2024 LEMMA Off-policy evaluation
— Unverified 0A Fast Convergence Theory for Offline Decision Making Jun 3, 2024 Decision Making Offline RL
— Unverified 0Kernel Metric Learning for In-Sample Off-Policy Evaluation of Deterministic RL Policies May 29, 2024 Metric Learning Off-policy evaluation
Code Code Available 0OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators May 27, 2024 Decision Making Offline RL
— Unverified 0Cross-Validated Off-Policy Evaluation May 24, 2024 Model Selection Off-policy evaluation
Code Code Available 0Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and Learning May 23, 2024 Off-policy evaluation
Code Code Available 0Long-term Off-Policy Evaluation and Learning Apr 24, 2024 Off-policy evaluation
Code Code Available 0Hyperparameter Optimization Can Even be Harmful in Off-Policy Learning and How to Deal with It Apr 23, 2024 counterfactual Decision Making
— Unverified 0Data Poisoning Attacks on Off-Policy Policy Evaluation Methods Apr 6, 2024 Data Poisoning Off-policy evaluation
— Unverified 0Methodology for Interpretable Reinforcement Learning for Optimizing Mechanical Ventilation Apr 3, 2024 Off-policy evaluation reinforcement-learning
— Unverified 0Doubly-Robust Off-Policy Evaluation with Estimated Logging Policy Apr 2, 2024 Multi-Armed Bandits Off-policy evaluation
— Unverified 0Predictive Performance Comparison of Decision Policies Under Confounding Apr 1, 2024 Causal Inference Decision Making
Code Code Available 0Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision Processes Mar 29, 2024 Off-policy evaluation
Code Code Available 0Cramming Contextual Bandits for On-policy Statistical Evaluation Mar 11, 2024 Multi-Armed Bandits Off-policy evaluation
— Unverified 0Bayesian Off-Policy Evaluation and Learning for Large Action Spaces Feb 22, 2024 Computational Efficiency Off-policy evaluation
— Unverified 0On the Curses of Future and History in Future-dependent Value Functions for Off-policy Evaluation Feb 22, 2024 Off-policy evaluation
— Unverified 0