
Off-policy evaluation

Off-policy Evaluation (OPE), or offline evaluation more generally, estimates the performance of hypothetical policies using only offline log data. It is particularly useful in applications where online interaction is high-stakes or expensive, such as precision medicine and recommender systems.
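To make the idea concrete, here is a minimal sketch of the inverse propensity scoring (IPS) estimator, one of the standard OPE estimators for logged bandit feedback. This example is illustrative only: the data layout and the `target_policy(context, action)` interface are assumptions, not the method of any particular paper listed below.

```python
import numpy as np

def ips_estimate(contexts, actions, rewards, logging_propensities, target_policy):
    """Estimate the value of a target policy from offline log data via IPS.

    Each logged record is (context, action, reward, logging_propensity),
    where logging_propensity is the probability the *logging* policy
    assigned to the action it actually took. target_policy(context, action)
    is a hypothetical callable returning the probability that the
    *evaluation* policy would take that action in that context.
    """
    # Importance weight: how much more (or less) often the target policy
    # would have taken the logged action than the logging policy did.
    weights = np.array([
        target_policy(x, a) / p
        for x, a, p in zip(contexts, actions, logging_propensities)
    ])
    # Reweighting the observed rewards yields an unbiased estimate of the
    # target policy's expected reward, with no online interaction needed.
    return float(np.mean(weights * np.asarray(rewards)))
```

If the logs came from a uniformly random policy over K actions, every entry of logging_propensities would simply be 1/K. In practice IPS can suffer from high variance, which motivates the doubly robust and stationary-distribution-correction estimators that appear in the paper list below.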

Papers

Showing 11–20 of 265 papers

Title | Status | Hype
COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation | Code | 1
Off-Policy Evaluation of Ranking Policies under Diverse User Behavior | Code | 1
Anytime-valid off-policy inference for contextual bandits | Code | 1
A Policy-Guided Imitation Approach for Offline Reinforcement Learning | Code | 1
A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation | Code | 1
Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions | Code | 1
Evaluating the Robustness of Off-Policy Evaluation | Code | 1
Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning | Code | 1
Active Offline Policy Selection | Code | 1
Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation | Code | 1
Page 2 of 27

No leaderboard results yet.