SOTAVerified|Agents Browse Leaderboard About

Off-policy evaluation

Off-policy Evaluation (OPE), or offline evaluation in general, evaluates the performance of hypothetical policies leveraging only offline log data. It is particularly useful in applications where the online interaction involves high stakes and expensive setting such as precision medicine and recommender systems.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 11–20 of 265 papers

Title	Date	Tasks	Status	Hype
Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol	Feb 11, 2025	Model SelectionOff-policy evaluation	—Unverified	0
Trajectory World Models for Heterogeneous Environments	Feb 3, 2025	DiversityModel Predictive Control	CodeCode Available	1
Off-policy Evaluation for Payments at Adyen	Jan 15, 2025	BenchmarkingDecision Making	—Unverified	0
Off-Policy Evaluation and Counterfactual Methods in Dynamic Auction Environments	Jan 9, 2025	counterfactualDecision Making	—Unverified	0
CANDOR: Counterfactual ANnotated DOubly Robust Off-Policy Evaluation	Dec 11, 2024	counterfactualOff-policy evaluation	—Unverified	0
Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning	Dec 8, 2024	Off-policy evaluation	CodeCode Available	0
Concept-driven Off Policy Evaluation	Nov 28, 2024	Off-policy evaluation	—Unverified	0
Logarithmic Neyman Regret for Adaptive Estimation of the Average Treatment Effect	Nov 21, 2024	Causal InferenceOff-policy evaluation	—Unverified	0
Off-policy estimation with adaptively collected data: the power of online learning	Nov 19, 2024	Causal InferenceMulti-Armed Bandits	—Unverified	0
Minimum Empirical Divergence for Sub-Gaussian Linear Bandits	Oct 31, 2024	Multi-Armed BanditsOff-policy evaluation	CodeCode Available	0

Show:10 25 50

← PrevPage 2 of 27Next →

No leaderboard results yet.