SOTAVerified

Benchmarking

Papers

Showing 15111520 of 5548 papers

TitleStatusHype
Benchmarking Framework for Performance-Evaluation of Causal Inference AnalysisCode0
Benchmarking framework for machine learning classification from fNIRS dataCode0
Benchmarking Foundation Models on Exceptional Cases: Dataset Creation and ValidationCode0
Knowledge Enhanced Conditional Imputation for Healthcare Time-seriesCode0
SCoRE: Benchmarking Long-Chain Reasoning in Commonsense ScenariosCode0
LABCAT: Locally adaptive Bayesian optimization using principal-component-aligned trust regionsCode0
A Position Paper on the Automatic Generation of Machine Learning LeaderboardsCode0
ADVIO: An authentic dataset for visual-inertial odometryCode0
Knowing-how & Knowing-that: A New Task for Machine Comprehension of User ManualsCode0
ApisTox: a new benchmark dataset for the classification of small molecules toxicity on honey beesCode0
Show:102550
← PrevPage 152 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified