SOTAVerified

Benchmarking

Papers

Showing 15611570 of 5548 papers

TitleStatusHype
Knowing-how & Knowing-that: A New Task for Machine Comprehension of User ManualsCode0
Knowledge Enhanced Conditional Imputation for Healthcare Time-seriesCode0
Benchmarking Domain Generalization Algorithms in Computational PathologyCode0
SCoRE: Benchmarking Long-Chain Reasoning in Commonsense ScenariosCode0
Benchmarking Distributional Alignment of Large Language ModelsCode0
A novel evaluation methodology for supervised Feature Ranking algorithmsCode0
Benchmarking Differentially Private Residual Networks for Medical ImageryCode0
Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question AnsweringCode0
Benchmarking Dependence Measures to Prevent Shortcut Learning in Medical ImagingCode0
KamNet: An Integrated Spatiotemporal Deep Neural Network for Rare Event Search in KamLAND-ZenCode0
Show:102550
← PrevPage 157 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified