SOTAVerified

Benchmarking

Papers

Showing 1501-1510 of 5548 papers

Title | Status | Hype
LaCViT: A Label-aware Contrastive Fine-tuning Framework for Vision Transformers | Code | 0
Benchmarking GPT-4 against Human Translators: A Comprehensive Evaluation Across Languages, Domains, and Expertise Levels | Code | 0
LABCAT: Locally adaptive Bayesian optimization using principal-component-aligned trust regions | Code | 0
Language-based Image Colorization: A Benchmark and Beyond | Code | 0
Benchmarking Generative Latent Variable Models for Speech | Code | 0
Benchmarking Generative AI Models for Deep Learning Test Input Generation | Code | 0
SCoRE: Benchmarking Long-Chain Reasoning in Commonsense Scenarios | Code | 0
Knowledge Enhanced Conditional Imputation for Healthcare Time-series | Code | 0
Benchmarking Framework for Performance-Evaluation of Causal Inference Analysis | Code | 0
Benchmarking framework for machine learning classification from fNIRS data | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified