SOTAVerified

Benchmarking

Papers

Showing 29212930 of 5548 papers

TitleStatusHype
Distribution-Based Invariant Deep Networks for Learning Meta-Features0
Sensitivity analysis and experimental evaluation of PID-like continuous sliding mode control0
Diverse Community Data for Benchmarking Data Privacy Algorithms0
DLBricks: Composable Benchmark Generation to Reduce Deep Learning Benchmarking Effort on CPUs (Extended)0
DLUE: Benchmarking Document Language Understanding0
DNR Bench: Benchmarking Over-Reasoning in Reasoning LLMs0
A Sober Look at the Robustness of CLIPs to Spurious Features0
Does AI for science need another ImageNet Or totally different benchmarks? A case study of machine learning force fields0
Does imputation matter? Benchmark for predictive models0
Domain Adaptation for Arabic Machine Translation: The Case of Financial Texts0
Show:102550
← PrevPage 293 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified