SOTAVerified

Benchmarking

Papers

Showing 19912000 of 5548 papers

TitleStatusHype
How far are today's time-series models from real-world weather forecasting applications?Code2
The Elusive Pursuit of Reproducing PATE-GAN: Benchmarking, Auditing, DebuggingCode0
Benchmarking Monocular 3D Dog Pose Estimation Using In-The-Wild Motion Capture Data0
African or European Swallow? Benchmarking Large Vision-Language Models for Fine-Grained Object ClassificationCode1
HoTPP Benchmark: Are We Good at the Long Horizon Events Forecasting?Code2
Resource-efficient Medical Image Analysis with Self-adapting Forward-Forward Networks0
DASB -- Discrete Audio and Speech Benchmark0
A Benchmarking Study of Kolmogorov-Arnold Networks on Tabular DataCode1
FairX: A comprehensive benchmarking tool for model analysis using fairness, utility, and explainabilityCode0
PoseBench: Benchmarking the Robustness of Pose Estimation Models under Corruptions0
Show:102550
← PrevPage 200 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified