SOTAVerified

Benchmarking

Papers

Showing 13811390 of 5548 papers

TitleStatusHype
A Critical Assessment of State-of-the-Art in Entity AlignmentCode1
Benchmarking Deep Learning Interpretability in Time Series PredictionsCode1
Kvasir-Instrument: Diagnostic and therapeutic tool segmentation dataset in gastrointestinal endoscopyCode1
KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and KirundiCode1
Exploiting News Article Structure for Automatic Corpus Generation of Entailment DatasetsCode1
Self-Alignment Pretraining for Biomedical Entity RepresentationsCode1
German's Next Language ModelCode1
Promoting High Diversity Ensemble Learning with EnsembleBenchCode1
RobustBench: a standardized adversarial robustness benchmarkCode1
RADIATE: A Radar Dataset for Automotive Perception in Bad WeatherCode1
Show:102550
← PrevPage 139 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified