SOTAVerified

Benchmarking

Papers

Showing 39413950 of 5548 papers

TitleStatusHype
From Modern CNNs to Vision Transformers: Assessing the Performance, Robustness, and Classification Strategies of Deep Learning Models in HistopathologyCode0
Data Splits and Metrics for Method Benchmarking on Surgical Action Triplet DatasetsCode1
Metaethical Perspectives on 'Benchmarking' AI Ethics0
Benchmarking for Public Health Surveillance tasks on Social Media with a Domain-Specific Pretrained Language Model0
BioRED: A Rich Biomedical Relation Extraction DatasetCode1
Disability prediction in multiple sclerosis using performance outcome measures and demographic data0
tmVar 3.0: an improved variant concept recognition and normalization tool0
Deep Visual Geo-localization BenchmarkCode2
The Moral Integrity Corpus: A Benchmark for Ethical Dialogue SystemsCode1
CLEAVE: Scalable and Edge-native Benchmarking of Networked Control SystemsCode0
Show:102550
← PrevPage 395 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified