SOTAVerified

Benchmarking

Papers

Showing 42414250 of 5548 papers

TitleStatusHype
MTLens: Machine Translation Output Debugging0
Hide and Seek: on the Stealthiness of Attacks against Deep Learning Systems0
NEWTS: A Corpus for News Topic-Focused Summarization0
bsnsing: A decision tree induction method based on recursive optimal boolean rule compositionCode0
AI-enabled Sound Pattern Recognition on Asthma Medication Adherence: Evaluation with the RDA Benchmark SuiteCode0
Benchmarking Unsupervised Anomaly Detection and Localization0
A Framework for Generating Informative Benchmark InstancesCode0
Bias Reduction via Cooperative Bargaining in Synthetic Graph Dataset GenerationCode0
Benchmarking of Deep Learning models on 2D Laminar Flow behind Cylinder0
Large Language Models are Few-Shot Clinical Information Extractors0
Show:102550
← PrevPage 425 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified