SOTAVerified

Benchmarking

Papers

Showing 35113520 of 5548 papers

TitleStatusHype
NoisyHate: Mining Online Human-Written Perturbations for Realistic Robustness Benchmarking of Content Moderation Models0
Noisy intermediate-scale quantum (NISQ) algorithms0
InferBench: Understanding Deep Learning Inference Serving with an Automatic Benchmarking System0
Non-Contextual Modeling of Sarcasm using a Neural Network Benchmark0
Non-Reference Quality Assessment for Medical Imaging: Application to Synthetic Brain MRIs0
Nonstochastic Bandits with Infinitely Many Experts0
NoTeS-Bank: Benchmarking Neural Transcription and Search for Scientific Notes Understanding0
Not Every Tree Is a Forest: Benchmarking Forest Types from Satellite Remote Sensing0
NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription0
NOVA: A Benchmark for Anomaly Localization and Clinical Reasoning in Brain MRI0
Show:102550
← PrevPage 352 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified