SOTAVerified

Benchmarking

Papers

Showing 12511260 of 5548 papers

TitleStatusHype
EDFace-Celeb-1M: Benchmarking Face Hallucination with a Million-scale DatasetCode1
SERAB: A multi-lingual benchmark for speech emotion recognitionCode1
EntQA: Entity Linking as Question AnsweringCode1
Revisiting Self-Training for Few-Shot Learning of Language ModelCode1
Machine Learning with Knowledge Constraints for Process Optimization of Open-Air Perovskite Solar Cell ManufacturingCode1
Phonetic Word EmbeddingsCode1
MedPerf: Open Benchmarking Platform for Medical Artificial Intelligence using Federated EvaluationCode1
Benchmarking Graph Neural Networks on Dynamic Link PredictionCode1
"How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken ConversationsCode1
FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language UnderstandingCode1
Show:102550
← PrevPage 126 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified