Benchmarking

Papers

Showing 1251–1260 of 5548 papers

Title	Date	Tasks	Status	Hype
EDFace-Celeb-1M: Benchmarking Face Hallucination with a Million-scale Dataset	Oct 11, 2021	Benchmarking, Face Hallucination	Code Available	1
SERAB: A multi-lingual benchmark for speech emotion recognition	Oct 7, 2021	Benchmarking, Emotion Recognition	Code Available	1
EntQA: Entity Linking as Question Answering	Oct 5, 2021	Benchmarking, Entity Linking	Code Available	1
Revisiting Self-Training for Few-Shot Learning of Language Model	Oct 4, 2021	Benchmarking, Few-Shot Learning	Code Available	1
Machine Learning with Knowledge Constraints for Process Optimization of Open-Air Perovskite Solar Cell Manufacturing	Oct 1, 2021	Bayesian Optimization, Benchmarking	Code Available	1
Phonetic Word Embeddings	Sep 30, 2021	Benchmarking, Word Embeddings	Code Available	1
MedPerf: Open Benchmarking Platform for Medical Artificial Intelligence using Federated Evaluation	Sep 29, 2021	Benchmarking, Philosophy	Code Available	1
Benchmarking Graph Neural Networks on Dynamic Link Prediction	Sep 29, 2021	Benchmarking, Dynamic Link Prediction	Code Available	1
"How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken Conversations	Sep 28, 2021	Benchmarking, Dialogue State Tracking	Code Available	1
FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding	Sep 27, 2021	Benchmarking, Natural Language Understanding	Code Available	1

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified