SOTAVerified|Agents Browse Leaderboard About Blog

Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4341–4350 of 5548 papers

Title	Date	Tasks	Status	Hype
WER We Stand: Benchmarking Urdu ASR Models	Sep 17, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
What can 5.17 billion regression fits tell us about artificial models of the human visual system?	Oct 12, 2021	Benchmarking	—Unverified	0
What cleaves? Is proteasomal cleavage prediction reaching a ceiling?	Oct 24, 2022	BenchmarkingDenoising	—Unverified	0
What Does Neuro Mean to Cardio? Investigating the Role of Clinical Specialty Data in Medical LLMs	May 15, 2025	AllBenchmarking	—Unverified	0
What Emotions Make One or Five Stars? Understanding Ratings of Online Product Reviews by Sentiment Analysis and XAI	Feb 29, 2020	BenchmarkingBIG-bench Machine Learning	—Unverified	0
What if we had no Wikipedia? Domain-independent Term Extraction from a Large News Corpus	Sep 17, 2020	BenchmarkingTerm Extraction	—Unverified	0
Alexpaca: Learning Factual Clarification Question Generation Without Examples	Oct 17, 2023	BenchmarkingChatbot	—Unverified	0
What Motivates You? Benchmarking Automatic Detection of Basic Needs from Short Posts	Aug 1, 2021	BenchmarkingBinary Classification	—Unverified	0
Towards Self-adaptive Mutation in Evolutionary Multi-Objective Algorithms	Mar 8, 2023	BenchmarkingEvolutionary Algorithms	—Unverified	0
What Will it Take to Fix Benchmarking in Natural Language Understanding?	Apr 5, 2021	BenchmarkingNatural Language Understanding	—Unverified	0

Show:10 25 50

← PrevPage 435 of 555Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified