SOTAVerified|Agents Browse Leaderboard About Blog

Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4111–4120 of 5548 papers

Title	Date	Tasks	Status	Hype
The Design and Implementation of a Scalable DL Benchmarking Platform	Nov 19, 2019	Benchmarking	—Unverified	0
The Disagreement Problem in Faithfulness Metrics	Nov 13, 2023	BenchmarkingExplainable artificial intelligence	—Unverified	0
The DLV System for Knowledge Representation and Reasoning	Nov 4, 2002	Benchmarking	—Unverified	0
The Dota 2 Bot Competition	Mar 4, 2021	BenchmarkingDota 2	—Unverified	0
The Effect of Domain and Diacritics in Yoruba–English Neural Machine Translation	Aug 1, 2021	BenchmarkingMachine Translation	—Unverified	0
The EuroCity Persons Dataset: A Novel Benchmark for Object Detection	May 18, 2018	BenchmarkingObject	—Unverified	0
The Evolutionary Computation Methods No One Should Use	Jan 5, 2023	Benchmarking	—Unverified	0
The Expressive Power of Word Embeddings	Jan 15, 2013	BenchmarkingSentence	—Unverified	0
The Extractive-Abstractive Axis: Measuring Content "Borrowing" in Generative Language Models	Jul 20, 2023	Benchmarking	—Unverified	0
The FaceChannelS: Strike of the Sequences for the AffWild 2 Challenge	Oct 4, 2020	BenchmarkingBIG-bench Machine Learning	—Unverified	0

Show:10 25 50

← PrevPage 412 of 555Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified