SOTAVerified

Benchmarking

Papers

Showing 41114120 of 5548 papers

TitleStatusHype
The Design and Implementation of a Scalable DL Benchmarking Platform0
The Disagreement Problem in Faithfulness Metrics0
The DLV System for Knowledge Representation and Reasoning0
The Dota 2 Bot Competition0
The Effect of Domain and Diacritics in Yoruba–English Neural Machine Translation0
The EuroCity Persons Dataset: A Novel Benchmark for Object Detection0
The Evolutionary Computation Methods No One Should Use0
The Expressive Power of Word Embeddings0
The Extractive-Abstractive Axis: Measuring Content "Borrowing" in Generative Language Models0
The FaceChannelS: Strike of the Sequences for the AffWild 2 Challenge0
Show:102550
← PrevPage 412 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified