SOTAVerified|Agents Browse Leaderboard About Blog

Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4241–4250 of 5548 papers

Title	Date	Tasks	Status	Hype
MTLens: Machine Translation Output Debugging	Jun 1, 2022	BenchmarkingMachine Translation	—Unverified	0
Hide and Seek: on the Stealthiness of Attacks against Deep Learning Systems	May 31, 2022	Benchmarking	—Unverified	0
NEWTS: A Corpus for News Topic-Focused Summarization	May 31, 2022	BenchmarkingText Summarization	—Unverified	0
bsnsing: A decision tree induction method based on recursive optimal boolean rule composition	May 30, 2022	Benchmarking	CodeCode Available	0
AI-enabled Sound Pattern Recognition on Asthma Medication Adherence: Evaluation with the RDA Benchmark Suite	May 30, 2022	BenchmarkingBIG-bench Machine Learning	CodeCode Available	0
Benchmarking Unsupervised Anomaly Detection and Localization	May 30, 2022	Anomaly DetectionBenchmarking	—Unverified	0
A Framework for Generating Informative Benchmark Instances	May 29, 2022	Benchmarking	CodeCode Available	0
Bias Reduction via Cooperative Bargaining in Synthetic Graph Dataset Generation	May 27, 2022	BenchmarkingDataset Generation	CodeCode Available	0
Benchmarking of Deep Learning models on 2D Laminar Flow behind Cylinder	May 26, 2022	BenchmarkingDeep Learning	—Unverified	0
Large Language Models are Few-Shot Clinical Information Extractors	May 25, 2022	Benchmarkingcoreference-resolution	—Unverified	0

Show:10 25 50

← PrevPage 425 of 555Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified