SOTAVerified|Agents Browse Leaderboard About

Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1381–1390 of 5548 papers

Title	Date	Tasks	Status	Hype
A Critical Assessment of State-of-the-Art in Entity Alignment	Oct 30, 2020	BenchmarkingEntity Alignment	CodeCode Available	1
Benchmarking Deep Learning Interpretability in Time Series Predictions	Oct 26, 2020	BenchmarkingDeep Learning	CodeCode Available	1
Kvasir-Instrument: Diagnostic and therapeutic tool segmentation dataset in gastrointestinal endoscopy	Oct 23, 2020	BenchmarkingDiagnostic	CodeCode Available	1
KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and Kirundi	Oct 23, 2020	ArticlesBenchmarking	CodeCode Available	1
Exploiting News Article Structure for Automatic Corpus Generation of Entailment Datasets	Oct 22, 2020	ArticlesBenchmarking	CodeCode Available	1
Self-Alignment Pretraining for Biomedical Entity Representations	Oct 22, 2020	BenchmarkingEntity Linking	CodeCode Available	1
German's Next Language Model	Oct 21, 2020	BenchmarkingDocument Classification	CodeCode Available	1
Promoting High Diversity Ensemble Learning with EnsembleBench	Oct 20, 2020	BenchmarkingDiversity	CodeCode Available	1
RobustBench: a standardized adversarial robustness benchmark	Oct 19, 2020	Adversarial RobustnessBenchmarking	CodeCode Available	1
RADIATE: A Radar Dataset for Automotive Perception in Bad Weather	Oct 18, 2020	Autonomous DrivingBenchmarking	CodeCode Available	1

Show:10 25 50

← PrevPage 139 of 555Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified