Benchmarking

Papers

Showing 4191–4200 of 5548 papers

Title	Date	Tasks	Status	Hype
Which Model to Trust: Assessing the Influence of Models on the Performance of Reinforcement Learning Algorithms for Continuous Control Tasks	Oct 25, 2021	Benchmarking, continuous-control	Code Available	0
Identifying and Benchmarking Natural Out-of-Context Prediction Problems	Oct 25, 2021	Benchmarking	Code Available	0
Scientific Machine Learning Benchmarks	Oct 25, 2021	Benchmarking, BIG-bench Machine Learning	Unverified	0
Benchmarking of Lightweight Deep Learning Architectures for Skin Cancer Classification using ISIC 2017 Dataset	Oct 23, 2021	Benchmarking, Cancer Classification	Unverified	0
Learning with Noisy Labels Revisited: A Study Using Real-World Human Annotations	Oct 22, 2021	Benchmarking, Learning with noisy labels	Code Available	1
MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems	Oct 21, 2021	Benchmarking, BIG-bench Machine Learning	Unverified	0
OpenABC-D: A Large-Scale Dataset For Machine Learning Guided Integrated Circuit Synthesis	Oct 21, 2021	Benchmarking, BIG-bench Machine Learning	Code Available	1
Text-Based Person Search with Limited Data	Oct 20, 2021	Benchmarking, Contrastive Learning	Code Available	1
Improved Multilingual Language Model Pretraining for Social Media Text via Translation Pair Prediction	Oct 20, 2021	Benchmarking, Language Modeling	Code Available	0
An Open Natural Language Processing Development Framework for EHR-based Clinical Research: A case demonstration using the National COVID Cohort Collaborative (N3C)	Oct 20, 2021	Benchmarking	Unverified	0

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified