SOTAVerified

Benchmarking

Papers

Showing 41914200 of 5548 papers

TitleStatusHype
Which Model to Trust: Assessing the Influence of Models on the Performance of Reinforcement Learning Algorithms for Continuous Control TasksCode0
Identifying and Benchmarking Natural Out-of-Context Prediction ProblemsCode0
Scientific Machine Learning Benchmarks0
Benchmarking of Lightweight Deep Learning Architectures for Skin Cancer Classification using ISIC 2017 Dataset0
Learning with Noisy Labels Revisited: A Study Using Real-World Human AnnotationsCode1
MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems0
OpenABC-D: A Large-Scale Dataset For Machine Learning Guided Integrated Circuit SynthesisCode1
Text-Based Person Search with Limited DataCode1
Improved Multilingual Language Model Pretraining for Social Media Text via Translation Pair PredictionCode0
An Open Natural Language Processing Development Framework for EHR-based Clinical Research: A case demonstration using the National COVID Cohort Collaborative (N3C)0
Show:102550
← PrevPage 420 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified