SOTAVerified

Benchmarking

Papers

Showing 14011410 of 5548 papers

TitleStatusHype
BenchML: an extensible pipelining framework for benchmarking representations of materials and molecules at scaleCode1
scSSL-Bench: Benchmarking Self-Supervised Learning for Single-Cell DataCode1
Benchpress: A Scalable and Versatile Workflow for Benchmarking Structure Learning AlgorithmsCode1
EDFace-Celeb-1M: Benchmarking Face Hallucination with a Million-scale DatasetCode1
Benchmarking Transcriptomics Foundation Models for Perturbation Analysis : one PCA still rules them allCode1
AllClear: A Comprehensive Dataset and Benchmark for Cloud Removal in Satellite ImageryCode1
InstructTTSEval: Benchmarking Complex Natural-Language Instruction Following in Text-to-Speech SystemsCode1
IOHanalyzer: Detailed Performance Analyses for Iterative Optimization HeuristicsCode1
Benchmarking and scaling of deep learning models for land cover image classificationCode1
SoK: Membership Inference Attacks on LLMs are Rushing Nowhere (and How to Fix It)Code1
Show:102550
← PrevPage 141 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified