SOTAVerified

Benchmarking

Papers

Showing 871880 of 5548 papers

TitleStatusHype
Benchmarking Deep Graph Generative Models for Optimizing New Drug Molecules for COVID-19Code1
AI in Lung Health: Benchmarking Detection and Diagnostic Models Across Multiple CT Scan DatasetsCode1
CIPCaD-Bench: Continuous Industrial Process datasets for benchmarking Causal Discovery methodsCode1
Initial recommendations for performing, benchmarking, and reporting single-cell proteomics experimentsCode1
In Search of Lost Online Test-time Adaptation: A SurveyCode1
Insights from Benchmarking Frontier Language Models on Web App Code GenerationCode1
A Survey of Pathology Foundation Model: Progress and Future DirectionsCode1
A Comprehensive Benchmark for RNA 3D Structure-Function ModelingCode1
GEOM-Drugs Revisited: Toward More Chemically Accurate Benchmarks for 3D Molecule GenerationCode1
Circumventing shortcuts in audio-visual deepfake detection datasets with unsupervised learningCode1
Show:102550
← PrevPage 88 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified