SOTAVerified

Benchmarking

Papers

Showing 981990 of 5548 papers

TitleStatusHype
NeuroGraph: Benchmarks for Graph Machine Learning in Brain ConnectomicsCode1
Yet Another ICU Benchmark: A Flexible Multi-Center Framework for Clinical MLCode1
On the Detectability of ChatGPT Content: Benchmarking, Methodology, and Evaluation through the Lens of Academic WritingCode1
Benchmarking Large Language Models on CMExam -- A Comprehensive Chinese Medical Exam DatasetCode1
RepoBench: Benchmarking Repository-Level Code Auto-Completion SystemsCode1
Str2Str: A Score-based Framework for Zero-shot Protein Conformation SamplingCode1
TransDocAnalyser: A Framework for Offline Semi-structured Handwritten Document Analysis in the Legal DomainCode1
BabySLM: language-acquisition-friendly benchmark of self-supervised spoken language modelsCode1
Multilingual Conceptual Coverage in Text-to-Image ModelsCode1
Spatially Resolved Gene Expression Prediction from H&E Histology Images via Bi-modal Contrastive LearningCode1
Show:102550
← PrevPage 99 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified