SOTAVerified

Benchmarking

Papers

Showing 9911000 of 5548 papers

TitleStatusHype
Explainable Benchmarking for Iterative Optimization HeuristicsCode1
DialogueLLM: Context and Emotion Knowledge-Tuned Large Language Models for Emotion Recognition in ConversationsCode1
NAS-Bench-360: Benchmarking Neural Architecture Search on Diverse TasksCode1
NAS-Bench-Graph: Benchmarking Graph Neural Architecture SearchCode1
Benchmarking Neural Network Robustness to Common Corruptions and Surface VariationsCode1
Benchmarking for Biomedical Natural Language Processing Tasks with a Domain Specific ALBERTCode1
DIG In: Evaluating Disparities in Image Generations with Indicators for Geographic DiversityCode1
DiffuSETS: 12-lead ECG Generation Conditioned on Clinical Text Reports and Patient-Specific InformationCode1
Protein Structure Tokenization: Benchmarking and New RecipeCode1
Benchmarking Neural Network Generalization for Grammar InductionCode1
Show:102550
← PrevPage 100 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified