SOTAVerified

Benchmarking

Papers

Showing 38413850 of 5548 papers

TitleStatusHype
NEWTS: A Corpus for News Topic-Focused Summarization0
NEXT-EVAL: Next Evaluation of Traditional and LLM Web Data Record Extraction0
Next-generation MRD assays: do we have the tools to evaluate them properly?0
Benchmarking confound regression strategies for the control of motion artifact in studies of functional connectivity0
NL2KQL: From Natural Language to Kusto Query0
Benchmarking and Building Zero-Shot Hindi Retrieval Model with Hindi-BEIR and NLLB-E50
Benchmarking common uncertainty estimation methods with histopathological images under domain shift and label noise0
NLPre: a revised approach towards language-centric benchmarking of Natural Language Preprocessing systems0
A CUDA-Based Real Parameter Optimization Benchmark0
Benchmarking Collaborative Learning Methods Cost-Effectiveness for Prostate Segmentation0
Show:102550
← PrevPage 385 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified