SOTAVerified

Benchmarking

Papers

Showing 35013510 of 5548 papers

TitleStatusHype
NEXT-EVAL: Next Evaluation of Traditional and LLM Web Data Record Extraction0
Next-generation MRD assays: do we have the tools to evaluate them properly?0
NL2KQL: From Natural Language to Kusto Query0
Benchmarking and Building Zero-Shot Hindi Retrieval Model with Hindi-BEIR and NLLB-E50
NLPre: a revised approach towards language-centric benchmarking of Natural Language Preprocessing systems0
No Dataset Needed for Downstream Knowledge Benchmarking: Response Dispersion Inversely Correlates with Accuracy on Domain-specific QA0
NODDI-SH: a computational efficient NODDI extension for fODF estimation in diffusion MRI0
Node Classification Meets Link Prediction on Knowledge Graphs0
Nodule detection and generation on chest X-rays: NODE21 Challenge0
NoisyEQA: Benchmarking Embodied Question Answering Against Noisy Queries0
Show:102550
← PrevPage 351 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified