SOTAVerified

Benchmarking

Papers

Showing 30513060 of 5548 papers

TitleStatusHype
ImputeGAP: A Comprehensive Library for Time Series Imputation0
Benchmarking Table Comprehension In The Wild0
InAttention: Linear Context Scaling for Transformers0
Inaugural MOASEI Competition at AAMAS'2025: A Technical Report0
INCLUSIFY: A benchmark and a model for gender-inclusive German0
The Partial Response Network: a neural network nomogram0
IndicMMLU-Pro: Benchmarking Indic Large Language Models on Multi-Task Language Understanding0
IndicNLG Benchmark: Multilingual Datasets for Diverse NLG Tasks in Indic Languages0
IndicSTR12: A Dataset for Indic Scene Text Recognition0
Benchmarking Systematic Relational Reasoning with Large Language and Reasoning Models0
Show:102550
← PrevPage 306 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified