SOTAVerified

Benchmarking

Papers

Showing 30513075 of 5548 papers

TitleStatusHype
ImputeGAP: A Comprehensive Library for Time Series Imputation0
Benchmarking Table Comprehension In The Wild0
InAttention: Linear Context Scaling for Transformers0
Inaugural MOASEI Competition at AAMAS'2025: A Technical Report0
INCLUSIFY: A benchmark and a model for gender-inclusive German0
The Partial Response Network: a neural network nomogram0
IndicMMLU-Pro: Benchmarking Indic Large Language Models on Multi-Task Language Understanding0
IndicNLG Benchmark: Multilingual Datasets for Diverse NLG Tasks in Indic Languages0
IndicSTR12: A Dataset for Indic Scene Text Recognition0
Benchmarking Systematic Relational Reasoning with Large Language and Reasoning Models0
A framework for benchmarking uncertainty in deep regression0
Individual Treatment Effect Estimation Through Controlled Neural Network Training in Two Stages0
The Pitfalls of Benchmarking in Algorithm Selection: What We Are Getting Wrong0
IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP0
Benchmarking symbolic regression constant optimization schemes0
Benchmarking Surrogate-Assisted Genetic Recommender Systems0
Benchmarking Super-Resolution Algorithms on Real Data0
Influence-Optimistic Local Values for Multiagent Planning --- Extended Version0
InfoDeepSeek: Benchmarking Agentic Information Seeking for Retrieval-Augmented Generation0
Benchmarking Sub-Genre Classification For Mainstage Dance Music0
InfoSEM: A Deep Generative Model with Informative Priors for Gene Regulatory Network Inference0
InfraLib: Enabling Reinforcement Learning and Decision-Making for Large-Scale Infrastructure Management0
Benchmarking state-of-the-art gradient boosting algorithms for classification0
Benchmarking State-of-the-Art Deep Learning Software Tools0
Benchmarking Spiking Neural Network Learning Methods with Varying Locality0
Show:102550
← PrevPage 123 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified