SOTAVerified

Benchmarking

Papers

Showing 4201-4210 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| Dyna-bAbI: unlocking bAbI’s potential with dynamic synthetic benchmarking | | 0 |
| HATE-ITA: New Baselines for Hate Speech Detection in Italian | Code | 0 |
| Benchmarking Intersectional Biases in NLP | Code | 0 |
| SentSpace: Large-Scale Benchmarking and Evaluation of Text using Cognitively Motivated Lexical, Syntactic, and Semantic Features | | 0 |
| Local manifold learning and its link to domain-based physics knowledge | Code | 0 |
| Analyzing the behaviour of D'WAVE quantum annealer: fine-tuning parameterization and tests with restrictive Hamiltonian formulations | | 0 |
| Benchmarking Language-agnostic Intent Classification for Virtual Assistant Platforms | Code | 0 |
| Beyond Emotion: A Multi-Modal Dataset for Human Desire Understanding | | 0 |
| Computer-aided diagnosis and prediction in brain disorders | | 0 |
| An extensible Benchmarking Graph-Mesh dataset for studying Steady-State Incompressible Navier-Stokes Equations | Code | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |