SOTAVerified

Benchmarking

Papers

Showing 37813790 of 5548 papers

TitleStatusHype
Less Is More: A Comparison of Active Learning Strategies for 3D Medical Image SegmentationCode1
HATE-ITA: New Baselines for Hate Speech Detection in ItalianCode0
SentSpace: Large-Scale Benchmarking and Evaluation of Text using Cognitively Motivated Lexical, Syntactic, and Semantic Features0
Towards Toxic Positivity Detection0
Benchmarking Intersectional Biases in NLPCode0
Beyond Emotion: A Multi-Modal Dataset for Human Desire Understanding0
DACSA: A large-scale Dataset for Automatic summarization of Catalan and Spanish newspaper Articles0
Dyna-bAbI: unlocking bAbI’s potential with dynamic synthetic benchmarking0
Benchmarking Language-agnostic Intent Classification for Virtual Assistant PlatformsCode0
Local manifold learning and its link to domain-based physics knowledgeCode0
Show:102550
← PrevPage 379 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified