SOTAVerified

Benchmarking

Papers

Showing 17711780 of 5548 papers

TitleStatusHype
CheXwhatsApp: A Dataset for Exploring Challenges in the Diagnosis of Chest X-rays through Mobile Devices0
DISC: a Dataset for Integrated Sensing and Communication in mmWave Systems0
LAraBench: Benchmarking Arabic AI with Large Language Models0
CKnowEdit: A New Chinese Knowledge Editing Dataset for Linguistics, Facts, and Logic Error Correction in LLMs0
ChemTime: Rapid and Early Classification for Multivariate Time Series Classification of Chemical Sensors0
An Empirical Study of Super-resolution on Low-resolution Micro-expression Recognition0
Disambiguation in Conversational Question Answering in the Era of LLM: A Survey0
DISCOMAN: Dataset of Indoor SCenes for Odometry, Mapping And Navigation0
ChemPile: A 250GB Diverse and Curated Dataset for Chemical Foundation Models0
An Empirical Study of Benchmarking Chinese Aspect Sentiment Quad Prediction0
Show:102550
← PrevPage 178 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified