SOTAVerified

Benchmarking

Papers

Showing 17811790 of 5548 papers

TitleStatusHype
Coherent Feed Forward Quantum Neural Network0
Rethinking Coherence Modeling: Synthetic vs. Downstream Tasks0
DiS-ReX: A Multilingual Dataset for Distantly Supervised Relation Extraction0
ChemPile: A 250GB Diverse and Curated Dataset for Chemical Foundation Models0
An Empirical Study of Benchmarking Chinese Aspect Sentiment Quad Prediction0
User-in-the-loop Evaluation of Multimodal LLMs for Activity Assistance0
ChatGPT vs State-of-the-Art Models: A Benchmarking Study in Keyphrase Generation Task0
Colonoscopy 3D Video Dataset with Paired Depth from 2D-3D Registration0
Benchmarking Answer Verification Methods for Question Answering-Based Summarization Evaluation Metrics0
Discriminative Link Prediction using Local Links, Node Features and Community Structure0
Show:102550
← PrevPage 179 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified