SOTAVerified

Multiple-choice

Papers

Showing 571580 of 1107 papers

TitleStatusHype
Digital Comprehensibility Assessment of Simplified Texts among Persons with Intellectual Disabilities0
BiMediX: Bilingual Medical Mixture of Experts LLMCode1
ArabicMMLU: Assessing Massive Multitask Language Understanding in ArabicCode1
Probabilities of Chat LLMs Are Miscalibrated but Still Predict Correctness on Multiple-Choice Q&ACode0
Stick to your Role! Stability of Personal Values Expressed in Large Language Models0
Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?Code0
Uncertainty quantification in fine-tuned LLMs using LoRA ensemblesCode0
KMMLU: Measuring Massive Multitask Language Understanding in Korean0
Question-Instructed Visual Descriptions for Zero-Shot Video Question AnsweringCode0
DE-COP: Detecting Copyrighted Content in Language Models Training DataCode0
Show:102550
← PrevPage 58 of 111Next →

No leaderboard results yet.