SOTAVerified

Multiple-choice

Papers

Showing 1011–1020 of 1107 papers

| Title | Status | Hype |
| --- | --- | --- |
| GANDALF: a General Character Name Description Dataset for Long Fiction | | 0 |
| GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis | | 0 |
| Generalised Winograd Schema and its Contextuality | | 0 |
| Generalization v.s. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data | | 0 |
| Who did What: A Large-Scale Person-Centered Cloze Dataset | | 0 |
| Generating Adequate Distractors for Multiple-Choice Questions | | 0 |
| Generating Correct Answers for Progressive Matrices Intelligence Tests | | 0 |
| Generating Diagnostic Multiple Choice Comprehension Cloze Questions | | 0 |
| Who's the Best Detective? LLMs vs. MLs in Detecting Incoherent Fourth Grade Math Answers | | 0 |
| Generating multiple-choice questions for medical question answering with distractors and cue-masking | | 0 |
Page 102 of 111

No leaderboard results yet.