SOTAVerified

Multiple-choice

Papers

Showing 741750 of 1107 papers

TitleStatusHype
Evaluating LLMs on Document-Based QA: Exact Answer Selection and Numerical Extraction using Cogtale dataset0
It's Not Easy Being Wrong: Large Language Models Struggle with Process of Elimination ReasoningCode0
Characterizing Large Language Models as Rationalizers of Knowledge-intensive Tasks0
Assessing Distractors in Multiple-Choice Tests0
Evaluating multiple large language models in pediatric ophthalmology0
Evaluating the Potential of Leading Large Language Models in Reasoning Biology Questions0
More Robots are Coming: Large Multimodal Models (ChatGPT) can Solve Visually Diverse Images of Parsons Problems0
CASE: Commonsense-Augmented Score with an Expanded Answer SpaceCode0
DeSIQ: Towards an Unbiased, Challenging Benchmark for Social Intelligence Understanding0
POE: Process of Elimination for Multiple Choice ReasoningCode0
Show:102550
← PrevPage 75 of 111Next →

No leaderboard results yet.