SOTAVerified

Multiple-choice

Papers

Showing 10911100 of 1107 papers

TitleStatusHype
Grade Score: Quantifying LLM Performance in Option SelectionCode0
Look at the Text: Instruction-Tuned Language Models are More Robust Multiple Choice Selectors than You ThinkCode0
This Is Your Doge, If It Please You: Exploring Deception and Robustness in Mixture of LLMsCode0
StoryAnalogy: Deriving Story-level Analogies from Large Language Models to Unlock Analogical UnderstandingCode0
Grounding Synthetic Data Evaluations of Language Models in Unsupervised Document CorporaCode0
From Multiple-Choice to Extractive QA: A Case Study for English and ArabicCode0
ARR: Question Answering with Large Language Models via Analyzing, Retrieving, and ReasoningCode0
Strengthened Symbol Binding Makes Large Language Models Reliable Multiple-Choice SelectorsCode0
QMOS: Enhancing LLMs for Telecommunication with Question Masked loss and Option ShufflingCode0
Truth Knows No Language: Evaluating Truthfulness Beyond EnglishCode0
Show:102550
← PrevPage 110 of 111Next →

No leaderboard results yet.