SOTAVerified

Multiple-choice

Papers

Showing 10711080 of 1107 papers

TitleStatusHype
Noise Injection Reveals Hidden Capabilities of Sandbagging Language ModelsCode0
Spoken Language Intelligence of Large Language Models for Language LearningCode0
ReGraP-LLaVA: Reasoning enabled Graph-based Personalized Large Language and Vision AssistantCode0
LLMs Are Not Intelligent Thinkers: Introducing Mathematical Topic Tree Benchmark for Comprehensive Evaluation of LLMsCode0
Balancing Rigor and Utility: Mitigating Cognitive Biases in Large Language Models for Multiple-Choice QuestionsCode0
What Makes Reading Comprehension Questions Difficult?Code0
Wait, that's not an option: LLMs Robustness with Incorrect Multiple-Choice OptionsCode0
COLUMBUS: Evaluating COgnitive Lateral Understanding through Multiple-choice reBUSesCode0
An Information-Theoretic Approach to Analyze NLP Classification TasksCode0
World Knowledge in Multiple Choice Reading ComprehensionCode0
Show:102550
← PrevPage 108 of 111Next →

No leaderboard results yet.