SOTAVerified

Multiple-choice

Papers

Showing 971980 of 1107 papers

TitleStatusHype
Increasing Probability Mass on Answer Choices Does Not Always Improve AccuracyCode0
Is Your Large Language Model Knowledgeable or a Choices-Only Cheater?Code0
Iterative Forward Tuning Boosts In-Context Learning in Language ModelsCode0
Can We Guide a Multi-Hop Reasoning Language Model to Incrementally Learn at Each Single-Hop?Code0
BnMMLU: Measuring Massive Multitask Language Understanding in BengaliCode0
It's Not Easy Being Wrong: Large Language Models Struggle with Process of Elimination ReasoningCode0
Investigating Prior Knowledge for Challenging Chinese Machine Reading ComprehensionCode0
Joint Learning of Sentence Embeddings for Relevance and EntailmentCode0
Enhancing textual textbook question answering with large language models and retrieval augmented generationCode0
Kaleidoscope: In-language Exams for Massively Multilingual Vision EvaluationCode0
Show:102550
← PrevPage 98 of 111Next →

No leaderboard results yet.