SOTAVerified

Multiple-choice

Papers

Showing 951-960 of 1107 papers

Title | Status | Hype
AILS-NTUA at SemEval-2024 Task 9: Cracking Brain Teasers: Transformer Models for Lateral Thinking Puzzles | Code | 0
DyePack: Provably Flagging Test Set Contamination in LLMs Using Backdoors | Code | 0
EAIRA: Establishing a Methodology for Evaluating AI Models as Scientific Research Assistants | Code | 0
MMM: Multi-stage Multi-task Learning for Multi-choice Reading Comprehension | Code | 0
Anchored Answers: Unravelling Positional Bias in GPT-2's Multiple-Choice Questions | Code | 0
MM-PoE: Multiple Choice Reasoning via. Process of Elimination using Multi-Modal Models | Code | 0
Pragmatic Competence Evaluation of Large Language Models for the Korean Language | Code | 0
Which is the Effective Way for Gaokao: Information Retrieval or Neural Networks? | Code | 0
Edu-Values: Towards Evaluating the Chinese Education Values of Large Language Models | Code | 0
Investigating the Shortcomings of LLMs in Step-by-Step Legal Reasoning | Code | 0

No leaderboard results yet.