SOTAVerified

Multiple-choice

Papers

Showing 961970 of 1107 papers

TitleStatusHype
Precise Task Formalization Matters in Winograd Schema EvaluationsCode0
Towards a Unified Multimodal Reasoning FrameworkCode0
IPEval: A Bilingual Intellectual Property Agency Consultation Evaluation Benchmark for Large Language ModelsCode0
iREL at SemEval-2024 Task 9: Improving Conventional Prompting Methods for Brain TeasersCode0
Eliciting Informative Text Evaluations with Large Language ModelsCode0
ElimiNet: A Model for Eliminating Options for Reading Comprehension with Multiple Choice QuestionsCode0
Self-Recognition in Language ModelsCode0
EMBRACE: Evaluation and Modifications for Boosting RACECode0
Can multiple-choice questions really be useful in detecting the abilities of LLMs?Code0
Modular Sentence Encoders: Separating Language Specialization from Cross-Lingual AlignmentCode0
Show:102550
← PrevPage 97 of 111Next →

No leaderboard results yet.