SOTAVerified

Multiple-choice

Papers

Showing 701725 of 1107 papers

TitleStatusHype
Is This Collection Worth My LLM's Time? Automatically Measuring Information Potential in Text Corpora0
An Algorithm for Generating Gap-Fill Multiple Choice Questions of an Expert System0
It is Too Many Options: Pitfalls of Multiple-Choice Questions in Generative AI and Medical Education0
Winning Amazon KDD Cup'240
KMMLU: Measuring Massive Multitask Language Understanding in Korean0
Knowledge-Driven Distractor Generation for Cloze-style Multiple Choice Questions0
Knowledge Questions from Knowledge Graphs0
Knowledge Retrieval Based on Generative AI0
KoBALT: Korean Benchmark For Advanced Linguistic Tasks0
KorMedMCQA: Multi-Choice Question Answering Benchmark for Korean Healthcare Professional Licensing Examinations0
KorNAT: LLM Alignment Benchmark for Korean Social Values and Common Knowledge0
KRISTEVA: Close Reading as a Novel Task for Benchmarking Interpretive Reasoning0
LAB-Bench: Measuring Capabilities of Language Models for Biology Research0
LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs0
Language Enhanced Model for Eye (LEME): An Open-Source Ophthalmology-Specific Large Language Model0
Language models are susceptible to incorrect patient self-diagnosis in medical applications0
Uncovering Cultural Representation Disparities in Vision-Language Models0
Language Models (Mostly) Know What They Know0
Uncovering Temporal Context for Video Question and Answering0
LAR-ECHR: A New Legal Argument Reasoning Task and Dataset for Cases of the European Court of Human Rights0
Large Language Models Are Self-Taught Reasoners: Enhancing LLM Applications via Tailored Problem-Solving Demonstrations0
Large Language Models Could Be Rote Learners0
Understanding Dataset Design Choices for Multi-hop Reasoning0
Large Language Models (GPT) Struggle to Answer Multiple-Choice Questions about Code0
Large Language Models Often Know When They Are Being Evaluated0
Show:102550
← PrevPage 29 of 45Next →

No leaderboard results yet.