SOTAVerified

Multiple-choice

Papers

Showing 326350 of 1107 papers

TitleStatusHype
Leveraging large language models for nano synthesis mechanism explanation: solid foundations or mere conjectures?Code0
LiveQA: A Question Answering Dataset over Sports LiveCode0
Eliciting Informative Text Evaluations with Large Language ModelsCode0
Towards Efficient Methods in Medical Question Answering using Knowledge Graph EmbeddingsCode0
HSI: Head-Specific Intervention Can Induce Misaligned AI Coordination in Large Language ModelsCode0
LLaVA-OneVision: Easy Visual Task TransferCode0
LEAVS: An LLM-based Labeler for Abdominal CT SupervisionCode0
Edu-Values: Towards Evaluating the Chinese Education Values of Large Language ModelsCode0
A Novel Multi-Stage Prompting Approach for Language Agnostic MCQ Generation using GPTCode0
Length Optimization in Conformal PredictionCode0
Learning to Correction: Explainable Feedback Generation for Visual Commonsense Reasoning DistractorCode0
Learning to Reuse Distractors to support Multiple Choice Question Generation in EducationCode0
Learning to Attend On Essential Terms: An Enhanced Retriever-Reader Model for Open-domain Question AnsweringCode0
Beyond English-Only Reading Comprehension: Experiments in Zero-Shot Multilingual Transfer for BulgarianCode0
EAIRA: Establishing a Methodology for Evaluating AI Models as Scientific Research AssistantsCode0
Leaving the barn door open for Clever Hans: Simple features predict LLM benchmark answersCode0
DyePack: Provably Flagging Test Set Contamination in LLMs Using BackdoorsCode0
BERT-based distractor generation for Swedish reading comprehension questions using a small-scale datasetCode0
SCoRE: Benchmarking Long-Chain Reasoning in Commonsense ScenariosCode0
DREAM: A Challenge Dataset and Models for Dialogue-Based Reading ComprehensionCode0
ElimiNet: A Model for Eliminating Options for Reading Comprehension with Multiple Choice QuestionsCode0
BertaQA: How Much Do Language Models Know About Local Culture?Code0
EMBRACE: Evaluation and Modifications for Boosting RACECode0
Language Models as Knowledge Bases for Visual Word Sense DisambiguationCode0
It's Not Easy Being Wrong: Large Language Models Struggle with Process of Elimination ReasoningCode0
Show:102550
← PrevPage 14 of 45Next →

No leaderboard results yet.