SOTAVerified

Multiple Choice Question Answering (MCQA)

A multiple-choice question (MCQ) consists of two parts: a stem, which states the question or problem, and a set of alternatives (possible answers) comprising a key, the single best answer, and a number of distractors, which are plausible but incorrect answers.

In a k-way MCQA task, a model is given a question q, a set of k candidate options O = {O1, . . . , Ok}, and a supporting context for each option, C = {C1, . . . , Ck}. The model must predict the option that is best supported by its context.
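To make the task interface concrete, here is a minimal sketch of a k-way MCQA predictor. The scoring rule (lexical overlap between question-plus-option and the option's context) is a hypothetical toy baseline for illustration, not the method of any paper listed below; real systems would score options with a language model.

```python
# Toy k-way MCQA baseline: pick the option whose supporting context
# overlaps most with the question plus that option.
# Illustrative sketch only; not any listed paper's method.

def predict(question: str, options: list[str], contexts: list[str]) -> int:
    """Return the index of the option best supported by its context."""
    assert len(options) == len(contexts), "k-way task: one context per option"

    def tokens(text: str) -> set[str]:
        return set(text.lower().split())

    scores = []
    for option, context in zip(options, contexts):
        # Fraction of (question + option) tokens found in the context.
        query = tokens(question) | tokens(option)
        scores.append(len(query & tokens(context)) / max(len(query), 1))
    return max(range(len(scores)), key=scores.__getitem__)


question = "What organ pumps blood through the body?"
options = ["the heart", "the liver", "the lungs"]
contexts = [
    "The heart pumps blood through the circulatory system of the body.",
    "The liver filters toxins and produces bile.",
    "The lungs exchange oxygen and carbon dioxide.",
]
print(predict(question, options, contexts))  # 0 -> "the heart"
```

Accuracy over a labeled set of such (question, options, contexts) triples is the usual evaluation metric for this task.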

Papers

Showing 1-25 of 65 papers

Title | Status | Hype
Llama 2: Open Foundation and Fine-Tuned Chat Models | Code | 8
Training Compute-Optimal Large Language Models | Code | 6
Galactica: A Large Language Model for Science | Code | 4
MEDITRON-70B: Scaling Medical Pretraining for Large Language Models | Code | 4
Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Code | 2
MedMCQA : A Large-scale Multi-Subject Multi-Choice Dataset for Medical domain Question Answering | Code | 2
PaLM: Scaling Language Modeling with Pathways | Code | 2
Counterfactual Variable Control for Robust and Interpretable Question Answering | Code | 1
Large Language Models Encode Clinical Knowledge | Code | 1
IndicNLPSuite: Monolingual Corpora, Evaluation Benchmarks and Pre-trained Multilingual Language Models for Indian Languages | Code | 1
QuALITY: Question Answering with Long Input Texts, Yes! | Code | 1
Variational Open-Domain Question Answering | Code | 1
Clues Before Answers: Generation-Enhanced Multiple-Choice QA | Code | 1
Fool Your (Vision and) Language Model With Embarrassingly Simple Permutations | Code | 1
Towards Expert-Level Medical Question Answering with Large Language Models | Code | 1
Can large language models reason about medical questions? | Code | 1
Leveraging Large Language Models for Multiple Choice Question Answering | Code | 1
AdaMoLE: Fine-Tuning Large Language Models with Adaptive Mixture of Low-Rank Adaptation Experts | Code | 1
LexGLUE: A Benchmark Dataset for Legal Language Understanding in English | Code | 1
M3KE: A Massive Multi-Level Multi-Subject Knowledge Evaluation Benchmark for Chinese Large Language Models | Code | 1
CP-Router: An Uncertainty-Aware Router Between LLM and LRM | | 0
KorMedMCQA: Multi-Choice Question Answering Benchmark for Korean Healthcare Professional Licensing Examinations | | 0
Context Modeling with Evidence Filter for Multiple Choice Question Answering | | 0
Context-guided Triple Matching for Multiple Choice Question Answering | | 0
Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions | | 0

No leaderboard results yet.