Multiple-choice

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1051–1100 of 1107 papers

Title	Date	Tasks	Status
Beyond English-Only Reading Comprehension: Experiments in Zero-Shot Multilingual Transfer for Bulgarian	Aug 5, 2019	Multiple-choicePhilosophy	CodeCode Available
A quantitative study of NLP approaches to question difficulty estimation	May 17, 2023	MathMultiple-choice	CodeCode Available
Unified Question Answering in Slovene	Nov 16, 2022	Cross-Lingual TransferDecoder	CodeCode Available
Neural Natural Logic Inference for Interpretable Question Answering	Nov 1, 2021	Multiple-choiceNatural Language Inference	CodeCode Available
Limited Ability of LLMs to Simulate Human Psychological Behaviours: a Psychometric Analysis	May 12, 2024	Multiple-choiceQuestion Answering	CodeCode Available
FrenchMedMCQA: A French Multiple-Choice Question Answering Dataset for Medical domain	Apr 9, 2023	Multiple-choiceMultiple Choice Question Answering (MCQA)	CodeCode Available
Real-Time Automated Answer Scoring	Oct 13, 2022	Multiple-choice	CodeCode Available
Automated Generation and Tagging of Knowledge Components from Multiple-Choice Questions	May 30, 2024	Language ModellingLarge Language Model	CodeCode Available
LiveQA: A Question Answering Dataset over Sports Live	Oct 1, 2020	Multiple-choiceQuestion Answering	CodeCode Available
CASE: Commonsense-Augmented Score with an Expanded Answer Space	Nov 3, 2023	Multiple-choice	CodeCode Available
Which Shortcut Solution Do Question Answering Models Prefer to Learn?	Nov 29, 2022	Multiple-choiceQuestion Answering	CodeCode Available
From Recognition to Cognition: Visual Commonsense Reasoning	Nov 27, 2018	Multiple-choiceMultiple Choice Question Answering (MCQA)	CodeCode Available
FSBench: A Figure Skating Benchmark for Advancing Artistic Sports Understanding	Jan 1, 2025	Action RecognitionMultiple-choice	CodeCode Available
LLaVA-OneVision: Easy Visual Task Transfer	Aug 6, 2024	3D Question Answering (3D-QA)	CodeCode Available
Fusing Models with Complementary Expertise	Oct 2, 2023	Multiple-choicetext-classification	CodeCode Available
A Benchmark for Long-Form Medical Question Answering	Nov 14, 2024	Answer GenerationForm	CodeCode Available
Fùxì: A Benchmark for Evaluating Language Models on Ancient Chinese Text Understanding and Generation	Mar 20, 2025	Multiple-choiceText Generation	CodeCode Available
ReCoMIF: Reading comprehension based multi-source information fusion network for Chinese spoken language understanding	Aug 1, 2023	Intent DetectionMultiple-choice	CodeCode Available
NLP at UC Santa Cruz at SemEval-2024 Task 5: Legal Answer Validation using Few-Shot Multi-Choice QA	Apr 4, 2024	Multiple-choice	CodeCode Available
Gendered Pronoun Resolution using BERT and an extractive question answering formulation	Jun 9, 2019	coreference-resolutionCoreference Resolution	CodeCode Available
Noise Injection Reveals Hidden Capabilities of Sandbagging Language Models	Dec 2, 2024	MMLUMultiple-choice	CodeCode Available
Spoken Language Intelligence of Large Language Models for Language Learning	Aug 28, 2023	Language AcquisitionMultiple-choice	CodeCode Available
ReGraP-LLaVA: Reasoning enabled Graph-based Personalized Large Language and Vision Assistant	May 6, 2025	DescriptiveMultiple-choice	CodeCode Available
LLMs Are Not Intelligent Thinkers: Introducing Mathematical Topic Tree Benchmark for Comprehensive Evaluation of LLMs	Jun 7, 2024	Mathematical ReasoningMultiple-choice	CodeCode Available
Balancing Rigor and Utility: Mitigating Cognitive Biases in Large Language Models for Multiple-Choice Questions	Jun 16, 2024	Decision MakingLanguage Modelling	CodeCode Available
What Makes Reading Comprehension Questions Difficult?	Mar 12, 2022	Logical ReasoningMultiple-choice	CodeCode Available
Wait, that's not an option: LLMs Robustness with Incorrect Multiple-Choice Options	Aug 27, 2024	Decision MakingMultiple-choice	CodeCode Available
COLUMBUS: Evaluating COgnitive Lateral Understanding through Multiple-choice reBUSes	Sep 6, 2024	Multiple-choiceQuestion Answering	CodeCode Available
An Information-Theoretic Approach to Analyze NLP Classification Tasks	Feb 1, 2024	Multiple-choiceReading Comprehension	CodeCode Available
World Knowledge in Multiple Choice Reading Comprehension	Nov 13, 2022	General KnowledgeMultiple-choice	CodeCode Available
NoVo: Norm Voting off Hallucinations with Attention Heads in Large Language Models	Oct 11, 2024	Multiple-choiceTruthfulQA	CodeCode Available
Are Large Language Models Consistent over Value-laden Questions?	Jul 3, 2024	Multiple-choice	CodeCode Available
Revisiting Visual Question Answering Baselines	Jun 27, 2016	Binary ClassificationMultiple-choice	CodeCode Available
LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models	Oct 13, 2024	HallucinationHallucination Evaluation	CodeCode Available
BUCA: A Binary Classification Approach to Unsupervised Commonsense Question Answering	May 25, 2023	Binary ClassificationKnowledge Graphs	CodeCode Available
Automatic Generation and Evaluation of Reading Comprehension Test Items with Large Language Models	Apr 11, 2024	Multiple-choiceReading Comprehension	CodeCode Available
Are Vision LLMs Road-Ready? A Comprehensive Benchmark for Safety-Critical Driving Video Understanding	Apr 20, 2025	Autonomous DrivingImage Captioning	CodeCode Available
Abductive Commonsense Reasoning	Aug 15, 2019	Multiple-choiceNatural Language Inference	CodeCode Available
A Multiple Choices Reading Comprehension Corpus for Vietnamese Language Education	Mar 31, 2023	ArticlesMachine Reading Comprehension	CodeCode Available
When an LLM is apprehensive about its answers -- and when its uncertainty is justified	Mar 3, 2025	MathMMLU	CodeCode Available
Grade Score: Quantifying LLM Performance in Option Selection	Jun 17, 2024	Decision MakingFairness	CodeCode Available
Look at the Text: Instruction-Tuned Language Models are More Robust Multiple Choice Selectors than You Think	Apr 12, 2024	Multiple-choice	CodeCode Available
This Is Your Doge, If It Please You: Exploring Deception and Robustness in Mixture of LLMs	Mar 7, 2025	Large Language ModelMultiple-choice	CodeCode Available
StoryAnalogy: Deriving Story-level Analogies from Large Language Models to Unlock Analogical Understanding	Oct 19, 2023	Multiple-choiceNatural Language Understanding	CodeCode Available
Grounding Synthetic Data Evaluations of Language Models in Unsupervised Document Corpora	May 13, 2025	BenchmarkingDiagnostic	CodeCode Available
From Multiple-Choice to Extractive QA: A Case Study for English and Arabic	Apr 26, 2024	BelebeleExtractive Question-Answering	CodeCode Available
ARR: Question Answering with Large Language Models via Analyzing, Retrieving, and Reasoning	Feb 7, 2025	Multiple-choiceQuestion Answering	CodeCode Available
Strengthened Symbol Binding Makes Large Language Models Reliable Multiple-Choice Selectors	Jun 3, 2024	Multiple-choiceSelection bias	CodeCode Available
QMOS: Enhancing LLMs for Telecommunication with Question Masked loss and Option Shuffling	Sep 21, 2024	Multiple-choicePrompt Engineering	CodeCode Available
Truth Knows No Language: Evaluating Truthfulness Beyond English	Feb 13, 2025	InformativenessMachine Translation	CodeCode Available

Show:10 25 50

← PrevPage 22 of 23Next →

No leaderboard results yet.