Multiple-choice

Papers

Showing 1001–1050 of 1107 papers

Title	Date	Tasks	Status
Answer-level Calibration for Free-form Multiple Choice Question Answering	May 1, 2022	Form, Language Modeling	Code Available
Sentence Embeddings for Russian NLU	Oct 29, 2019	Multiple-choice, Paraphrase Identification	Code Available
Language Models as Knowledge Bases for Visual Word Sense Disambiguation	Oct 3, 2023	Image Captioning, Multiple-choice	Code Available
Multimodal Residual Learning for Visual QA	Jun 5, 2016	Multiple-choice, Question Answering	Code Available
QASC: A Dataset for Question Answering via Sentence Composition	Oct 25, 2019	Common Sense Reasoning, Multi-hop Question Answering	Code Available
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation	Aug 15, 2017	Language Modeling, Language Modelling	Code Available
Simulating Training Data Leakage in Multiple-Choice Benchmarks for LLM Evaluation	May 30, 2025	Continual Pretraining, Fairness	Code Available
Every Answer Matters: Evaluating Commonsense with Probabilistic Measures	Jun 6, 2024	Common Sense Reasoning, Language Modeling	Code Available
Evidence Sentence Extraction for Machine Reading Comprehension	Feb 23, 2019	Machine Reading Comprehension, Multiple-choice	Code Available
BertaQA: How Much Do Language Models Know About Local Culture?	Jun 11, 2024	Multiple-choice, Transfer Learning	Code Available
EXAMS-V: A Multi-Discipline Multilingual Multimodal Exam Benchmark for Evaluating Vision Language Models	Mar 15, 2024	Miscellaneous, Multiple-choice	Code Available
SNS-Bench-VL: Benchmarking Multimodal Large Language Models in Social Networking Services	May 29, 2025	Benchmarking, Information Retrieval	Code Available
BERT-based distractor generation for Swedish reading comprehension questions using a small-scale dataset	Aug 9, 2021	Distractor Generation, Multiple-choice	Code Available
Quantitative Assessment of Intersectional Empathetic Bias and Understanding	Nov 8, 2024	Multiple-choice	Code Available
Explanatory Argument Extraction of Correct Answers in Resident Medical Exams	Dec 1, 2023	Multiple-choice	Code Available
Multiple Choice Questions and Large Languages Models: A Case Study with Fictional Medical Data	Jun 4, 2024	Clinical Knowledge, Multiple-choice	Code Available
Cascading Biases: Investigating the Effect of Heuristic Annotation Strategies on Data and Models	Oct 24, 2022	Multiple-choice, Reading Comprehension	Code Available
Automated Distractor and Feedback Generation for Math Multiple-choice Questions via In-context Learning	Aug 7, 2023	In-Context Learning, Math	Code Available
Exploring Automated Distractor Generation for Math Multiple-choice Questions via Large Language Models	Apr 2, 2024	Distractor Generation, In-Context Learning	Code Available
Exploring Iterative Enhancement for Improving Learnersourced Multiple-Choice Question Explanations with Large Language Models	Sep 19, 2023	Explanation Generation, Language Modelling	Code Available
Question Answering as Global Reasoning over Semantic Abstractions	Jun 9, 2019	Information Retrieval, Multiple-choice	Code Available
KnowledgePrompts: Exploring the Abilities of Large Language Models to Solve Proportional Analogies via Knowledge-Enhanced Prompting	Dec 1, 2024	Multiple-choice, Multiple Choice Question Answering (MCQA)	Code Available
Multiple Hypothesis Dropout: Estimating the Parameters of Multi-Modal Output Distributions	Dec 18, 2023	Multiple-choice, Pedestrian Trajectory Prediction	Code Available
Question-Aware Knowledge Graph Prompting for Enhancing Large Language Models	Mar 30, 2025	Knowledge Graphs, Multiple-choice	Code Available
An Automatic Question Usability Evaluation Toolkit	May 30, 2024	Multiple-choice, Word Embeddings	Code Available
SocialIQA: Commonsense Reasoning about Social Interactions	Apr 22, 2019	Common Sense Reasoning, Coreference Resolution	Code Available
Questioning the Survey Responses of Large Language Models	Jun 13, 2023	Multiple-choice, Survey	Code Available
Exposing the Limits of Video-Text Models through Contrast Sets	Jul 1, 2022	Language Modeling, Language Modelling	Code Available
Extracting Keywords from Open-Ended Business Survey Questions	Aug 31, 2018	Multiple-choice, Survey	Code Available
Question-Instructed Visual Descriptions for Zero-Shot Video Question Answering	Feb 16, 2024	Language Modeling, Language Modelling	Code Available
Learning to Attend On Essential Terms: An Enhanced Retriever-Reader Model for Open-domain Question Answering	Aug 28, 2018	AI2 Reasoning Challenge, ARC	Code Available
Learning to Correction: Explainable Feedback Generation for Visual Commonsense Reasoning Distractor	Dec 8, 2024	Misconceptions, Multiple-choice	Code Available
Learning to Reuse Distractors to support Multiple Choice Question Generation in Education	Oct 25, 2022	Multiple-choice, Question Generation	Code Available
"My Answer is C": First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models	Feb 22, 2024	Multiple-choice, Text Generation	Code Available
Probabilities of Chat LLMs Are Miscalibrated but Still Predict Correctness on Multiple-Choice Q&A	Feb 20, 2024	Language Modelling, Large Language Model	Code Available
FAT ALBERT: Finding Answers in Large Texts using Semantic Similarity Attention Layer based on BERT	Aug 22, 2020	Multiple-choice, Question Answering	Code Available
Leaving the barn door open for Clever Hans: Simple features predict LLM benchmark answers	Oct 15, 2024	Multiple-choice	Code Available
TRACE: Transformer-based Risk Assessment for Clinical Evaluation	Nov 13, 2024	Decision Making, Missing Values	Code Available
LEAVS: An LLM-based Labeler for Abdominal CT Supervision	Mar 17, 2025	Anatomy, Large Language Model	Code Available
Wrong Answers Can Also Be Useful: PlausibleQA -- A Large-Scale QA Dataset with Answer Plausibility Scores	Feb 22, 2025	Distractor Generation, Information Retrieval	Code Available
Length Optimization in Conformal Prediction	Jun 27, 2024	Conformal Prediction, Language Modeling	Code Available
FIBER: Fill-in-the-Blanks as a Challenging Video Understanding Evaluation Framework	Apr 9, 2021	Language Modelling, Multiple-choice	Code Available
Training-free LLM Merging for Multi-task Learning	Jun 14, 2025	Multiple-choice, Multi-Task Learning	Code Available
Solving and Generating NPR Sunday Puzzles with Large Language Models	Jun 21, 2023	Multiple-choice, Prompt Engineering	Code Available
HSI: Head-Specific Intervention Can Induce Misaligned AI Coordination in Large Language Models	Feb 9, 2025	Answer Generation, Language Modeling	Code Available
Towards Efficient Methods in Medical Question Answering using Knowledge Graph Embeddings	Jan 15, 2024	Knowledge Graph Embeddings, Knowledge Graphs	Code Available
UnibucLLM: Harnessing LLMs for Automated Prediction of Item Difficulty and Response Time for Multiple-Choice Questions	Apr 20, 2024	Data Augmentation, Multiple-choice	Code Available
Solving Machine Learning Problems	Jul 2, 2021	BIG-bench Machine Learning, Data Augmentation	Code Available
Leveraging large language models for nano synthesis mechanism explanation: solid foundations or mere conjectures?	Jul 12, 2024	Logical Reasoning, Multiple-choice	Code Available
Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding	Jan 10, 2025	Automatic Speech Recognition, Classification	Code Available

No leaderboard results yet.