Multiple-choice

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 901–950 of 1107 papers

Title	Date	Tasks	Status
A Novel Multi-Stage Prompting Approach for Language Agnostic MCQ Generation using GPT	Jan 13, 2024	Distractor GenerationMultiple-choice	CodeCode Available
Can Large Language Models Provide Security & Privacy Advice? Measuring the Ability of LLMs to Refute Misconceptions	Oct 3, 2023	MisconceptionsMultiple-choice	CodeCode Available
DefAn: Definitive Answer Dataset for LLMs Hallucination Evaluation	Jun 13, 2024	BenchmarkingHallucination	CodeCode Available
ToMChallenges: A Principle-Guided Dataset and Diverse Evaluation Tasks for Exploring Theory of Mind	May 24, 2023	Multiple-choiceQuestion Answering	CodeCode Available
CLOMO: Counterfactual Logical Modification with Large Language Models	Nov 29, 2023	counterfactualCounterfactual Reasoning	CodeCode Available
IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark for LLMs	Nov 12, 2024	coreference-resolutionCoreference Resolution	CodeCode Available
DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?	Jun 18, 2024	Language ModelingLanguage Modelling	CodeCode Available
SecQA: A Concise Question-Answering Dataset for Evaluating Large Language Models in Computer Security	Dec 26, 2023	Computer SecurityMultiple-choice	CodeCode Available
What Ingredients Make for an Effective Crowdsourcing Protocol for Difficult NLU Data Collection Tasks?	Jun 1, 2021	Multiple-choiceNatural Language Understanding	CodeCode Available
Assessing the Quality of Multiple-Choice Questions Using GPT-4 and Rule-Based Methods	Jul 16, 2023	Multiple-choice	CodeCode Available
TAXI: Evaluating Categorical Knowledge Editing for Language Models	Apr 23, 2024	knowledge editingMultiple-choice	CodeCode Available
WiCkeD: A Simple Method to Make Multiple Choice Benchmarks More Challenging	Feb 25, 2025	MMLUMultiple-choice	CodeCode Available
What Makes Reading Comprehension Questions Easier?	Aug 28, 2018	Machine Reading ComprehensionMultiple-choice	CodeCode Available
Downstream Trade-offs of a Family of Text Watermarks	Nov 16, 2023	FormLanguage Modelling	CodeCode Available
Teach2Eval: An Indirect Evaluation Method for LLM by Judging How It Teaches	May 18, 2025	FairnessMemorization	CodeCode Available
A multimodal dataset for understanding the impact of mobile phones on remote online virtual education	Dec 13, 2024	EEGHead Pose Estimation	CodeCode Available
Utilize the Flow before Stepping into the Same River Twice: Certainty Represented Knowledge Flow for Refusal-Aware Instruction Tuning	Oct 9, 2024	HallucinationMultiple-choice	CodeCode Available
Can Model Uncertainty Function as a Proxy for Multiple-Choice Question Item Difficulty?	Jul 7, 2024	Multiple-choice	CodeCode Available
Differentiating Choices via Commonality for Multiple-Choice Question Answering	Aug 21, 2024	Multiple-choiceMultiple Choice Question Answering (MCQA)	CodeCode Available
Utilizing Background Knowledge for Robust Reasoning over Traffic Situations	Dec 4, 2022	Knowledge GraphsMultiple-choice	CodeCode Available
Difficult Task Yes but Simple Task No: Unveiling the Laziness in Multimodal LLMs	Oct 15, 2024	Image DescriptionMultiple-choice	CodeCode Available
Improving Machine Reading Comprehension with General Reading Strategies	Oct 31, 2018	ARCLanguage Modeling	CodeCode Available
A large language model-assisted education tool to provide feedback on open-ended responses	Jul 25, 2023	Language ModelingLanguage Modelling	CodeCode Available
DisGeM: Distractor Generation for Multiple Choice Questions with Span Masking	Sep 26, 2024	Distractor GenerationMultiple-choice	CodeCode Available
Analogical Reasoning Inside Large Language Models: Concept Vectors and the Limits of Abstraction	Mar 5, 2025	In-Context LearningMultiple-choice	CodeCode Available
Improving Question Answering with External Knowledge	Feb 3, 2019	ARCMultiple-choice	CodeCode Available
Distractor Generation for Multiple Choice Questions Using Learning to Rank	Jun 1, 2018	BIG-bench Machine LearningDistractor Generation	CodeCode Available
Distractor generation for multiple-choice questions with predictive prompting and large language models	Jul 30, 2023	Distractor GenerationMultiple-choice	CodeCode Available
A Study on Large Language Models' Limitations in Multiple-Choice Question Answering	Jan 15, 2024	Multiple-choiceQuestion Answering	CodeCode Available
MIRTT: Learning Multimodal Interaction Representations from Trilinear Transformers for Visual Question Answering	Nov 1, 2021	multimodal interactionMultiple-choice	CodeCode Available
INCEPTNET: Precise And Early Disease Detection Application For Medical Images Analyses	Sep 5, 2023	Cell DetectionLesion Segmentation	CodeCode Available
DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice Questions	Jun 27, 2024	Distractor GenerationMath	CodeCode Available
DLP-LoRA: Efficient Task-Specific LoRA Fusion with a Dynamic, Lightweight Plugin for Large Language Models	Oct 2, 2024	Multiple-choiceparameter-efficient fine-tuning	CodeCode Available
DMCL: Distillation Multiple Choice Learning for Multimodal Action Recognition	Dec 23, 2019	Action RecognitionMultiple-choice	CodeCode Available
Plausibly Problematic Questions in Multiple-Choice Benchmarks for Commonsense Reasoning	Oct 6, 2024	Multiple-choice	CodeCode Available
Affordably Fine-tuned LLMs Provide Better Answers to Course-specific MCQs	Jan 10, 2025	Multiple-choice	CodeCode Available
Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models	May 30, 2025	MathMultiple-choice	CodeCode Available
Does Multiple Choice Have a Future in the Age of Generative AI? A Posttest-only RCT	Dec 13, 2024	Multiple-choice	CodeCode Available
VarBench: Robust Language Model Benchmarking Through Dynamic Variable Perturbation	Jun 25, 2024	ARCBenchmarking	CodeCode Available
POE: Process of Elimination for Multiple Choice Reasoning	Oct 24, 2023	In-Context LearningLogical Reasoning	CodeCode Available
When Retriever-Reader Meets Scenario-Based Multiple-Choice Questions	Aug 31, 2021	Multiple-choiceQuestion Answering	CodeCode Available
UBENCH: Benchmarking Uncertainty in Large Language Models with Multiple Choice Questions	Jun 18, 2024	BenchmarkingMultiple-choice	CodeCode Available
MLaKE: Multilingual Knowledge Editing Benchmark for Large Language Models	Apr 7, 2024	Benchmarkingknowledge editing	CodeCode Available
A Joint Sequence Fusion Model for Video Question Answering and Retrieval	Aug 7, 2018	DecoderMultiple-choice	CodeCode Available
Vision-Language and Large Language Model Performance in Gastroenterology: GPT, Claude, Llama, Phi, Mistral, Gemma, and Quantized Models	Aug 25, 2024	Language ModelingLanguage Modelling	CodeCode Available
ChatGPT for GTFS: Benchmarking LLMs on GTFS Understanding and Retrieval	Aug 4, 2023	BenchmarkingInformation Retrieval	CodeCode Available
A Profit-Maximizing Strategy for Advertising on the e-Commerce Platforms	Oct 31, 2022	ManagementMultiple-choice	CodeCode Available
DREAM: A Challenge Dataset and Models for Dialogue-Based Reading Comprehension	Feb 1, 2019	Dialogue UnderstandingMultiple-choice	CodeCode Available
Introducing a framework to assess newly created questions with Natural Language Processing	Apr 28, 2020	Multiple-choice	CodeCode Available
Introducing Flexible Monotone Multiple Choice Item Response Theory Models and Bit Scales	Oct 2, 2024	Multiple-choice	CodeCode Available

Show:10 25 50

← PrevPage 19 of 23Next →

No leaderboard results yet.