Multiple-choice

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 651–675 of 1107 papers

Title	Date	Tasks	Status
Predicting the Difficulty of Multiple Choice Questions in a High-stakes Medical Exam	Aug 1, 2019	Multiple-choiceQuestion Answering	—Unverified
Predictions from language models for multiple-choice tasks are not robust under variation of scoring methods	Mar 1, 2024	Multiple-choice	—Unverified
Probabilistic Consensus through Ensemble Validation: A Framework for LLM Reliability	Nov 10, 2024	Multiple-choiceText Generation	—Unverified
Prompt Engineering and Calibration for Zero-Shot Commonsense Reasoning	Apr 14, 2023	Multiple-choicePrompt Engineering	—Unverified
Prompting Implicit Discourse Relation Annotation	Feb 7, 2024	ClassificationImplicit Discourse Relation Classification	—Unverified
Instruction Fine-Tuning: Does Prompt Loss Matter?	Jan 24, 2024	Multiple-choicetoken-classification	—Unverified
ProverbEval: Exploring LLM Evaluation Challenges for Low-resource Language Understanding	Nov 7, 2024	BenchmarkingMultiple-choice	—Unverified
ConceptPsy:A Benchmark Suite with Conceptual Comprehensiveness in Psychology	Nov 16, 2023	MMLUMultiple-choice	—Unverified
PUB: A Pragmatics Understanding Benchmark for Assessing LLMs' Pragmatics Capabilities	Jan 13, 2024	Instruction FollowingMultiple-choice	—Unverified
Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs	Sep 30, 2024	BenchmarkingMultiple-choice	—Unverified
Q-Bench-Video: Benchmark the Video Quality Understanding of LMMs	Jan 1, 2025	Multiple-choiceVideo Generation	—Unverified
QOG:Question and Options Generation based on Language Model	Jun 18, 2024	Information RetrievalLanguage Modeling	—Unverified
QRMeM: Unleash the Length Limitation through Question then Reflection Memory Mechanism	Jun 19, 2024	Multiple-choiceQuestion Answering	—Unverified
VisNumBench: Evaluating Number Sense of Multimodal Large Language Models	Mar 19, 2025	Multiple-choice	—Unverified
Query Rewriting for Retrieval-Augmented Large Language Models	May 23, 2023	Language ModelingLanguage Modelling	—Unverified
Question Difficulty Ranking for Multiple-Choice Reading Comprehension	Apr 16, 2024	Multiple-choiceReading Comprehension	—Unverified
Question-type Identification for Academic Questions in Online Learning Platform	Nov 24, 2022	Binary ClassificationMultiple-choice	—Unverified
Visual7W: Grounded Question Answering in Images	Nov 11, 2015	Multiple-choiceMultiple Choice Question Answering (MCQA)	—Unverified
Ranking Facts for Explaining Answers to Elementary Science Questions	Oct 18, 2021	Interpretable Machine LearningLearning-To-Rank	—Unverified
Ranking Large Language Models without Ground Truth	Feb 21, 2024	Multiple-choiceTriplet	—Unverified
Read, Retrospect, Select: An MRC Framework to Short Text Entity Linking	Jan 7, 2021	Entity LinkingMachine Reading Comprehension	—Unverified
RECAP-KG: Mining Knowledge Graphs from Raw GP Notes for Remote COVID-19 Assessment in Primary Care	Jun 17, 2023	Decision Makinggraph construction	—Unverified
Receptivity of an AI Cognitive Assistant by the Radiology Community: A Report on Data Collected at RSNA	Sep 13, 2020	Multiple-choiceQuestion Answering	—Unverified
Recurrent and Contextual Models for Visual Question Answering	Mar 23, 2017	DiversityMultiple-choice	—Unverified
Visual Madlibs: Fill in the Blank Description Generation and Question Answering	Dec 1, 2015	Multiple-choiceQuestion Answering	—Unverified

Show:10 25 50

← PrevPage 27 of 45Next →

No leaderboard results yet.