SOTAVerified

Multiple-choice

Papers

Showing 651675 of 1107 papers

TitleStatusHype
Predicting the Difficulty of Multiple Choice Questions in a High-stakes Medical Exam0
Predictions from language models for multiple-choice tasks are not robust under variation of scoring methods0
Probabilistic Consensus through Ensemble Validation: A Framework for LLM Reliability0
Prompt Engineering and Calibration for Zero-Shot Commonsense Reasoning0
Prompting Implicit Discourse Relation Annotation0
Instruction Fine-Tuning: Does Prompt Loss Matter?0
ProverbEval: Exploring LLM Evaluation Challenges for Low-resource Language Understanding0
ConceptPsy:A Benchmark Suite with Conceptual Comprehensiveness in Psychology0
PUB: A Pragmatics Understanding Benchmark for Assessing LLMs' Pragmatics Capabilities0
Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs0
Q-Bench-Video: Benchmark the Video Quality Understanding of LMMs0
QOG:Question and Options Generation based on Language Model0
QRMeM: Unleash the Length Limitation through Question then Reflection Memory Mechanism0
VisNumBench: Evaluating Number Sense of Multimodal Large Language Models0
Query Rewriting for Retrieval-Augmented Large Language Models0
Question Difficulty Ranking for Multiple-Choice Reading Comprehension0
Question-type Identification for Academic Questions in Online Learning Platform0
Visual7W: Grounded Question Answering in Images0
Ranking Facts for Explaining Answers to Elementary Science Questions0
Ranking Large Language Models without Ground Truth0
Read, Retrospect, Select: An MRC Framework to Short Text Entity Linking0
RECAP-KG: Mining Knowledge Graphs from Raw GP Notes for Remote COVID-19 Assessment in Primary Care0
Receptivity of an AI Cognitive Assistant by the Radiology Community: A Report on Data Collected at RSNA0
Recurrent and Contextual Models for Visual Question Answering0
Visual Madlibs: Fill in the Blank Description Generation and Question Answering0
Show:102550
← PrevPage 27 of 45Next →

No leaderboard results yet.