SOTAVerified

Multiple-choice

Papers

Showing 10011025 of 1107 papers

TitleStatusHype
Answer-level Calibration for Free-form Multiple Choice Question AnsweringCode0
Sentence Embeddings for Russian NLUCode0
Language Models as Knowledge Bases for Visual Word Sense DisambiguationCode0
Multimodal Residual Learning for Visual QACode0
QASC: A Dataset for Question Answering via Sentence CompositionCode0
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic SegmentationCode0
Simulating Training Data Leakage in Multiple-Choice Benchmarks for LLM EvaluationCode0
Every Answer Matters: Evaluating Commonsense with Probabilistic MeasuresCode0
Evidence Sentence Extraction for Machine Reading ComprehensionCode0
BertaQA: How Much Do Language Models Know About Local Culture?Code0
EXAMS-V: A Multi-Discipline Multilingual Multimodal Exam Benchmark for Evaluating Vision Language ModelsCode0
SNS-Bench-VL: Benchmarking Multimodal Large Language Models in Social Networking ServicesCode0
BERT-based distractor generation for Swedish reading comprehension questions using a small-scale datasetCode0
Quantitative Assessment of Intersectional Empathetic Bias and UnderstandingCode0
Explanatory Argument Extraction of Correct Answers in Resident Medical ExamsCode0
Multiple Choice Questions and Large Languages Models: A Case Study with Fictional Medical DataCode0
Cascading Biases: Investigating the Effect of Heuristic Annotation Strategies on Data and ModelsCode0
Automated Distractor and Feedback Generation for Math Multiple-choice Questions via In-context LearningCode0
Exploring Automated Distractor Generation for Math Multiple-choice Questions via Large Language ModelsCode0
Exploring Iterative Enhancement for Improving Learnersourced Multiple-Choice Question Explanations with Large Language ModelsCode0
Question Answering as Global Reasoning over Semantic AbstractionsCode0
KnowledgePrompts: Exploring the Abilities of Large Language Models to Solve Proportional Analogies via Knowledge-Enhanced PromptingCode0
Multiple Hypothesis Dropout: Estimating the Parameters of Multi-Modal Output DistributionsCode0
Question-Aware Knowledge Graph Prompting for Enhancing Large Language ModelsCode0
An Automatic Question Usability Evaluation ToolkitCode0
Show:102550
← PrevPage 41 of 45Next →

No leaderboard results yet.