SOTAVerified

Multiple-choice

Papers

Showing 526550 of 1107 papers

TitleStatusHype
LMVE at SemEval-2020 Task 4: Commonsense Validation and Explanation using Pretraining Language Model0
Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice Capabilities in Chinchilla0
Benchmarking Bias in Large Language Models during Role-Playing0
Document-level Event Factuality Identification via Machine Reading Comprehension Frameworks with Transfer Learning0
DMind Benchmark: Toward a Holistic Assessment of LLM Capabilities across the Web3 Domain0
A Corpus of Text Data and Gaze Fixations from Autistic and Non-Autistic Adults0
Large Language Models Still Exhibit Bias in Long Text0
DiverseNet: When One Right Answer is not Enough0
Being Negative but Constructively: Lessons Learnt from Creating Better Visual Question Answering Datasets0
Learning a Word-Level Language Model with Sentence-Level Noise Contrastive Estimation for Contextual Sentence Probability Estimation0
Distributional semantics beyond words: Supervised learning of analogy and paraphrase0
Distractor Generation in Multiple-Choice Tasks: A Survey of Methods, Datasets, and Evaluation0
Bayesian Statistical Modeling with Predictors from LLMs0
A Weak Supervision Approach for Predicting Difficulty of Technical Interview Questions0
Large Language Models (GPT) Struggle to Answer Multiple-Choice Questions about Code0
Large Language Models Often Know When They Are Being Evaluated0
Distractor Analysis and Selection for Multiple-Choice Cloze Questions for Second-Language Learners0
DISTO: Evaluating Textual Distractors for Multi-Choice Questions using Negative Sampling based Approach0
Auxiliary Class Based Multiple Choice Learning0
Disaggregating Hops: Can We Guide a Multi-Hop Reasoning Language Model to Incrementally Learn at each Hop?0
An Improved Traditional Chinese Evaluation Suite for Foundation Model0
A Foundational Multimodal Vision Language AI Assistant for Human Pathology0
Large Language Models Sensitivity to The Order of Options in Multiple-Choice Questions0
Learning Language-Visual Embedding for Movie Understanding with Natural-Language0
Digital Comprehensibility Assessment of Simplified Texts among Persons with Intellectual Disabilities0
Show:102550
← PrevPage 22 of 45Next →

No leaderboard results yet.