SOTAVerified

Multiple-choice

Papers

Showing 271-280 of 1107 papers

Title | Status | Hype
KnowledgePrompts: Exploring the Abilities of Large Language Models to Solve Proportional Analogies via Knowledge-Enhanced Prompting | Code | 0
Cognitive Biases in Large Language Models: A Survey and Mitigation Experiments | | 0
Perception Test 2024: Challenge Summary and a Novel Hour-Long VideoQA Benchmark | | 0
Applying IRT to Distinguish Between Human and Generative AI Responses to Multiple-Choice Assessments | | 0
Sparse Attention Vectors: Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers | | 0
Multiple Choice Learning for Efficient Speech Separation with Many Speakers | | 0
CHOICE: Benchmarking the Remote Sensing Capabilities of Large Vision-Language Models | Code | 1
NEMO: Can Multimodal LLMs Identify Attribute-Modified Objects? | | 0
GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis | | 0
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages | Code | 1

No leaderboard results yet.