SOTAVerified

Multiple-choice

Papers

Showing 531540 of 1107 papers

TitleStatusHype
Can multiple-choice questions really be useful in detecting the abilities of LLMs?Code0
Understanding Long Videos with Multimodal Language ModelsCode2
IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language ModelsCode1
LHMKE: A Large-scale Holistic Multi-subject Knowledge Evaluation Benchmark for Chinese Large Language Models0
Pragmatic Competence Evaluation of Large Language Models for the Korean LanguageCode0
Enhancing Event Causality Identification with Rationale and Structure-Aware Causal Question Answering0
Few-Shot Image Classification and Segmentation as Visual Question Answering Using Vision-Language Models0
EXAMS-V: A Multi-Discipline Multilingual Multimodal Exam Benchmark for Evaluating Vision Language ModelsCode0
Towards Diverse Perspective Learning with Selection over Multiple Temporal PoolingsCode0
AraTrust: An Evaluation of Trustworthiness for LLMs in Arabic0
Show:102550
← PrevPage 54 of 111Next →

No leaderboard results yet.