SOTAVerified

Multiple-choice

Papers

Showing 10811090 of 1107 papers

TitleStatusHype
NoVo: Norm Voting off Hallucinations with Attention Heads in Large Language ModelsCode0
Are Large Language Models Consistent over Value-laden Questions?Code0
Revisiting Visual Question Answering BaselinesCode0
LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language ModelsCode0
BUCA: A Binary Classification Approach to Unsupervised Commonsense Question AnsweringCode0
Automatic Generation and Evaluation of Reading Comprehension Test Items with Large Language ModelsCode0
Are Vision LLMs Road-Ready? A Comprehensive Benchmark for Safety-Critical Driving Video UnderstandingCode0
Abductive Commonsense ReasoningCode0
A Multiple Choices Reading Comprehension Corpus for Vietnamese Language EducationCode0
When an LLM is apprehensive about its answers -- and when its uncertainty is justifiedCode0
Show:102550
← PrevPage 109 of 111Next →

No leaderboard results yet.