SOTAVerified

Multiple-choice

Papers

Showing 171-180 of 1107 papers

Title | Status | Hype
Constructing Narrative Event Evolutionary Graph for Script Event Prediction | Code | 1
Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers | Code | 1
Logic-Guided Data Augmentation and Regularization for Consistent Question Answering | Code | 1
LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images? | Code | 1
CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning | Code | 1
IndicNLPSuite: Monolingual Corpora, Evaluation Benchmarks and Pre-trained Multilingual Language Models for Indian Languages | Code | 1
IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Language Models in E-commerce | Code | 1
M3KE: A Massive Multi-Level Multi-Subject Knowledge Evaluation Benchmark for Chinese Large Language Models | Code | 1
Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting | Code | 1
HCQA @ Ego4D EgoSchema Challenge 2024 | Code | 1

No leaderboard results yet.