SOTAVerified

Multiple-choice

Papers

Showing 851860 of 1107 papers

TitleStatusHype
CinePile: A Long Video Question Answering Dataset and Benchmark0
Cleared for Takeoff? Compositional & Conditional Reasoning may be the Achilles Heel to (Flight-Booking) Language Agents0
ClinBench-HPB: A Clinical Benchmark for Evaluating LLMs in Hepato-Pancreato-Biliary Diseases0
An Experimental Study of Deep Neural Network Models for Vietnamese Multiple-Choice Reading Comprehension0
CLIP-UP: CLIP-Based Unanswerable Problem Detection for Visual Question Answering0
Clozer: Adaptable Data Augmentation for Cloze-style Reading Comprehension0
Clozer”:" Adaptable Data Augmentation for Cloze-style Reading Comprehension0
Think you have Solved Direct-Answer Question Answering? Try ARC-DA, the Direct-Answer AI2 Reasoning Challenge0
A New Era: Intelligent Tutoring Systems Will Transform Online Learning for Millions0
CoddLLM: Empowering Large Language Models for Data Analytics0
Show:102550
← PrevPage 86 of 111Next →

No leaderboard results yet.