SOTAVerified

Multiple-choice

Papers

Showing 851875 of 1107 papers

TitleStatusHype
CinePile: A Long Video Question Answering Dataset and Benchmark0
Cleared for Takeoff? Compositional & Conditional Reasoning may be the Achilles Heel to (Flight-Booking) Language Agents0
ClinBench-HPB: A Clinical Benchmark for Evaluating LLMs in Hepato-Pancreato-Biliary Diseases0
An Experimental Study of Deep Neural Network Models for Vietnamese Multiple-Choice Reading Comprehension0
CLIP-UP: CLIP-Based Unanswerable Problem Detection for Visual Question Answering0
Clozer: Adaptable Data Augmentation for Cloze-style Reading Comprehension0
Clozer”:" Adaptable Data Augmentation for Cloze-style Reading Comprehension0
Think you have Solved Direct-Answer Question Answering? Try ARC-DA, the Direct-Answer AI2 Reasoning Challenge0
A New Era: Intelligent Tutoring Systems Will Transform Online Learning for Millions0
CoddLLM: Empowering Large Language Models for Data Analytics0
CodeReviewQA: The Code Review Comprehension Assessment for Large Language Models0
COGNET-MD, an evaluation framework and dataset for Large Language Model benchmarks in the medical domain0
Cognitive Biases in Large Language Models: A Survey and Mitigation Experiments0
Collaboration among Multiple Large Language Models for Medical Question Answering0
Thrilled by Your Progress! Large Language Models (GPT-4) No Longer Struggle to Pass Assessments in Higher Education Programming Courses0
Combinatorial framework for planning in geological exploration0
Combining Multiple Cues for Visual Madlibs Question Answering0
Comparative Study of Learning Outcomes for Online Learning Platforms0
Thunder-NUBench: A Benchmark for LLMs' Sentence-Level Negation Understanding0
Confidence-Aware Learning Assistant0
You Can Do Better! If You Elaborate the Reason When Making Prediction0
Context-guided Triple Matching for Multiple Choice Question Answering0
Context-guided Triple Matching for Multiple Choice Question Answering0
Context Modeling with Evidence Filter for Multiple Choice Question Answering0
Contextual Response Interpretation for Automated Structured Interviews: A Case Study in Market Research0
Show:102550
← PrevPage 35 of 45Next →

No leaderboard results yet.