SOTAVerified

Multiple-choice

Papers

Showing 691700 of 1107 papers

TitleStatusHype
Open-vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering ModelsCode1
EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language UnderstandingCode1
A Comparative Study of Open-Source Large Language Models, GPT-4 and Claude 2: Multiple-Choice Test Taking in Nephrology0
Automated Distractor and Feedback Generation for Math Multiple-choice Questions via In-context LearningCode0
ChatGPT for GTFS: Benchmarking LLMs on GTFS Understanding and RetrievalCode0
ReCoMIF: Reading comprehension based multi-source information fusion network for Chinese spoken language understandingCode0
MovieChat: From Dense Token to Sparse Memory for Long Video UnderstandingCode2
Distractor generation for multiple-choice questions with predictive prompting and large language modelsCode0
SEED-Bench: Benchmarking Multimodal LLMs with Generative ComprehensionCode2
A large language model-assisted education tool to provide feedback on open-ended responsesCode0
Show:102550
← PrevPage 70 of 111Next →

No leaderboard results yet.