SOTAVerified

Multiple-choice

Papers

Showing 651-660 of 1107 papers

| Title | Status | Hype |
| --- | --- | --- |
| Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive? | | 0 |
| Analyzing the Performance of ChatGPT in Cardiology and Vascular Pathologies | | 0 |
| Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information | | 0 |
| HFL-RC System at SemEval-2018 Task 11: Hybrid Multi-Aspects Model for Commonsense Reading Comprehension | | 0 |
| Hierarchical Divide-and-Conquer for Fine-Grained Alignment in LLM-Based Medical Evaluation | | 0 |
| HindiLLM: Large Language Model for Hindi | | 0 |
| Analyzing Multiple-Choice Reading and Listening Comprehension Tests | | 0 |
| How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites | | 0 |
| How Far Can Off-the-Shelf Multimodal Large Language Models Go in Online Episodic Memory Question Answering? | | 0 |
| How Many Workers to Ask? Adaptive Exploration for Collecting High Quality Labels | | 0 |

No leaderboard results yet.