SOTAVerified

Multiple-choice

Papers

Showing 121130 of 1107 papers

TitleStatusHype
LEAVS: An LLM-based Labeler for Abdominal CT SupervisionCode0
MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific ResearchCode1
Chat-TS: Enhancing Multi-Modal Reasoning Over Time-Series and Natural Language Data0
It is Too Many Options: Pitfalls of Multiple-Choice Questions in Generative AI and Medical Education0
The Impact of Item-Writing Flaws on Difficulty and Discrimination in Item Response Theory0
SeqSAM: Autoregressive Multiple Hypothesis Prediction for Medical Image Segmentation using SAMCode0
Mellow: a small audio language model for reasoningCode2
Identity Lock: Locking API Fine-tuned LLMs With Identity-based Wake Words0
VisBias: Measuring Explicit and Implicit Social Biases in Vision Language ModelsCode0
Social Bias Benchmark for Generation: A Comparison of Generation and QA-Based Evaluations0
Show:102550
← PrevPage 13 of 111Next →

No leaderboard results yet.