SOTAVerified

Multiple-choice

Papers

Showing 201210 of 1107 papers

TitleStatusHype
On the Reasoning Capacity of AI Models and How to Quantify It0
The AI Penalization Effect: People Reduce Compensation for Workers Who Use AI0
Patent Figure Classification using Large Vision-language ModelsCode0
Generating Plausible Distractors for Multiple-Choice Questions via Student Choice Prediction0
MedS^3: Towards Medical Small Language Models with Self-Evolved Slow ThinkingCode2
Can Multimodal LLMs do Visual Temporal Understanding and Reasoning? The answer is No!0
FaceXBench: Evaluating Multimodal LLMs on Face UnderstandingCode1
Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong0
Empowering Large Language Models in Wireless Communication: A Novel Dataset and Fine-Tuning Framework0
Vision-Language Models Do Not Understand Negation0
Show:102550
← PrevPage 21 of 111Next →

No leaderboard results yet.