SOTAVerified

Multiple-choice

Papers

Showing 711720 of 1107 papers

TitleStatusHype
When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model LeaderboardsCode0
An Information-Theoretic Approach to Analyze NLP Classification TasksCode0
Evaluating LLM -- Generated Multimodal Diagnosis from Medical Images and Symptom Analysis0
Towards Collective Superintelligence: Amplifying Group IQ using Conversational Swarms0
Instruction Fine-Tuning: Does Prompt Loss Matter?0
What Large Language Models Know and What People Think They Know0
Towards Efficient Methods in Medical Question Answering using Knowledge Graph EmbeddingsCode0
A Study on Large Language Models' Limitations in Multiple-Choice Question AnsweringCode0
Assessing Large Language Models in Mechanical Engineering Education: A Study on Mechanics-Focused Conceptual Understanding0
Automated Answer Validation using Text Similarity0
Show:102550
← PrevPage 72 of 111Next →

No leaderboard results yet.