SOTAVerified

Multiple-choice

Papers

Showing 601610 of 1107 papers

TitleStatusHype
Instruction Fine-Tuning: Does Prompt Loss Matter?0
A Study on Large Language Models' Limitations in Multiple-Choice Question AnsweringCode0
Towards Efficient Methods in Medical Question Answering using Knowledge Graph EmbeddingsCode0
Assessing Large Language Models in Mechanical Engineering Education: A Study on Mechanics-Focused Conceptual Understanding0
Automated Answer Validation using Text Similarity0
PUB: A Pragmatics Understanding Benchmark for Assessing LLMs' Pragmatics Capabilities0
A Novel Multi-Stage Prompting Approach for Language Agnostic MCQ Generation using GPTCode0
The Benefits of a Concise Chain of Thought on Problem-Solving in Large Language ModelsCode1
A Joint-Reasoning based Disease Q&A System0
SEED-Bench: Benchmarking Multimodal Large Language ModelsCode3
Show:102550
← PrevPage 61 of 111Next →

No leaderboard results yet.