SOTAVerified

Multiple-choice

Papers

Showing 551575 of 1107 papers

TitleStatusHype
MedKP: Medical Dialogue with Knowledge Enhancement and Clinical Pathway Encoding0
A Method for Building a Commonsense Inference Dataset based on Basic Events0
Unveiling Cultural Blind Spots: Analyzing the Limitations of mLLMs in Procedural Text Comprehension0
Med-RLVR: Emerging Medical Reasoning from a 3B base model via reinforcement Learning0
AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning0
UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban Spaces0
AlignMMBench: Evaluating Chinese Multimodal Alignment in Large Vision-Language Models0
Meta Sequence Learning for Generating Adequate Question-Answer Pairs0
MHQA: A Diverse, Knowledge Intensive Mental Health Question Answering Challenge for Language Models0
MIBench: Evaluating Multimodal Large Language Models over Multiple Images0
Use neural networks to recognize students' handwritten letters and incorrect symbols0
Using contradictions improves question answering systems0
Using Large Language Models for Automated Grading of Student Writing about Science0
Are Machines Better at Complex Reasoning? Unveiling Human-Machine Inference Gaps in Entailment Verification0
MINI-LLM: Memory-Efficient Structured Pruning for Large Language Models0
Mitigating Bias for Question Answering Models by Tracking Bias Influence0
Mitigating Selection Bias with Node Pruning and Auxiliary Options0
MixQG: Neural Question Generation with Mixed Answer Types0
ZeroTuning: Unlocking the Initial Token's Power to Enhance Large Language Models Without Training0
A Comparative Study of AI-Generated (GPT-4) and Human-crafted MCQs in Programming Education0
A Joint-Reasoning based Disease Q&A System0
AI-based Arabic Language and Speech Tutor0
MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence0
VCEval: Rethinking What is a Good Educational Video and How to Automatically Evaluate It0
Modeling of Item-Difficulty for Ontology-based MCQs0
Show:102550
← PrevPage 23 of 45Next →

No leaderboard results yet.