SOTAVerified

Multiple-choice

Papers

Showing 751775 of 1107 papers

TitleStatusHype
LMVE at SemEval-2020 Task 4: Commonsense Validation and Explanation using Pretraining Language Model0
Localizing AI: Evaluating Open-Weight Language Models for Languages of Baltic States0
Unlocking Video-LLM via Agent-of-Thoughts Distillation0
Unmasking Deceptive Visuals: Benchmarking Multimodal Large Language Models on Misleading Chart Question Answering0
LogiDynamics: Unraveling the Dynamics of Logical Inference in Large Language Model Reasoning0
LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models0
An Add-On for Empowering Google Forms to be an Automatic Question Generator in Online Assessments0
Unsupervised Explanation Generation for Machine Reading Comprehension0
Unsupervised multiple-choice question generation for out-of-domain Q\&A fine-tuning0
LongPerceptualThoughts: Distilling System-2 Reasoning for System-1 Perception0
LongReason: A Synthetic Long-Context Reasoning Benchmark via Context Expansion0
LookAlike: Consistent Distractor Generation in Math MCQs0
Looking Beyond Sentence-Level Natural Language Inference for Question Answering and Text Summarization0
Looking Beyond Short-Premise Natural Language Inference for Downstream Tasks0
Unsupervised multiple-choice question generation for out-of-domain Q&A fine-tuning0
Make a Choice! Knowledge Base Question Answering with In-Context Learning0
Amobee at SemEval-2019 Tasks 5 and 6: Multiple Choice CNN Over Contextual Embedding0
MANTA: A Large-Scale Multi-View and Visual-Text Anomaly Detection Dataset for Tiny Objects0
Unsupervised multiple choices question answering via universal corpus0
MateInfoUB: A Real-World Benchmark for Testing LLMs in Competitive, Multilingual, and Multimodal Educational Tasks0
MaterialBENCH: Evaluating College-Level Materials Science Problem-Solving Abilities of Large Language Models0
Math Multiple Choice Question Generation via Human-Large Language Model Collaboration0
MCL-GAN: Generative Adversarial Networks with Multiple Specialized Discriminators0
MCQA-Eval: Efficient Confidence Evaluation in NLG with Gold-Standard Correctness Labels0
MCS-SQL: Leveraging Multiple Prompts and Multiple-Choice Selection For Text-to-SQL Generation0
Show:102550
← PrevPage 31 of 45Next →

No leaderboard results yet.