SOTAVerified

Multiple-choice

Papers

Showing 441450 of 1107 papers

TitleStatusHype
Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language UnderstandingCode0
Affordably Fine-tuned LLMs Provide Better Answers to Course-specific MCQsCode0
Knowledge Retrieval Based on Generative AI0
DRIVINGVQA: Analyzing Visual Chain-of-Thought Reasoning of Vision Language Models in Real-World Scenarios with Driving Theory Tests0
Localizing AI: Evaluating Open-Weight Language Models for Languages of Baltic States0
(WhyPHI) Fine-Tuning PHI-3 for Multiple-Choice Question Answering: Methodology, Results, and ChallengesCode0
CLIP-UP: CLIP-Based Unanswerable Problem Detection for Visual Question Answering0
FSBench: A Figure Skating Benchmark for Advancing Artistic Sports UnderstandingCode0
Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation0
Q-Bench-Video: Benchmark the Video Quality Understanding of LMMs0
Show:102550
← PrevPage 45 of 111Next →

No leaderboard results yet.