SOTAVerified

Multiple-choice

Papers

Showing 221230 of 1107 papers

TitleStatusHype
Localizing AI: Evaluating Open-Weight Language Models for Languages of Baltic States0
Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model EvaluationCode1
(WhyPHI) Fine-Tuning PHI-3 for Multiple-Choice Question Answering: Methodology, Results, and ChallengesCode0
CLIP-UP: CLIP-Based Unanswerable Problem Detection for Visual Question Answering0
Unifying Specialized Visual Encoders for Video Language ModelsCode1
Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation0
Q-Bench-Video: Benchmark the Video Quality Understanding of LMMs0
Separation of Powers: On Segregating Knowledge from Observation in LLM-enabled Knowledge-based Visual Question Answering0
FSBench: A Figure Skating Benchmark for Advancing Artistic Sports UnderstandingCode0
IllusionBench: A Large-scale and Comprehensive Benchmark for Visual Illusion Understanding in Vision-Language Models0
Show:102550
← PrevPage 23 of 111Next →

No leaderboard results yet.