SOTAVerified

Multiple-choice

Papers

Showing 691700 of 1107 papers

TitleStatusHype
SaL-Lightning Dataset: Search and Eye Gaze Behavior, Resource Interactions and Knowledge Gain during Web Search0
Sample then Identify: A General Framework for Risk Control and Assessment in Multimodal Large Language Models0
SARI: Structured Audio Reasoning via Curriculum-Guided Reinforcement Learning0
SaudiCulture: A Benchmark for Evaluating Large Language Models Cultural Competence within Saudi Arabia0
SB-Bench: Stereotype Bias Benchmark for Large Multimodal Models0
SceMQA: A Scientific College Entrance Level Multimodal Question Answering Benchmark0
Scene Restoring for Narrative Machine Reading Comprehension0
Scheduling Algorithms for Federated Learning with Minimal Energy Consumption0
VITAL: A New Dataset for Benchmarking Pluralistic Alignment in Healthcare0
GeoSQA: A Benchmark for Scenario-based Question Answering in the Geography Domain at High School Level0
Show:102550
← PrevPage 70 of 111Next →

No leaderboard results yet.