SOTAVerified

Multiple-choice

Papers

Showing 121-130 of 1107 papers

| Title | Status | Hype |
| --- | --- | --- |
| Latxa: An Open Language Model and Evaluation Suite for Basque | Code | 1 |
| Assessing the Chemical Intelligence of Large Language Models | Code | 1 |
| Let Androids Dream of Electric Sheep: A Human-like Image Implication Understanding and Reasoning Framework | Code | 1 |
| Leveraging Large Language Models for Learning Complex Legal Concepts through Storytelling | Code | 1 |
| LibriSQA: A Novel Dataset and Framework for Spoken Question Answering with Large Language Models | Code | 1 |
| LifeQA: A Real-life Dataset for Video Question Answering | Code | 1 |
| A Hitchhiker's Guide to Fine-Grained Face Forgery Detection Using Common Sense Reasoning | Code | 1 |
| FarsTail: A Persian Natural Language Inference Dataset | Code | 1 |
| FaceXBench: Evaluating Multimodal LLMs on Face Understanding | Code | 1 |
| Fake Alignment: Are LLMs Really Aligned Well? | Code | 1 |
Page 13 of 111

No leaderboard results yet.