SOTAVerified

Multiple-choice

Papers

Showing 161-170 of 1107 papers

Title | Status | Hype
GPT as Knowledge Worker: A Zero-Shot Evaluation of (AI)CPA Capabilities | Code | 1
Complex Reasoning over Logical Queries on Commonsense Knowledge Graphs | Code | 1
Annealed Winner-Takes-All for Motion Forecasting | Code | 1
CHOICE: Benchmarking the Remote Sensing Capabilities of Large Vision-Language Models | Code | 1
An Open Source Data Contamination Report for Large Language Models | Code | 1
Constructing Narrative Event Evolutionary Graph for Script Event Prediction | Code | 1
Ranked Voting based Self-Consistency of Large Language Models | Code | 1
Fine-tuning Multimodal Large Language Models for Product Bundling | Code | 1
CUPCase: Clinically Uncommon Patient Cases and Diagnoses Dataset | Code | 1
CC-Riddle: A Question Answering Dataset of Chinese Character Riddles | Code | 1

No leaderboard results yet.