SOTAVerified

Multiple-choice

Papers

Showing 701710 of 1107 papers

TitleStatusHype
SecBench: A Comprehensive Multi-Dimensional Benchmarking Dataset for LLMs in Cybersecurity0
SECURA: Sigmoid-Enhanced CUR Decomposition with Uninterrupted Retention and Low-Rank Adaptation in Large Language Models0
Advanced Financial Reasoning at Scale: A Comprehensive Evaluation of Large Language Models on CFA Level III0
Addressing Blind Guessing: Calibration of Selection Bias in Multiple-Choice Question Answering by Video Language Models0
From Human Days to Machine Seconds: Automatically Answering and Generating Machine Learning Final Exams0
A Data-Driven Study of Commonsense Knowledge using the ConceptNet Knowledge Base0
Seeing the Forest and the Trees: Solving Visual Graph and Tree Based Data Structure Problems using Large Multimodal Models0
Selective Particle Attention: Visual Feature-Based Attention in Deep Reinforcement Learning0
Self-Evaluation Improves Selective Generation in Large Language Models0
Adaptive Wizard for Removing Cross-Tier Misconfigurations in Active Directory0
Show:102550
← PrevPage 71 of 111Next →

No leaderboard results yet.