SOTAVerified

Benchmarking

Papers

Showing 821830 of 5548 papers

TitleStatusHype
Benchmarking Low-Shot Robustness to Natural Distribution ShiftsCode1
How to Benchmark Vision Foundation Models for Semantic Segmentation?Code1
Benchmarking LLMs' Swarm intelligenceCode1
Dynatask: A Framework for Creating Dynamic AI Benchmark TasksCode1
Benchmarking of DL Libraries and Models on Mobile DevicesCode1
Benchmarking LLMs for Political Science: A United Nations PerspectiveCode1
HyFactor: Hydrogen-count labelled graph-based defactorization AutoencoderCode1
Earnings-22: A Practical Benchmark for Accents in the WildCode1
EBES: Easy Benchmarking for Event SequencesCode1
A Survey on Graph Counterfactual Explanations: Definitions, Methods, Evaluation, and Research ChallengesCode1
Show:102550
← PrevPage 83 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified