SOTAVerified

Benchmarking

Papers

Showing 3451-3460 of 5548 papers

| Title | Status | Hype |
| --- | --- | --- |
| Multifactorial Cellular Genetic Algorithm (MFCGA): Algorithmic Design, Performance Comparison and Genetic Transferability Analysis | | 0 |
| Multi-Fidelity Methods for Optimization: A Survey | | 0 |
| MultiHuman-Testbench: Benchmarking Image Generation for Multiple Humans | | 0 |
| Multi-input Multi-output Loewner Framework for Vibration-based Damage Detection on a Trainer Jet | | 0 |
| Multi-label classification for biomedical literature: an overview of the BioCreative VII LitCovid Track for COVID-19 literature topic annotations | | 0 |
| Multilingual European Language Models: Benchmarking Approaches and Challenges | | 0 |
| Multilingual Large Language Models Are Not (Yet) Code-Switchers | | 0 |
| Multilingual Protest News Detection - Shared Task 1, CASE 2021 | | 0 |
| MultiMed: Massively Multimodal and Multitask Medical Understanding | | 0 |
| Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models | | 0 |
Page 346 of 555

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |