SOTAVerified

Benchmarking

Papers

Showing 34513475 of 5548 papers

TitleStatusHype
Multifactorial Cellular Genetic Algorithm (MFCGA): Algorithmic Design, Performance Comparison and Genetic Transferability Analysis0
Multi-Fidelity Methods for Optimization: A Survey0
MultiHuman-Testbench: Benchmarking Image Generation for Multiple Humans0
Multi-input Multi-output Loewner Framework for Vibration-based Damage Detection on a Trainer Jet0
Multi-label classification for biomedical literature: an overview of the BioCreative VII LitCovid Track for COVID-19 literature topic annotations0
Multilingual European Language Models: Benchmarking Approaches and Challenges0
Multilingual Large Language Models Are Not (Yet) Code-Switchers0
Multilingual Protest News Detection - Shared Task 1, CASE 20210
MultiMed: Massively Multimodal and Multitask Medical Understanding0
Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models0
Multimodal Deep Learning for Scientific Imaging Interpretation0
Multimodal Deep Reinforcement Learning for Portfolio Optimization0
Multi-Modal Explainable Medical AI Assistant for Trustworthy Human-AI Collaboration0
Multimodal Information Retrieval for Open World with Edit Distance Weak Supervision0
Multimodal or Text? Retrieval or BERT? Benchmarking Classifiers for the Shared Task on Hateful Memes0
Multi-Modal Three-Stream Network for Action Recognition0
MultiON: Benchmarking Semantic Map Memory using Multi-Object Navigation0
LadderMIL: Multiple Instance Learning with Coarse-to-Fine Self-Distillation0
MultiRobustBench: Benchmarking Robustness Against Multiple Attacks0
MultiSocial: Multilingual Benchmark of Machine-Generated Text Detection of Social-Media Texts0
MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing0
Non-linear Multitask Learning with Deep Gaussian Processes0
Multivariate Stochastic Dominance via Optimal Transport and Applications to Models Benchmarking0
Multiview Aerial Visual Recognition (MAVREC): Can Multi-view Improve Aerial Visual Perception?0
Multi-view deep learning based molecule design and structural optimization accelerates the SARS-CoV-2 inhibitor discovery0
Show:102550
← PrevPage 139 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified