SOTAVerified

Benchmarking

Papers

Showing 37263750 of 5548 papers

TitleStatusHype
Benchmarking Model Predictive Control Algorithms in Building Optimization Testing Framework (BOPTEST)0
Multifactorial Cellular Genetic Algorithm (MFCGA): Algorithmic Design, Performance Comparison and Genetic Transferability Analysis0
Multi-Fidelity Methods for Optimization: A Survey0
Benchmarking Evolutionary Algorithms For Single Objective Real-valued Constrained Optimization - A Critical Review0
Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition0
MultiHuman-Testbench: Benchmarking Image Generation for Multiple Humans0
Benchmarking Ethical and Safety Risks of Healthcare LLMs in China-Toward Systemic Governance under Healthy China 20300
Multi-input Multi-output Loewner Framework for Vibration-based Damage Detection on a Trainer Jet0
Benchmarking Estimators for Natural Experiments: A Novel Dataset and a Doubly Robust Algorithm0
Multi-label classification for biomedical literature: an overview of the BioCreative VII LitCovid Track for COVID-19 literature topic annotations0
Benchmarking energy consumption and latency for neuromorphic computing in condensed matter and particle physics0
Multilingual European Language Models: Benchmarking Approaches and Challenges0
Multilingual Large Language Models Are Not (Yet) Code-Switchers0
Multilingual Protest News Detection - Shared Task 1, CASE 20210
Benchmarking Energy-Conserving Neural Networks for Learning Dynamics from Data0
Benchmarking Energy and Latency in TinyML: A Novel Method for Resource-Constrained AI0
MultiMed: Massively Multimodal and Multitask Medical Understanding0
Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models0
A Data-Driven Method to Identify IBRs with Dominant Participation in Sub-Synchronous Oscillations0
Towards Sentiment Analysis of Tobacco Products’ Usage in Social Media0
Multimodal Deep Learning for Scientific Imaging Interpretation0
Multimodal Deep Reinforcement Learning for Portfolio Optimization0
Multi-Modal Explainable Medical AI Assistant for Trustworthy Human-AI Collaboration0
Benchmarking End-To-End Performance of AI-Based Chip Placement Algorithms0
Benchmarking End-to-end Learning of MIMO Physical-Layer Communication0
Show:102550
← PrevPage 150 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified