SOTAVerified

Benchmarking

Papers

Showing 1901-1925 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| A novel database of Children's Spontaneous Facial Expressions (LIRIS-CSE) | | 0 |
| Estimating the Effect of Crosstalk Error on Circuit Fidelity Using Noisy Intermediate-Scale Quantum Devices | | 0 |
| Benchmarking and Performance Modelling of MapReduce Communication Pattern | | 0 |
| ADCB: An Alzheimer's disease benchmark for evaluating observational estimators of causal effects | | 0 |
| Channel Attention based Iterative Residual Learning for Depth Map Super-Resolution | | 0 |
| Benchmarking and Optimization of Gradient Boosting Decision Tree Algorithms | | 0 |
| A novel machine learning based framework for detection of Autism Spectrum Disorder (ASD) | | 0 |
| Benchmarking Zero-Shot Recognition with Vision-Language Models: Challenges on Granularity and Specificity | | 0 |
| Efficacy of Synthetic Data as a Benchmark | | 0 |
| Efficiency in European Air Traffic Management -- A Fundamental Analysis of Data, Models, and Methods | | 0 |
| CroCoDL: Cross-device Collaborative Dataset for Localization | | 0 |
| CrossCheckGPT: Universal Hallucination Ranking for Multimodal Foundation Models | | 0 |
| CrossCodeBench: Benchmarking Cross-Task Generalization of Source Code Models | | 0 |
| Cross-functional transferability in universal machine learning interatomic potentials | | 0 |
| Benchmarking Domain Generalization on EEG-based Emotion Recognition | | 0 |
| A Novel Momentum-Based Deep Learning Techniques for Medical Image Classification and Segmentation | | 0 |
| Efficient Benchmarking of NLP APIs using Multi-armed Bandits | | 0 |
| crossMoDA Challenge: Evolution of Cross-Modality Domain Adaptation Techniques for Vestibular Schwannoma and Cochlea Segmentation from 2021 to 2023 | | 0 |
| Challenges in Benchmarking Stream Learning Algorithms with Real-world Data | | 0 |
| Challenges and Pitfalls of Machine Learning Evaluation and Benchmarking | | 0 |
| Cross-replication Reliability -- An Empirical Approach to Interpreting Inter-rater Reliability | | 0 |
| Cross-replication Reliability - An Empirical Approach to Interpreting Inter-rater Reliability | | 0 |
| Cross-subject Brain Functional Connectivity Analysis for Multi-task Cognitive State Evaluation | | 0 |
| Cross-Subject Deep Transfer Models for Evoked Potentials in Brain-Computer Interface | | 0 |
| Benchmarking and Learning Multi-Dimensional Quality Evaluator for Text-to-3D Generation | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |