SOTAVerified

Benchmarking

Papers

Showing 45114520 of 5548 papers

TitleStatusHype
Introducing SLAMBench, a performance and accuracy benchmarking methodology for SLAMCode0
INTERSPEECH 2009 Emotion Challenge Revisited: Benchmarking 15 Years of Progress in Speech Emotion RecognitionCode0
Integration of nested cross-validation, automated hyperparameter optimization, high-performance computing to reduce and quantify the variance of test performance estimation of deep learning modelsCode0
BdSLW60: A Word-Level Bangla Sign Language DatasetCode0
The Butterfly Effect of Model Editing: Few Edits Can Trigger Large Language Models CollapseCode0
Integrating Expert Knowledge into Logical Programs via LLMsCode0
The CaLiGraph Ontology as a Challenge for OWL ReasonersCode0
The Catechol Benchmark: Time-series Solvent Selection Data for Few-shot Machine LearningCode0
Strong and Simple Baselines for Multimodal Utterance EmbeddingsCode0
InstaIndoor and Multi-modal Deep Learning for Indoor Scene RecognitionCode0
Show:102550
← PrevPage 452 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified