SOTAVerified

Benchmarking

Papers

Showing 16711680 of 5548 papers

TitleStatusHype
Benchmarking Cognitive Domains for LLMs: Insights from Taiwanese Hakka Culture0
A Distance Oriented Kalman Filter Particle Swarm Optimizer Applied to Multi-Modality Image Registration0
MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems0
Determinants of Performance in European ATM -- How to Analyze a Diverse Industry0
DiPlomat: A Dialogue Dataset for Situated Pragmatic Reasoning0
Benchmarking CNN on 3D Anatomical Brain MRI: Architectures, Data Augmentation and Deep Ensemble Learning0
Benchmarking Clinical Decision Support Search0
Ad-hoc Concept Forming in the Game Codenames as a Means for Evaluating Large Language Models0
DeepSIC: Deep Semantic Image Compression0
Benchmarking Classical, Deep, and Generative Models for Human Activity Recognition0
Show:102550
← PrevPage 168 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified