SOTAVerified

Benchmarking

Papers

Showing 3851-3860 of 5548 papers

Title | Status | Hype
Benchmarking Cognitive Domains for LLMs: Insights from Taiwanese Hakka Culture | | 0
Benchmarking CNN on 3D Anatomical Brain MRI: Architectures, Data Augmentation and Deep Ensemble Learning | | 0
Benchmarking Clinical Decision Support Search | | 0
No Dataset Needed for Downstream Knowledge Benchmarking: Response Dispersion Inversely Correlates with Accuracy on Domain-specific QA | | 0
NODDI-SH: a computational efficient NODDI extension for fODF estimation in diffusion MRI | | 0
Benchmarking Classical, Deep, and Generative Models for Human Activity Recognition | | 0
Node Classification Meets Link Prediction on Knowledge Graphs | | 0
Nodule detection and generation on chest X-rays: NODE21 Challenge | | 0
Training Transformers with Enforced Lipschitz Constants | | 0
NoisyEQA: Benchmarking Embodied Question Answering Against Noisy Queries | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified