SOTAVerified

Benchmarking

Papers

Showing 38513875 of 5548 papers

TitleStatusHype
Benchmarking Cognitive Domains for LLMs: Insights from Taiwanese Hakka Culture0
Benchmarking CNN on 3D Anatomical Brain MRI: Architectures, Data Augmentation and Deep Ensemble Learning0
Benchmarking Clinical Decision Support Search0
No Dataset Needed for Downstream Knowledge Benchmarking: Response Dispersion Inversely Correlates with Accuracy on Domain-specific QA0
NODDI-SH: a computational efficient NODDI extension for fODF estimation in diffusion MRI0
Benchmarking Classical, Deep, and Generative Models for Human Activity Recognition0
Node Classification Meets Link Prediction on Knowledge Graphs0
Nodule detection and generation on chest X-rays: NODE21 Challenge0
Training Transformers with Enforced Lipschitz Constants0
NoisyEQA: Benchmarking Embodied Question Answering Against Noisy Queries0
NoisyHate: Mining Online Human-Written Perturbations for Realistic Robustness Benchmarking of Content Moderation Models0
Noisy intermediate-scale quantum (NISQ) algorithms0
Trajectory Normalized Gradients for Distributed Optimization0
ActPlan-1K: Benchmarking the Procedural Planning Ability of Visual Language Models in Household Activities0
InferBench: Understanding Deep Learning Inference Serving with an Automatic Benchmarking System0
Non-Contextual Modeling of Sarcasm using a Neural Network Benchmark0
Non-Reference Quality Assessment for Medical Imaging: Application to Synthetic Brain MRIs0
Nonstochastic Bandits with Infinitely Many Experts0
TRAM: Benchmarking Temporal Reasoning for Large Language Models0
NoTeS-Bank: Benchmarking Neural Transcription and Search for Scientific Notes Understanding0
Not Every Tree Is a Forest: Benchmarking Forest Types from Satellite Remote Sensing0
NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription0
NOVA: A Benchmark for Anomaly Localization and Clinical Reasoning in Brain MRI0
NovelGym: A Flexible Ecosystem for Hybrid Planning and Learning Agents Designed for Open Worlds0
Long Short-Term Memory with Gate and State Level Fusion for Light Field-Based Face Recognition0
Show:102550
← PrevPage 155 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified