SOTAVerified

Benchmarking

Papers

Showing 3651-3660 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| Alexpaca: Learning Factual Clarification Question Generation Without Examples | | 0 |
| Benchmarking Foundation Speech and Language Models for Alzheimer's Disease and Related Dementia Detection from Spontaneous Speech | | 0 |
| Benchmarking Foundation Models with Language-Model-as-an-Examiner | | 0 |
| Benchmarking Foundation Models for Zero-Shot Biometric Tasks | | 0 |
| MobileAgentBench: An Efficient and User-Friendly Benchmark for Mobile LLM Agents | | 0 |
| MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases | | 0 |
| Benchmarking foundation models as feature extractors for weakly-supervised computational pathology | | 0 |
| Model Agnostic Explainable Selective Regression via Uncertainty Estimation | | 0 |
| Model-based trajectory stitching for improved behavioural cloning and its applications | | 0 |
| Model-Based Underwater 6D Pose Estimation from RGB | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |