SOTAVerified

Benchmarking

Papers

Showing 3071–3080 of 5548 papers

Title | Status | Hype
It's all about PR -- Smart Benchmarking AI Accelerators using Performance Representatives | - | 0
Reinforcement Learning to Disentangle Multiqubit Quantum States from Partial Observations | Code | 0
ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets | - | 0
MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases | - | 0
A PRISMA Driven Systematic Review of Publicly Available Datasets for Benchmark and Model Developments for Industrial Defect Detection | - | 0
Advancing Annotation of Stance in Social Media Posts: A Comparative Analysis of Large Language Models and Crowd Sourcing | - | 0
Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning | Code | 0
DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus Bird Species Recognition | - | 0
Benchmarking and Boosting Radiology Report Generation for 3D High-Resolution Medical Images | - | 0
MultiTrust: A Comprehensive Benchmark Towards Trustworthy Multimodal Large Language Models | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | - | Unverified