SOTAVerified

Benchmarking

Papers

Showing 22712280 of 5548 papers

TitleStatusHype
HammerBench: Fine-Grained Function-Calling Evaluation in Real Mobile Device ScenariosCode0
Harmonization Benchmarking Tool for Neuroimaging DatasetsCode0
HOEG: A New Approach for Object-Centric Predictive Process MonitoringCode0
IJCB 2022 Mobile Behavioral Biometrics Competition (MobileB2C)Code0
Benchmarking Minimax LinkageCode0
Grounding Synthetic Data Evaluations of Language Models in Unsupervised Document CorporaCode0
Grounded Intuition of GPT-Vision's Abilities with Scientific ImagesCode0
A Comparative Analysis of Word-Level Metric Differential Privacy: Benchmarking The Privacy-Utility Trade-offCode0
Guidelines and Benchmarks for Deployment of Deep Learning Models on Smartphones as Real-Time AppsCode0
Grasp Pre-shape Selection by Synthetic Training: Eye-in-hand Shared Control on the Hannes ProsthesisCode0
Show:102550
← PrevPage 228 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified