SOTAVerified

Benchmarking

Papers

Showing 3501–3510 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| Large Language Models as Automated Aligners for benchmarking Vision-Language Models | | 0 |
| An Empirical Investigation into Benchmarking Model Multiplicity for Trustworthy Machine Learning: A Case Study on Image Classification | | 0 |
| Dialogue Quality and Emotion Annotations for Customer Support Conversations | Code | 0 |
| Learning Dynamic Selection and Pricing of Out-of-Home Deliveries | Code | 0 |
| Automated 3D Tumor Segmentation using Temporal Cubic PatchGAN (TCuP-GAN) | | 0 |
| Creating and Leveraging a Synthetic Dataset of Cloud Optical Thickness Measures for Cloud Detection in MSI | Code | 0 |
| A projected nonlinear state-space model for forecasting time series signals | Code | 0 |
| Benchmarking Toxic Molecule Classification using Graph Neural Networks and Few Shot Learning | | 0 |
| Benchmarking bias: Expanding clinical AI model card to incorporate bias reporting of social and non-social factors | | 0 |
| Deep State-Space Model for Predicting Cryptocurrency Price | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |