SOTAVerified

Benchmarking

Papers

Showing 48214830 of 5548 papers

TitleStatusHype
Universal Music Representations? Evaluating Foundation Models on World Music CorporaCode0
MM-Soc: Benchmarking Multimodal Large Language Models in Social Media PlatformsCode0
Fluorescence Reference Target Quantitative Analysis LibraryCode0
FLsim: A Modular and Library-Agnostic Simulation Framework for Federated LearningCode0
FlowCyt: A Comparative Study of Deep Learning Approaches for Multi-Class Classification in Flow Cytometry BenchmarkingCode0
SCAM: A Real-World Typographic Robustness Evaluation for Multimodal Foundation ModelsCode0
Benchmarking Sequential Visual Input Reasoning and Prediction in Multimodal Large Language ModelsCode0
FlexMol: A Flexible Toolkit for Benchmarking Molecular Relational LearningCode0
ZNN - A Fast and Scalable Algorithm for Training 3D Convolutional Networks on Multi-Core and Many-Core Shared Memory MachinesCode0
Wildfire spread forecasting with Deep LearningCode0
Show:102550
← PrevPage 483 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified