SOTAVerified

Benchmarking

Papers

Showing 301310 of 5548 papers

TitleStatusHype
COALA: A Practical and Vision-Centric Federated Learning PlatformCode2
EvalGIM: A Library for Evaluating Generative Image ModelsCode2
CoIR: A Comprehensive Benchmark for Code Information Retrieval ModelsCode2
Benchmarking Complex Instruction-Following with Multiple Constraints CompositionCode2
Benchmarking Benchmark Leakage in Large Language ModelsCode2
Extended Agriculture-Vision: An Extension of a Large Aerial Image Dataset for Agricultural Pattern AnalysisCode2
FedGraph: A Research Library and Benchmark for Federated Graph LearningCode2
FetalCLIP: A Visual-Language Foundation Model for Fetal Ultrasound Image AnalysisCode2
Fino1: On the Transferability of Reasoning Enhanced LLMs to FinanceCode2
Class-incremental Learning for Time Series: Benchmark and EvaluationCode2
Show:102550
← PrevPage 31 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified