SOTAVerified

Benchmarking

Papers

Showing 951960 of 5548 papers

TitleStatusHype
Benchmarking Differential Privacy and Federated Learning for BERT ModelsCode1
Accelerated and interpretable oblique random survival forestsCode1
Explainable Benchmarking for Iterative Optimization HeuristicsCode1
Benchmarking Distribution Shift in Tabular Data with TableShiftCode1
DataRec: A Python Library for Standardized and Reproducible Data Management in Recommender SystemsCode1
Dataset and Benchmark: Novel Sensors for Autonomous Vehicle PerceptionCode1
Working Memory Capacity of ChatGPT: An Empirical StudyCode1
Data Splits and Metrics for Method Benchmarking on Surgical Action Triplet DatasetsCode1
EvalCrafter: Benchmarking and Evaluating Large Video Generation ModelsCode1
Benchmarking Multimodal Knowledge Conflict for Large Multimodal ModelsCode1
Show:102550
← PrevPage 96 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified