SOTAVerified

Benchmarking

Papers

Showing 851860 of 5548 papers

TitleStatusHype
An Empirical Study of GPT-4o Image Generation CapabilitiesCode1
How to Train Neural Field Representations: A Comprehensive Study and BenchmarkCode1
Addressing Shortcomings in Fair Graph Learning Datasets: Towards a New BenchmarkCode1
AIPerf: Automated machine learning as an AI-HPC benchmarkCode1
Addressing the generalization of 3D registration methods with a featureless baseline and an unbiased benchmarkCode1
CIPCaD-Bench: Continuous Industrial Process datasets for benchmarking Causal Discovery methodsCode1
CIBench: Evaluating Your LLMs with a Code Interpreter PluginCode1
Hyperparameter optimization in deep multi-target predictionCode1
4DBInfer: A 4D Benchmarking Toolbox for Graph-Centric Predictive Modeling on Relational DBsCode1
CIDEr: Consensus-based Image Description EvaluationCode1
Show:102550
← PrevPage 86 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified