SOTAVerified

Benchmarking

Papers

Showing 851860 of 5548 papers

TitleStatusHype
EDFace-Celeb-1M: Benchmarking Face Hallucination with a Million-scale DatasetCode1
CAVIAR: Co-simulation of 6G Communications, 3D Scenarios and AI for Digital TwinsCode1
Addressing Shortcomings in Fair Graph Learning Datasets: Towards a New BenchmarkCode1
CBench: Towards Better Evaluation of Question Answering Over Knowledge GraphsCode1
Addressing the generalization of 3D registration methods with a featureless baseline and an unbiased benchmarkCode1
Benchmarking machine learning models on multi-centre eICU critical care datasetCode1
An Empirical Study on Google Research Football Multi-agent ScenariosCode1
Towards Motion Forecasting with Real-World Perception Inputs: Are End-to-End Approaches Competitive?Code1
AI in Lung Health: Benchmarking Detection and Diagnostic Models Across Multiple CT Scan DatasetsCode1
A Survey of Pathology Foundation Model: Progress and Future DirectionsCode1
Show:102550
← PrevPage 86 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified