SOTAVerified

Benchmarking

Papers

Showing 961970 of 5548 papers

TitleStatusHype
Challenges and Opportunities in Improving Worst-Group Generalization in Presence of Spurious FeaturesCode1
GADBench: Revisiting and Benchmarking Supervised Graph Anomaly DetectionCode1
Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized CodebaseCode1
IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARLCode1
Geometric Deep Learning for Structure-Based Drug Design: A SurveyCode1
causalAssembly: Generating Realistic Production Data for Benchmarking Causal DiscoveryCode1
Beyond Normal: On the Evaluation of Mutual Information EstimatorsCode1
CompanyKG: A Large-Scale Heterogeneous Graph for Company Similarity QuantificationCode1
OpenDataVal: a Unified Benchmark for Data ValuationCode1
Evaluating Graph Neural Networks for Link Prediction: Current Pitfalls and New BenchmarkingCode1
Show:102550
← PrevPage 97 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified