SOTAVerified

Benchmarking

Papers

Showing 801810 of 5548 papers

TitleStatusHype
Massively Multi-Cultural Knowledge Acquisition & LM BenchmarkingCode1
Explainable Global Wildfire Prediction Models using Graph Neural NetworksCode1
Retrieve, Merge, Predict: Augmenting Tables with Data LakesCode1
Improved off-policy training of diffusion samplersCode1
JOBSKAPE: A Framework for Generating Synthetic Job Postings to Enhance Skill MatchingCode1
GenFace: A Large-Scale Fine-Grained Face Forgery Benchmark and Cross Appearance-Edge LearningCode1
We're Not Using Videos Effectively: An Updated Domain Adaptive Video Segmentation BaselineCode1
Benchmarking Transferable Adversarial AttacksCode1
Explainable Benchmarking for Iterative Optimization HeuristicsCode1
Category-wise Fine-Tuning: Resisting Incorrect Pseudo-Labels in Multi-Label Image Classification with Partial LabelsCode1
Show:102550
← PrevPage 81 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified