SOTAVerified

Benchmarking

Papers

Showing 1301-1310 of 5548 papers

Title | Status | Hype
Benchmarking Differential Privacy and Federated Learning for BERT Models | Code | 1
You are AllSet: A Multiset Function Framework for Hypergraph Neural Networks | Code | 1
Mutual-Information Based Few-Shot Classification | Code | 1
Synthetic Benchmarks for Scientific Research in Explainable Machine Learning | Code | 1
Underwater Image Restoration via Contrastive Learning and a Real-world Dataset | Code | 1
Intrinsic Image Harmonization | Code | 1
Perception Matters: Detecting Perception Failures of VQA Models Using Metamorphic Testing | Code | 1
Understanding and Evaluating Racial Biases in Image Captioning | Code | 1
Selection of Source Images Heavily Influences the Effectiveness of Adversarial Attacks | Code | 1
Online Learning with Optimism and Delay | Code | 1
Page 131 of 555

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified