SOTAVerified

Benchmarking

Papers

Showing 3661-3670 of 5548 papers

Title | Status | Hype
Quantifying Social Biases Using Templates is Unreliable | | 0
ViewFool: Evaluating the Robustness of Visual Recognition to Adversarial Viewpoints | Code | 1
Are All Steps Equally Important? Benchmarking Essentiality Detection of Events | | 0
Is margin all you need? An extensive empirical study of active learning on tabular data | | 0
A Theory of Dynamic Benchmarks | | 0
SynBench: Task-Agnostic Benchmarking of Pretrained Representations using Synthetic Data | | 0
IJCB 2022 Mobile Behavioral Biometrics Competition (MobileB2C) | Code | 0
A Framework for Large Scale Synthetic Graph Dataset Generation | | 0
Benchmarking Learnt Radio Localisation under Distribution Shift | | 0
MEDFAIR: Benchmarking Fairness for Medical Imaging | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified