SOTAVerified

Benchmarking

Papers

Showing 36513675 of 5548 papers

TitleStatusHype
MTEB: Massive Text Embedding BenchmarkCode4
OpenOOD: Benchmarking Generalized Out-of-Distribution DetectionCode0
Benchmarking Long-tail Generalization with Likelihood SplitsCode0
Simulated Contextual Bandits for Personalization Tasks from Recommendation DatasetsCode0
Vote'n'Rank: Revision of Benchmarking with Social Choice TheoryCode0
DCL-Net: Deep Correspondence Learning Network for 6D Pose EstimationCode1
Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems0
Benchmarking saliency methods for chest X-ray interpretationCode1
A Survey of Methods for Addressing Class Imbalance in Deep-Learning Based Natural Language Processing0
Benchmarking Reinforcement Learning Techniques for Autonomous NavigationCode1
Quantifying Social Biases Using Templates is Unreliable0
ViewFool: Evaluating the Robustness of Visual Recognition to Adversarial ViewpointsCode1
Are All Steps Equally Important? Benchmarking Essentiality Detection of Events0
Is margin all you need? An extensive empirical study of active learning on tabular data0
A Theory of Dynamic Benchmarks0
SynBench: Task-Agnostic Benchmarking of Pretrained Representations using Synthetic Data0
IJCB 2022 Mobile Behavioral Biometrics Competition (MobileB2C)Code0
A Framework for Large Scale Synthetic Graph Dataset Generation0
Benchmarking Learnt Radio Localisation under Distribution Shift0
MEDFAIR: Benchmarking Fairness for Medical ImagingCode0
Detection and Evaluation of Clusters within Sequential Data0
rPPG-Toolbox: Deep Remote PPG ToolboxCode2
The current state of single-cell proteomics data analysisCode0
DELAD: Deep Landweber-guided deconvolution with Hessian and sparse prior0
State-specific protein-ligand complex structure prediction with a multi-scale deep generative modelCode2
Show:102550
← PrevPage 147 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified