SOTAVerified

Benchmarking

Papers

Showing 41014125 of 5548 papers

TitleStatusHype
FIMP: Foundation Model-Informed Message Passing for Graph Neural Networks0
Conditional Neural Processes for Molecules0
DyFEn: Agent-Based Fee Setting in Payment Channel Networks0
A Survey of Parameters Associated with the Quality of Benchmarks in NLP0
TweetNERD -- End to End Entity Linking Benchmark for TweetsCode0
Benchmarking Long-tail Generalization with Likelihood SplitsCode0
OpenOOD: Benchmarking Generalized Out-of-Distribution DetectionCode0
Simulated Contextual Bandits for Personalization Tasks from Recommendation DatasetsCode0
Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems0
Vote'n'Rank: Revision of Benchmarking with Social Choice TheoryCode0
A Survey of Methods for Addressing Class Imbalance in Deep-Learning Based Natural Language Processing0
Quantifying Social Biases Using Templates is Unreliable0
Are All Steps Equally Important? Benchmarking Essentiality Detection of Events0
Is margin all you need? An extensive empirical study of active learning on tabular data0
A Theory of Dynamic Benchmarks0
IJCB 2022 Mobile Behavioral Biometrics Competition (MobileB2C)Code0
SynBench: Task-Agnostic Benchmarking of Pretrained Representations using Synthetic Data0
MEDFAIR: Benchmarking Fairness for Medical ImagingCode0
Detection and Evaluation of Clusters within Sequential Data0
A Framework for Large Scale Synthetic Graph Dataset Generation0
Benchmarking Learnt Radio Localisation under Distribution Shift0
The current state of single-cell proteomics data analysisCode0
DELAD: Deep Landweber-guided deconvolution with Hessian and sparse prior0
Benchmarking Learning Efficiency in Deep Reservoir ComputingCode0
Towards Parameter-Efficient Integration of Pre-Trained Language Models In Temporal Video GroundingCode0
Show:102550
← PrevPage 165 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified