SOTAVerified

Benchmarking

Papers

Showing 41014110 of 5548 papers

TitleStatusHype
FIMP: Foundation Model-Informed Message Passing for Graph Neural Networks0
Conditional Neural Processes for Molecules0
DyFEn: Agent-Based Fee Setting in Payment Channel Networks0
A Survey of Parameters Associated with the Quality of Benchmarks in NLP0
TweetNERD -- End to End Entity Linking Benchmark for TweetsCode0
Benchmarking Long-tail Generalization with Likelihood SplitsCode0
OpenOOD: Benchmarking Generalized Out-of-Distribution DetectionCode0
Simulated Contextual Bandits for Personalization Tasks from Recommendation DatasetsCode0
Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems0
Vote'n'Rank: Revision of Benchmarking with Social Choice TheoryCode0
Show:102550
← PrevPage 411 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified