SOTAVerified

Memorization

Papers

Showing 921930 of 1088 papers

TitleStatusHype
The Low-Resource Double Bind: An Empirical Study of Pruning for Low-Resource Machine TranslationCode0
When does compositional structure yield compositional generalization? A kernel theoryCode0
Long-Tail Theory under Gaussian MixturesCode0
AGITB: A Signal-Level Benchmark for Evaluating Artificial General IntelligenceCode0
Assessing Hidden Risks of LLMs: An Empirical Study on Robustness, Consistency, and CredibilityCode0
LOPS: Learning Order Inspired Pseudo-Label Selection for Weakly Supervised Text ClassificationCode0
3D-EX : A Unified Dataset of Definitions and Dictionary ExamplesCode0
How Do Your Biomedical Named Entity Recognition Models Generalize to Novel Entities?Code0
Auditing Data Provenance in Text-Generation ModelsCode0
Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?Code0
Show:102550
← PrevPage 93 of 109Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM-540B (few-shot, k=5)Accuracy95.4Unverified
2Gopher-280B (few-shot, k=5)Accuracy80Unverified
3PaLM-62B (few-shot, k=5)Accuracy77.7Unverified