SOTAVerified

Memorization

Papers

Showing 851-875 of 1088 papers

| Title | Status | Hype |
| --- | --- | --- |
| Per-Example Gradient Regularization Improves Learning Signals from Noisy Data | | 0 |
| PII-Compass: Guiding LLM training data extraction prompts towards the target PII via grounding | | 0 |
| PLATO-K: Internal and External Knowledge Enhanced Dialogue Generation | | 0 |
| Polling Latent Opinions: A Method for Computational Sociolinguistics Using Transformer Language Models | | 0 |
| Positional Description Matters for Transformers Arithmetic | | 0 |
| Positional Fragility in LLMs: How Offset Effects Reshape Our Understanding of Memorization Risks | | 0 |
| Power System Event Identification based on Deep Neural Network with Information Loading | | 0 |
| PreCog: Exploring the Relation between Memorization and Performance in Pre-trained Language Models | | 0 |
| Predicting Grokking Long Before it Happens: A look into the loss landscape of models which grok | | 0 |
| Preserving Privacy in GANs Against Membership Inference Attack | | 0 |
| Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs | Code | 0 |
| Associative Long Short-Term Memory | Code | 0 |
| KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server | Code | 0 |
| Next-token prediction capacity: general upper bounds and a lower bound for transformers | Code | 0 |
| Out-of-Distribution Detection based on In-Distribution Data Patterns Memorization with Modern Hopfield Energy | Code | 0 |
| Overparameterized Neural Networks Implement Associative Memory | Code | 0 |
| Schema-Guided Paradigm for Zero-Shot Dialog | Code | 0 |
| OWL: Probing Cross-Lingual Recall of Memorized Texts via World Literature | Code | 0 |
| PALATE: Peculiar Application of the Law of Total Expectation to Enhance the Evaluation of Deep Generative Models | Code | 0 |
| Distributed Associative Memory Network with Memory Refreshing Loss | Code | 0 |
| Jogging the Memory of Unlearned LLMs Through Targeted Relearning Attacks | Code | 0 |
| Iterative Graph Alignment | Code | 0 |
| PANORAMA: A synthetic PII-laced dataset for studying sensitive data memorization in LLMs | Code | 0 |
| Capacity Control is an Effective Memorization Mitigation Mechanism in Text-Conditional Diffusion Models | Code | 0 |
| Infinite dSprites for Disentangled Continual Learning: Separating Memory Edits from Generalization | Code | 0 |

Benchmark Results

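| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PaLM-540B (few-shot, k=5) | Accuracy | 95.4 | | Unverified |
| 2 | Gopher-280B (few-shot, k=5) | Accuracy | 80 | | Unverified |
| 3 | PaLM-62B (few-shot, k=5) | Accuracy | 77.7 | | Unverified |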