SOTAVerified

Memorization

Papers

Showing 851900 of 1088 papers

TitleStatusHype
Per-Example Gradient Regularization Improves Learning Signals from Noisy Data0
PII-Compass: Guiding LLM training data extraction prompts towards the target PII via grounding0
PLATO-K: Internal and External Knowledge Enhanced Dialogue Generation0
Polling Latent Opinions: A Method for Computational Sociolinguistics Using Transformer Language Models0
Positional Description Matters for Transformers Arithmetic0
Positional Fragility in LLMs: How Offset Effects Reshape Our Understanding of Memorization Risks0
Power System Event Identification based on Deep Neural Network with Information Loading0
PreCog: Exploring the Relation between Memorization and Performance in Pre-trained Language Models0
Predicting Grokking Long Before it Happens: A look into the loss landscape of models which grok0
Preserving Privacy in GANs Against Membership Inference Attack0
Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNsCode0
Associative Long Short-Term MemoryCode0
KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from ServerCode0
Next-token prediction capacity: general upper bounds and a lower bound for transformersCode0
Out-of-Distribution Detection based on In-Distribution Data Patterns Memorization with Modern Hopfield EnergyCode0
Overparameterized Neural Networks Implement Associative MemoryCode0
Schema-Guided Paradigm for Zero-Shot DialogCode0
OWL: Probing Cross-Lingual Recall of Memorized Texts via World LiteratureCode0
PALATE: Peculiar Application of the Law of Total Expectation to Enhance the Evaluation of Deep Generative ModelsCode0
Distributed Associative Memory Network with Memory Refreshing LossCode0
Jogging the Memory of Unlearned LLMs Through Targeted Relearning AttacksCode0
Iterative Graph AlignmentCode0
PANORAMA: A synthetic PII-laced dataset for studying sensitive data memorization in LLMsCode0
Capacity Control is an Effective Memorization Mitigation Mechanism in Text-Conditional Diffusion ModelsCode0
Infinite dSprites for Disentangled Continual Learning: Separating Memory Edits from GeneralizationCode0
Searching to Exploit Memorization Effect in Learning from Corrupted LabelsCode0
Analyzing Memorization in Large Language Models through the Lens of Model AttributionCode0
Illusion or Algorithm? Investigating Memorization, Emergence, and Symbolic Processing in In-Context LearningCode0
Broccoli: Sprinkling Lightweight Vocabulary Learning into Everyday Information DietsCode0
A Good Score Does not Lead to A Good Generative ModelCode0
Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion ModelsCode0
Diffusion models under low-noise regimeCode0
Untrained neural networks can demonstrate memorization-independent abstract reasoningCode0
Learning to Infer Program SketchesCode0
ModelPred: A Framework for Predicting Trained Model from Training DataCode0
Detecting Overfitting of Deep Generative Networks via Latent RecoveryCode0
Introducing Orthogonal Constraint in Structural ProbesCode0
Using Wavelets and Spectral Methods to Study Patterns in Image-Classification DatasetsCode0
In-context Learning in Presence of Spurious CorrelationsCode0
Imputation with Inter-Series Information from Prototypes for Irregular Sampled Time SeriesCode0
Planting and Mitigating Memorized Content in Predictive-Text Language ModelsCode0
Improving the Gating Mechanism of Recurrent Neural NetworksCode0
Learning with Open-world Noisy Data via Class-independent Margin in Dual Representation SpaceCode0
Black-box Unsupervised Domain Adaptation with Bi-directional Atkinson-Shiffrin MemoryCode0
Improving Generalization in Meta-Learning via Meta-Gradient AugmentationCode0
Pointer Value Retrieval: A new benchmark for understanding the limits of neural network generalizationCode0
What Do Compressed Multilingual Machine Translation Models Forget?Code0
Polling Latent Opinions: A Method for Computational Sociolinguistics Using Transformer Language ModelsCode0
Leave-one-out Distinguishability in Machine LearningCode0
Self-generated Replay Memories for Continual Neural Machine TranslationCode0
Show:102550
← PrevPage 18 of 22Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM-540B (few-shot, k=5)Accuracy95.4Unverified
2Gopher-280B (few-shot, k=5)Accuracy80Unverified
3PaLM-62B (few-shot, k=5)Accuracy77.7Unverified