SOTAVerified

Memorization

Papers

Showing 251300 of 1088 papers

TitleStatusHype
Memorization Capacity of Multi-Head Attention in TransformersCode0
Memorization-Dilation: Modeling Neural Collapse Under Label NoiseCode0
Memorization in Self-Supervised Learning Improves Downstream GeneralizationCode0
Does Your Neural Code Completion Model Use My Code? A Membership Inference ApproachCode0
Data Watermarking for Sequential Recommender SystemsCode0
Memorization and Generalization in Neural Code Intelligence ModelsCode0
Dataset distillation for memorized data: Soft labels can leak held-out teacher knowledgeCode0
Data Origin Inference in Machine LearningCode0
Memorization and Knowledge Injection in Gated LLMsCode0
Data Factors for Better Compositional GeneralizationCode0
Benchmarking Abstract and Reasoning Abilities Through A Theoretical PerspectiveCode0
Adaptive Sample Selection for Robust Learning under Label NoiseCode0
Memorization and Regularization in Generative Diffusion ModelsCode0
Data Contamination: From Memorization to ExploitationCode0
Analyzing Memorization in Large Language Models through the Lens of Model AttributionCode0
DAS3H: Modeling Student Learning and Forgetting for Optimally Scheduling Distributed Practice of SkillsCode0
CSTA: Spatial-Temporal Causal Adaptive Learning for Exemplar-Free Video Class-Incremental LearningCode0
MemControl: Mitigating Memorization in Diffusion Models via Automated Parameter SelectionCode0
Long-Tail Theory under Gaussian MixturesCode0
LLMs and Memorization: On Quality and Specificity of Copyright ComplianceCode0
LOPS: Learning Order Inspired Pseudo-Label Selection for Weakly Supervised Text ClassificationCode0
Illusion or Algorithm? Investigating Memorization, Emergence, and Symbolic Processing in In-Context LearningCode0
Automatic Classification of Attributes in German Adjective-Noun PhrasesCode0
An Inversion-based Measure of Memorization for Diffusion ModelsCode0
Memorization vs. Generalization: Quantifying Data Leakage in NLP Performance EvaluationCode0
On the privacy-utility trade-off in differentially private hierarchical text classificationCode0
Amplifying Training Data Exposure through Fine-Tuning with Pseudo-Labeled MembershipsCode0
Learning with Open-world Noisy Data via Class-independent Margin in Dual Representation SpaceCode0
Copyright Violations and Large Language ModelsCode0
Leave-one-out Distinguishability in Machine LearningCode0
Attributing Culture-Conditioned Generations to Pretraining CorporaCode0
Continual Referring Expression Comprehension via Dual Modular MemorizationCode0
A mean teacher algorithm for unlearning of language modelsCode0
Learning to Infer Program SketchesCode0
Asymptotic theory of in-context learning by linear attentionCode0
Continual Dialogue State Tracking via Example-Guided Question AnsweringCode0
ModelPred: A Framework for Predicting Trained Model from Training DataCode0
Leveraging Unlabeled Data to Track MemorizationCode0
Conserve-Update-Revise to Cure Generalization and Robustness Trade-off in Adversarial TrainingCode0
Memorization of Named Entities in Fine-tuned BERT ModelsCode0
Associative Long Short-Term MemoryCode0
Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNsCode0
Conditionally Strongly Log-Concave Generative ModelsCode0
Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMsCode0
Assessing Hidden Risks of LLMs: An Empirical Study on Robustness, Consistency, and CredibilityCode0
Composing Global Optimizers to Reasoning Tasks via Algebraic Objects in Neural NetsCode0
KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from ServerCode0
Untrained neural networks can demonstrate memorization-independent abstract reasoningCode0
Combining Recurrent, Convolutional, and Continuous-time Models with Linear State-Space LayersCode0
Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion ModelsCode0
Show:102550
← PrevPage 6 of 22Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM-540B (few-shot, k=5)Accuracy95.4Unverified
2Gopher-280B (few-shot, k=5)Accuracy80Unverified
3PaLM-62B (few-shot, k=5)Accuracy77.7Unverified