SOTAVerified

Memorization

Papers

Showing 201-250 of 1088 papers

Title | Status | Hype
When Can Memorization Improve Fairness? | - | 0
Underestimated Privacy Risks for Minority Populations in Large Language Model Unlearning | - | 0
MemHunter: Automated and Verifiable Memorization Detection at Dataset-scale in LLMs | - | 0
The Pitfalls of Memorization: When Memorization Hurts Generalization | Code | 1
Exploring Memorization and Copyright Violation in Frontier LLMs: A Study of the New York Times v. OpenAI 2023 Lawsuit | - | 0
Robust Noisy Correspondence Learning via Self-Drop and Dual-Weight | - | 0
DeMem: Privacy-Enhanced Robust Adversarial Learning via De-Memorization | Code | 0
Sometimes I am a Tree: Data Drives Unstable Hierarchical Generalization | Code | 0
T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts | - | 0
Understanding Memorization in Generative Models via Sharpness in Probability Landscapes | - | 0
Improved Localized Machine Unlearning Through the Lens of Memorization | - | 0
Detecting Memorization in Large Language Models | - | 0
CopyrightShield: Spatial Similarity Guided Backdoor Defense against Copyright Infringement in Diffusion Models | - | 0
LoyalDiffusion: A Diffusion Model Guarding Against Data Replication | - | 0
Learned Random Label Predictions as a Neural Network Complexity Metric | - | 0
Integrating Functionalities To A System Via Autoencoder Hippocampus Network | - | 0
Differential learning kinetics govern the transition from memorization to generalization during in-context learning | - | 0
A solvable generative model with a linear, one-step denoiser | - | 0
Parameter Efficient Instruction Tuning: An Empirical Study | Code | 4
Classifier-Free Guidance inside the Attraction Basin May Cause Memorization | Code | 0
Data Watermarking for Sequential Recommender Systems | Code | 0
Vertical Validation: Evaluating Implicit Generative Models for Graphs on Thin Support Regions | - | 0
Are Large Language Models Memorizing Bug Benchmarks? | - | 0
Branches, Assemble! Multi-Branch Cooperation Network for Large-Scale Click-Through Rate Prediction at Taobao | - | 0
Education in the Era of Neurosymbolic AI | - | 0
Measuring Non-Adversarial Reproduction of Training Data in Large Language Models | - | 0
Memorization in Attention-only Transformers | Code | 0
What Do Learning Dynamics Reveal About Generalization in LLM Reasoning? | Code | 1
Model Editing for LLMs4Code: How Far are We? | Code | 0
Continual Memorization of Factoids in Large Language Models | Code | 1
Unlearning in- vs. out-of-distribution data in LLMs under gradient-based method | - | 0
LSHBloom: Memory-efficient, Extreme-scale Document Deduplication | - | 0
Extracting Unlearned Information from LLMs with Activation Steering | - | 0
Generalizability of Memorization Neural Networks | - | 0
A Geometric Framework for Understanding Memorization in Generative Models | - | 0
Dynamic Uncertainty Ranking: Enhancing In-Context Learning for Long-Tail Knowledge in LLMs | - | 0
Understanding Generalizability of Diffusion Models Requires Rethinking the Hidden Gaussian Structure | Code | 1
DASH: Warm-Starting Neural Network Training in Stationary Settings without Loss of Plasticity | Code | 1
Provably Optimal Memory Capacity for Modern Hopfield Models: Transformer-Compatible Dense Associative Memories as Spherical Codes | - | 0
On Memorization of Large Language Models in Logical Reasoning | - | 0
Exploring Local Memorization in Diffusion Models via Bright Ending Attention | - | 0
Investigating Memorization in Video Diffusion Models | - | 0
Capacity Control is an Effective Memorization Mitigation Mechanism in Text-Conditional Diffusion Models | Code | 0
Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics | Code | 1
Historical Test-time Prompt Tuning for Vision Foundation Models | - | 0
Measuring memorization through probabilistic discoverable extraction | - | 0
Mixture of Parrots: Experts improve memorization more than reasoning | - | 0
Dual-Model Defense: Safeguarding Diffusion Models from Membership Inference Attacks through Disjoint Data Splitting | - | 0
Scalability of memorization-based machine unlearning | Code | 1
A Simple Model of Inference Scaling Laws | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PaLM-540B (few-shot, k=5) | Accuracy | 95.4 | - | Unverified
2 | Gopher-280B (few-shot, k=5) | Accuracy | 80 | - | Unverified
3 | PaLM-62B (few-shot, k=5) | Accuracy | 77.7 | - | Unverified