SOTAVerified

Memorization

Papers

Showing 551575 of 1088 papers

TitleStatusHype
The Positivity of the Neural Tangent Kernel0
The Reasoning-Memorization Interplay in Language Models Is Mediated by a Single Direction0
The Separation Capacity of Random Neural Networks0
Uncovering Memorization Effect in the Presence of Spurious Correlations0
The Sooner The Better: Investigating Structure of Early Winning Lottery Tickets0
The statistical thermodynamics of generative diffusion models: Phase transitions, symmetry breaking and critical instability0
The SWE-Bench Illusion: When State-of-the-Art LLMs Remember Instead of Reason0
The Unreasonable Effectiveness of the Class-reversed Sampling in Tail Sample Memorization0
The Vendiscope: An Algorithmic Microscope For Data Collections0
Thinking Tokens for Language Modeling0
Think or Remember? Detecting and Directing LLMs Towards Memorization or Generalization0
Three Factors Influencing Minima in SGD0
Through a Compressed Lens: Investigating the Impact of Quantization on LLM Explainability and Interpretability0
Time-Aware Language Models as Temporal Knowledge Bases0
To Each (Textual Sequence) Its Own: Improving Memorized-Data Unlearning in Large Language Models0
Too Big to Fool: Resisting Deception in Language Models0
Extracting Training Data from Unconditional Diffusion Models0
Towards Better Generalization in Open-Domain Question Answering by Mitigating Context Memorization0
Towards Differential Relational Privacy and its use in Question Answering0
Towards GAN Benchmarks Which Require Generalization0
Towards Harnessing Feature Embedding for Robust Learning with Noisy Labels0
Towards Memorization-Free Diffusion Models0
Towards Model-Size Agnostic, Compute-Free, Memorization-based Inference of Deep Learning0
Towards the Memorization Effect of Neural Networks in Adversarial Training0
Trade-offs in Data Memorization via Strong Data Processing Inequalities0
Show:102550
← PrevPage 23 of 44Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM-540B (few-shot, k=5)Accuracy95.4Unverified
2Gopher-280B (few-shot, k=5)Accuracy80Unverified
3PaLM-62B (few-shot, k=5)Accuracy77.7Unverified