SOTAVerified

Memorization

Papers

Showing 125 of 1088 papers

TitleStatusHype
What Should LLMs Forget? Quantifying Personal Data in LLMs for Right-to-Be-Forgotten Requests0
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data ContaminationCode1
Entropy-Memorization Law: Evaluating Memorization Difficulty of Data in LLMs0
MMReason: An Open-Ended Multi-Modal Multi-Step Reasoning Benchmark for MLLMs Toward AGICode0
Listener-Rewarded Thinking in VLMs for Image Preferences0
Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test0
Leaner Training, Lower Leakage: Revisiting Memorization in LLM Fine-Tuning with LoRA0
Counterfactual Influence as a Distributional Quantity0
Uncovering Conceptual Blindspots in Generative Image Models Using Sparse Autoencoders0
A Random Matrix Analysis of In-context Memorization for Nonlinear Attention0
Robots and Children that Learn Together : Improving Knowledge Retention by Teaching Peer-Like Interactive Robots0
In-Context Learning Strategies Emerge Rationally0
Winter Soldier: Backdooring Language Models at Pre-Training with Indirect Data Poisoning0
Less is More: Undertraining Experts Improves Model Upcycling0
Dataset distillation for memorized data: Soft labels can leak held-out teacher knowledgeCode0
Capacity Matters: a Proof-of-Concept for Transformer Memorization on Real-World DataCode0
LexiMark: Robust Watermarking via Lexical Substitutions to Enhance Membership Verification of an LLM's Textual Training DataCode0
Sharpness-Aware Machine Unlearning0
The SWE-Bench Illusion: When State-of-the-Art LLMs Remember Instead of Reason0
Restoring Gaussian Blurred Face Images for Deanonymization Attacks0
SoK: Data Reconstruction Attacks Against Machine Learning Models: Definition, Metrics, and Benchmark0
Private Memorization Editing: Turning Memorization into a Defense to Strengthen Data Privacy in Large Language ModelsCode0
HeuriGym: An Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial OptimizationCode2
Generative Modeling of Weights: Generalization or Memorization?Code1
Diffusion models under low-noise regimeCode0
Show:102550
← PrevPage 1 of 44Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM-540B (few-shot, k=5)Accuracy95.4Unverified
2Gopher-280B (few-shot, k=5)Accuracy80Unverified
3PaLM-62B (few-shot, k=5)Accuracy77.7Unverified