SOTAVerified

Memorization

Papers

Showing 501-550 of 1088 papers

| Title | Status | Hype |
| --- | --- | --- |
| Minimum Description Length Hopfield Networks | Code | 0 |
| Practical Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt Calibration | Code | 2 |
| Data Contamination Quiz: A Tool to Detect and Estimate Contamination in Large Language Models | Code | 1 |
| Data Factors for Better Compositional Generalization | Code | 0 |
| Preserving Privacy in GANs Against Membership Inference Attack | - | 0 |
| From Trojan Horses to Castle Walls: Unveiling Bilateral Data Poisoning Effects in Diffusion Models | Code | 0 |
| DEPN: Detecting and Editing Privacy Neurons in Pretrained Language Models | Code | 1 |
| Bridging Lottery Ticket and Grokking: Understanding Grokking from Inner Structure of Networks | Code | 0 |
| The statistical thermodynamics of generative diffusion models: Phase transitions, symmetry breaking and critical instability | - | 0 |
| Grokking in Linear Estimators -- A Solvable Model that Groks without Understanding | - | 0 |
| SoK: Memorization in General-Purpose Large Language Models | - | 0 |
| MoPe: Model Perturbation-based Privacy Attacks on Language Models | - | 0 |
| Implications of Annotation Artifacts in Edge Probing Test Datasets | Code | 0 |
| Copyright Violations and Large Language Models | Code | 0 |
| To grok or not to grok: Disentangling generalization and memorization on corrupted algorithmic datasets | Code | 1 |
| AgentTuning: Enabling Generalized Agent Abilities for LLMs | Code | 3 |
| ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks | - | 0 |
| Training Dynamics of Deep Network Linear Regions | - | 0 |
| Cousins Of The Vendi Score: A Family Of Similarity-Based Diversity Metrics For Science And Machine Learning | Code | 1 |
| Unintended Memorization in Large ASR Models, and How to Mitigate It | - | 0 |
| Generation or Replication: Auscultating Audio Latent Diffusion Models | - | 0 |
| Combating Label Noise With A General Surrogate Model For Sample Selection | - | 0 |
| On the Over-Memorization During Natural, Robust and Catastrophic Overfitting | Code | 0 |
| Why Train More? Effective and Efficient Membership Inference via Memorization | - | 0 |
| Beyond Memorization: Violating Privacy Via Inference with Large Language Models | Code | 1 |
| Exploring Memorization in Fine-tuned Language Models | - | 0 |
| What do larger image classifiers memorise? | - | 0 |
| Grokking as Compression: A Nonlinear Complexity Perspective | - | 0 |
| Probing Large Language Models from A Human Behavioral Perspective | - | 0 |
| The Emergence of Reproducibility and Generalizability in Diffusion Models | Code | 1 |
| On Memorization in Diffusion Models | Code | 1 |
| Generalization in diffusion models arises from geometry-adaptive harmonic representations | Code | 1 |
| How Much Training Data is Memorized in Overparameterized Autoencoders? An Inverse Problem Perspective on Memorization Evaluation | - | 0 |
| Scaling Laws for Associative Memories | - | 0 |
| AutomaTikZ: Text-Guided Synthesis of Scientific Vector Graphics with TikZ | Code | 1 |
| On Memorization and Privacy Risks of Sharpness Aware Minimization | - | 0 |
| Memorization With Neural Nets: Going Beyond the Worst Case | Code | 0 |
| Leave-one-out Distinguishability in Machine Learning | Code | 0 |
| LawBench: Benchmarking Legal Knowledge of Large Language Models | Code | 2 |
| Identifying and Mitigating Privacy Risks Stemming from Language Models: A Survey | - | 0 |
| Learning From Noisy Correspondence With Tri-Partition for Cross-Modal Matching | - | 0 |
| Extreme Image Transformations Facilitate Robust Latent Object Representations | - | 0 |
| Analysis of the Memorization and Generalization Capabilities of AI Agents: Are Continual Learners Robust? | - | 0 |
| Collectionless Artificial Intelligence | - | 0 |
| Text Encoders Lack Knowledge: Leveraging Generative LLMs for Domain-Specific Semantic Textual Similarity | - | 0 |
| Do PLMs Know and Understand Ontological Knowledge? | Code | 1 |
| Quantifying and Attributing the Hallucination of Large Language Models via Association Analysis | - | 0 |
| When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale | - | 0 |
| FLM-101B: An Open LLM and How to Train It with $100K Budget | - | 0 |
| On the Planning, Search, and Memorization Capabilities of Large Language Models | - | 0 |

Benchmark Results

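| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PaLM-540B (few-shot, k=5) | Accuracy | 95.4 | - | Unverified |
| 2 | Gopher-280B (few-shot, k=5) | Accuracy | 80 | - | Unverified |
| 3 | PaLM-62B (few-shot, k=5) | Accuracy | 77.7 | - | Unverified |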