SOTAVerified

Memorization

Papers

Showing 101–150 of 1088 papers

Title | Status | Hype
COSMO: Combination of Selective Memorization for Low-cost Vision-and-Language Navigation | - | 0
RARE: Retrieval-Augmented Reasoning Modeling | Code | 2
Factored Agents: Decoupling In-Context Learning and Memorization for Robust Tool Use | - | 0
The Reasoning-Memorization Interplay in Language Models Is Mediated by a Single Direction | - | 0
SUV: Scalable Large Language Model Copyright Compliance with Regularized Selective Unlearning | - | 0
Malicious and Unintentional Disclosure Risks in Large Language Models for Code Generation | - | 0
Quantifying the Ease of Reproducing Training Data in Unconditional Diffusion Models | - | 0
PALATE: Peculiar Application of the Law of Total Expectation to Enhance the Evaluation of Deep Generative Models | Code | 0
LoTUS: Large-Scale Machine Unlearning with a Taste of Uncertainty | Code | 1
Exploring the Hidden Reasoning Process of Large Language Models by Misleading Them | - | 0
Can Language Models Follow Multiple Turns of Entangled Instructions? | Code | 1
BLIA: Detect model memorization in binary classification model through passive Label Inference attack | - | 0
Empirical Privacy Variance | - | 0
PrivacyScalpel: Enhancing LLM Privacy via Interpretable Feature Intervention with Sparse Autoencoders | - | 0
DynaCode: A Dynamic Complexity-Aware Code Benchmark for Evaluating Large Language Models in Code Generation | - | 0
Trustworthy Machine Learning via Memorization and the Granular Long-Tail: A Survey on Interactions, Tradeoffs, and Beyond | - | 0
Pre-Training Meta-Rule Selection Policy for Visual Generative Abductive Learning | Code | 0
Privacy Auditing of Large Language Models | - | 0
Mitigating Memorization in LLMs using Activation Steering | - | 0
CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation | - | 0
Dynamic-KGQA: A Scalable Framework for Generating Adaptive Question Answering Datasets | - | 0
Robust Data Watermarking in Language Models by Injecting Fictitious Knowledge | Code | 0
Memorize or Generalize? Evaluating LLM Code Generation with Evolved Questions | - | 0
Privacy-Preserving Fair Synthetic Tabular Data | - | 0
Superficial Self-Improved Reasoners Benefit from Model Merging | - | 0
Watch Out Your Album! On the Inadvertent Privacy Memorization in Multi-Modal Large Language Models | Code | 0
Data Unlearning in Diffusion Models | Code | 1
SolidMark: Evaluating Image Memorization in Generative Models | Code | 0
Asynchronous Personalized Federated Learning through Global Memorization | - | 0
Holistic Audit Dataset Generation for LLM Unlearning via Knowledge Graph Traversal and Redundancy Removal | - | 0
On the Interpolation Effect of Score Smoothing | - | 0
On the Dichotomy Between Privacy and Traceability in ℓ_p Stochastic Convex Optimization | - | 0
RELICT: A Replica Detection Framework for Medical Image Generation | Code | 0
Reasoning with Latent Thoughts: On the Power of Looped Transformers | - | 0
IGDA: Interactive Graph Discovery through Large Language Model Agents | - | 0
Swallowing the Poison Pills: Insights from Vulnerability Disparity Among LLMs | - | 0
Interrogating LLM design under a fair learning doctrine | - | 0
CopyJudge: Automated Copyright Infringement Identification and Mitigation in Text-to-Image Diffusion Models | - | 0
Generative AI Training and Copyright Law | - | 0
Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training | Code | 0
Obliviate: Efficient Unmemorization for Protecting Intellectual Property in Large Language Models | - | 0
LIFT: Improving Long Context Understanding of Large Language Models through Long Input Fine-Tuning | - | 0
Quantifying Memorization and Retriever Performance in Retrieval-Augmented Vision-Language Models | - | 0
Pruning as a Defense: Reducing Memorization in Large Language Models | - | 0
R.R.: Unveiling LLM Training Privacy through Recollection and Ranking | Code | 0
None of the Others: a General Technique to Distinguish Reasoning from Memorization in Multiple-Choice LLM Evaluation Benchmarks | - | 0
Rethinking Benign Overfitting in Two-Layer Neural Networks | - | 0
Continual Learning Should Move Beyond Incremental Classification | - | 0
Logarithmic Width Suffices for Robust Memorization | - | 0
Retrieval-augmented Encoders for Extreme Multi-label Text Classification | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PaLM-540B (few-shot, k=5) | Accuracy | 95.4 | - | Unverified
2 | Gopher-280B (few-shot, k=5) | Accuracy | 80 | - | Unverified
3 | PaLM-62B (few-shot, k=5) | Accuracy | 77.7 | - | Unverified