SOTAVerified

Memorization

Papers

Showing 301–350 of 1088 papers

Title | Status | Hype
Factored Agents: Decoupling In-Context Learning and Memorization for Robust Tool Use | | 0
The Reasoning-Memorization Interplay in Language Models Is Mediated by a Single Direction | | 0
SUV: Scalable Large Language Model Copyright Compliance with Regularized Selective Unlearning | | 0
Malicious and Unintentional Disclosure Risks in Large Language Models for Code Generation | | 0
Quantifying the Ease of Reproducing Training Data in Unconditional Diffusion Models | | 0
PALATE: Peculiar Application of the Law of Total Expectation to Enhance the Evaluation of Deep Generative Models | Code | 0
Exploring the Hidden Reasoning Process of Large Language Models by Misleading Them | | 0
BLIA: Detect model memorization in binary classification model through passive Label Inference attack | | 0
Empirical Privacy Variance | | 0
PrivacyScalpel: Enhancing LLM Privacy via Interpretable Feature Intervention with Sparse Autoencoders | | 0
DynaCode: A Dynamic Complexity-Aware Code Benchmark for Evaluating Large Language Models in Code Generation | | 0
Trustworthy Machine Learning via Memorization and the Granular Long-Tail: A Survey on Interactions, Tradeoffs, and Beyond | | 0
Pre-Training Meta-Rule Selection Policy for Visual Generative Abductive Learning | Code | 0
Privacy Auditing of Large Language Models | | 0
Mitigating Memorization in LLMs using Activation Steering | | 0
CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation | | 0
Robust Data Watermarking in Language Models by Injecting Fictitious Knowledge | Code | 0
Dynamic-KGQA: A Scalable Framework for Generating Adaptive Question Answering Datasets | | 0
Privacy-Preserving Fair Synthetic Tabular Data | | 0
Memorize or Generalize? Evaluating LLM Code Generation with Evolved Questions | | 0
Superficial Self-Improved Reasoners Benefit from Model Merging | | 0
Watch Out Your Album! On the Inadvertent Privacy Memorization in Multi-Modal Large Language Models | Code | 0
Asynchronous Personalized Federated Learning through Global Memorization | | 0
SolidMark: Evaluating Image Memorization in Generative Models | Code | 0
Holistic Audit Dataset Generation for LLM Unlearning via Knowledge Graph Traversal and Redundancy Removal | | 0
On the Interpolation Effect of Score Smoothing | | 0
IGDA: Interactive Graph Discovery through Large Language Model Agents | | 0
Reasoning with Latent Thoughts: On the Power of Looped Transformers | | 0
RELICT: A Replica Detection Framework for Medical Image Generation | Code | 0
On the Dichotomy Between Privacy and Traceability in ℓ_p Stochastic Convex Optimization | | 0
Swallowing the Poison Pills: Insights from Vulnerability Disparity Among LLMs | | 0
Interrogating LLM design under a fair learning doctrine | | 0
Generative AI Training and Copyright Law | | 0
Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training | Code | 0
CopyJudge: Automated Copyright Infringement Identification and Mitigation in Text-to-Image Diffusion Models | | 0
LIFT: Improving Long Context Understanding of Large Language Models through Long Input Fine-Tuning | | 0
Obliviate: Efficient Unmemorization for Protecting Intellectual Property in Large Language Models | | 0
Quantifying Memorization and Retriever Performance in Retrieval-Augmented Vision-Language Models | | 0
None of the Others: a General Technique to Distinguish Reasoning from Memorization in Multiple-Choice LLM Evaluation Benchmarks | | 0
R.R.: Unveiling LLM Training Privacy through Recollection and Ranking | Code | 0
Pruning as a Defense: Reducing Memorization in Large Language Models | | 0
Rethinking Benign Overfitting in Two-Layer Neural Networks | | 0
Continual Learning Should Move Beyond Incremental Classification | | 0
Logarithmic Width Suffices for Robust Memorization | | 0
Retrieval-augmented Encoders for Extreme Multi-label Text Classification | | 0
The Vendiscope: An Algorithmic Microscope For Data Collections | | 0
Diffusing DeBias: a Recipe for Turning a Bug into a Feature | | 0
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding | Code | 0
Redistribute Ensemble Training for Mitigating Memorization in Diffusion Models | Code | 0
Democratizing AI: Open-source Scalable LLM Training on GPU-based Supercomputers | | 0
Page 7 of 22

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PaLM-540B (few-shot, k=5) | Accuracy | 95.4 | | Unverified
2 | Gopher-280B (few-shot, k=5) | Accuracy | 80 | | Unverified
3 | PaLM-62B (few-shot, k=5) | Accuracy | 77.7 | | Unverified