SOTAVerified

Memorization

Papers

Showing 351400 of 1088 papers

TitleStatusHype
Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction0
Can LLM Graph Reasoning Generalize beyond Pattern Memorization?Code1
Scaling Laws for Fact Memorization of Large Language Models0
Uncovering Latent Memories: Assessing Data Leakage and Memorization Patterns in Frontier AI Models0
Mind the Privacy Unit! User-Level Differential Privacy for Language Model Fine-Tuning0
Protecting Privacy Through Approximating Optimal Parameters for Sequence Unlearning in Language Models0
Jogging the Memory of Unlearned LLMs Through Targeted Relearning AttacksCode0
Data Contamination Can Cross Language BarriersCode1
Extracting Training Data from Unconditional Diffusion Models0
Unmasking the Veil: An Investigation into Concept Ablation for Privacy and Copyright Protection in ImagesCode0
How Do Large Language Models Acquire Factual Knowledge During Pretraining?Code1
Evaluating LLMs for Quotation Attribution in Literary Texts: A Case Study of LLaMa3Code0
Measuring memorization in RLHF for code completion0
Exposing the Achilles' Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning0
Automating Pharmacovigilance Evidence Generation: Using Large Language Models to Produce Context-Aware SQL0
Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMsCode2
Between Randomness and Arbitrariness: Some Lessons for Reliable Machine Learning at Scale0
Exploring Fact Memorization and Style Imitation in LLMs Using QLoRA: An Experimental Study and Quality Assessment Methods0
Improving LLMs for Recommendation with Out-Of-Vocabulary Tokens0
Diffusion Soup: Model Merging for Text-to-Image Diffusion Models0
Self-Tuning: Instructing LLMs to Effectively Acquire New Knowledge through Self-TeachingCode0
Large Language Models Memorize Sensor Datasets! Implications on Human Activity Recognition Research0
What Should Embeddings Embed? Autoregressive Models Represent Latent Generating Distributions0
Memorization in deep learning: A survey0
Efficiently Train ASR Models that Memorize Less and Perform Better with Per-core Clipping0
Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion ModelsCode1
LlamaCare: A Large Medical Language Model for Enhancing Healthcare Knowledge SharingCode1
Differentially Private Fine-Tuning of Diffusion Models0
Memorized Images in Diffusion Models share a Subspace that can be Located and Deleted0
A Machine Learning-Based Framework for Assessing Cryptographic Indistinguishability of Lightweight Block Ciphers0
PertEval: Unveiling Real Knowledge Capacity of LLMs with Knowledge-Invariant PerturbationsCode1
MemControl: Mitigating Memorization in Diffusion Models via Automated Parameter SelectionCode0
LLMs and Memorization: On Quality and Specificity of Copyright ComplianceCode0
How Do the Architecture and Optimizer Affect Representation Learning? On the Training Dynamics of Representations in Deep Neural Networks0
When does compositional structure yield compositional generalization? A kernel theoryCode0
Large Scale Knowledge WashingCode1
Unsupervised Meta-Learning via In-Context Learning0
The Mosaic Memory of Large Language ModelsCode0
Bounds for the smallest eigenvalue of the NTK for arbitrary spherical data of arbitrary dimension0
Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood DiscrepancyCode1
Next-token prediction capacity: general upper bounds and a lower bound for transformersCode0
FINED: Feed Instance-Wise Information Need with Essential and Disentangled Parametric Knowledge from the Past0
Quantifying In-Context Reasoning Effects and Memorization Effects in LLMs0
Asymptotic theory of in-context learning by linear attentionCode0
A Multi-Perspective Analysis of Memorization in Large Language Models0
Rethinking Graph Backdoor Attacks: A Distribution-Preserving PerspectiveCode1
Learnable Privacy Neurons Localization in Language Models0
Generalized Holographic Reduced Representations0
Dynamic Loss Decay based Robust Oriented Object Detection on Remote Sensing Images with Noisy Labels0
Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory0
Show:102550
← PrevPage 8 of 22Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM-540B (few-shot, k=5)Accuracy95.4Unverified
2Gopher-280B (few-shot, k=5)Accuracy80Unverified
3PaLM-62B (few-shot, k=5)Accuracy77.7Unverified