SOTAVerified

Memorization

Papers

Showing 101-125 of 1088 papers

Title | Status | Hype
WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization | | 0
RARE: Retrieval-Augmented Reasoning Modeling | Code | 2
Factored Agents: Decoupling In-Context Learning and Memorization for Robust Tool Use | | 0
SUV: Scalable Large Language Model Copyright Compliance with Regularized Selective Unlearning | | 0
The Reasoning-Memorization Interplay in Language Models Is Mediated by a Single Direction | | 0
Malicious and Unintentional Disclosure Risks in Large Language Models for Code Generation | | 0
Quantifying the Ease of Reproducing Training Data in Unconditional Diffusion Models | | 0
LoTUS: Large-Scale Machine Unlearning with a Taste of Uncertainty | Code | 1
PALATE: Peculiar Application of the Law of Total Expectation to Enhance the Evaluation of Deep Generative Models | Code | 0
Exploring the Hidden Reasoning Process of Large Language Models by Misleading Them | | 0
Can Language Models Follow Multiple Turns of Entangled Instructions? | Code | 1
BLIA: Detect model memorization in binary classification model through passive Label Inference attack | | 0
Empirical Privacy Variance | | 0
PrivacyScalpel: Enhancing LLM Privacy via Interpretable Feature Intervention with Sparse Autoencoders | | 0
DynaCode: A Dynamic Complexity-Aware Code Benchmark for Evaluating Large Language Models in Code Generation | | 0
Trustworthy Machine Learning via Memorization and the Granular Long-Tail: A Survey on Interactions, Tradeoffs, and Beyond | | 0
Pre-Training Meta-Rule Selection Policy for Visual Generative Abductive Learning | Code | 0
Privacy Auditing of Large Language Models | | 0
Mitigating Memorization in LLMs using Activation Steering | | 0
CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation | | 0
Dynamic-KGQA: A Scalable Framework for Generating Adaptive Question Answering Datasets | | 0
Robust Data Watermarking in Language Models by Injecting Fictitious Knowledge | Code | 0
Memorize or Generalize? Evaluating LLM Code Generation with Evolved Questions | | 0
Privacy-Preserving Fair Synthetic Tabular Data | | 0
Superficial Self-Improved Reasoners Benefit from Model Merging | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PaLM-540B (few-shot, k=5) | Accuracy | 95.4 | | Unverified
2 | Gopher-280B (few-shot, k=5) | Accuracy | 80 | | Unverified
3 | PaLM-62B (few-shot, k=5) | Accuracy | 77.7 | | Unverified