SOTAVerified

Memorization

Papers

Showing 76100 of 1088 papers

TitleStatusHype
LlamaCare: A Large Medical Language Model for Enhancing Healthcare Knowledge SharingCode1
PertEval: Unveiling Real Knowledge Capacity of LLMs with Knowledge-Invariant PerturbationsCode1
Large Scale Knowledge WashingCode1
Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood DiscrepancyCode1
Rethinking Graph Backdoor Attacks: A Distribution-Preserving PerspectiveCode1
MemLLM: Finetuning LLMs to Use An Explicit Read-Write MemoryCode1
Offset Unlearning for Large Language ModelsCode1
Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language ModelsCode1
Localizing Paragraph Memorization in Language ModelsCode1
A Unified Framework for Model EditingCode1
Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization CorrelationsCode1
Unveiling and Mitigating Memorization in Text-to-image Diffusion Models through Cross AttentionCode1
Meaningful Learning: Enhancing Abstract Reasoning in Large Language Models via Generic Fact GuidanceCode1
Beyond Memorization: The Challenge of Random Memory Access in Language ModelsCode1
Elephants Never Forget: Testing Language Models for Memorization of Tabular DataCode1
Quantifying Contamination in Evaluating Code Generation Capabilities of Language ModelsCode1
Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language ModelsCode1
Copyright Traps for Large Language ModelsCode1
The Mirrored Influence Hypothesis: Efficient Data Influence Estimation by Harnessing Forward PassesCode1
Erasing Undesirable Influence in Diffusion ModelsCode1
RoleEval: A Bilingual Role Evaluation Benchmark for Large Language ModelsCode1
Negative Pre-aware for Noisy Cross-modal MatchingCode1
Sparse Low-rank Adaptation of Pre-trained Language ModelsCode1
Data Contamination Quiz: A Tool to Detect and Estimate Contamination in Large Language ModelsCode1
DEPN: Detecting and Editing Privacy Neurons in Pretrained Language ModelsCode1
Show:102550
← PrevPage 4 of 44Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM-540B (few-shot, k=5)Accuracy95.4Unverified
2Gopher-280B (few-shot, k=5)Accuracy80Unverified
3PaLM-62B (few-shot, k=5)Accuracy77.7Unverified