SOTAVerified

Memorization

Papers

Showing 401-450 of 1088 papers

| Title | Status | Hype |
|---|---|---|
| Thinking Tokens for Language Modeling | | 0 |
| HMT: Hierarchical Memory Transformer for Long Context Language Processing | Code | 2 |
| An Inversion-based Measure of Memorization for Diffusion Models | Code | 0 |
| To Each (Textual Sequence) Its Own: Improving Memorized-Data Unlearning in Large Language Models | | 0 |
| Exploring prompts to elicit memorization in masked language model-based named entity recognition | | 0 |
| Mothman at SemEval-2024 Task 9: An Iterative System for Chain-of-Thought Prompt Optimization | | 0 |
| Report on the AAPM Grand Challenge on deep generative modeling for learning medical image statistics | | 0 |
| Post-hoc and manifold explanations analysis of facial expression data based on deep learning | Code | 0 |
| Quantifying Memorization and Detecting Training Data of Pre-trained Language Models using Japanese Newspaper | | 0 |
| From Matching to Generation: A Survey on Generative Information Retrieval | Code | 3 |
| Rethinking LLM Memorization through the Lens of Adversarial Compression | | 0 |
| Does Your Neural Code Completion Model Use My Code? A Membership Inference Approach | | 0 |
| Reliable Model Watermarking: Defending Against Theft without Compromising on Evasion | | 0 |
| The Positivity of the Neural Tangent Kernel | | 0 |
| Offset Unlearning for Large Language Models | Code | 1 |
| MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory | Code | 1 |
| The Fault in our Stars: Quality Assessment of Code Generation Benchmarks | | 0 |
| Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models | Code | 1 |
| AI Knowledge and Reasoning: Emulating Expert Creativity in Scientific Research | | 0 |
| GP-MoLFormer: A Foundation Model For Molecular Generation | | 0 |
| What Can Transformer Learn with Varying Depth? Case Studies on Sequence Learning Tasks | | 0 |
| Towards Better Generalization in Open-Domain Question Answering by Mitigating Context Memorization | | 0 |
| SoK: A Review of Differentially Private Linear Models For High-Dimensional Data | | 0 |
| Towards Memorization-Free Diffusion Models | | 0 |
| Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs | Code | 0 |
| Localizing Paragraph Memorization in Language Models | Code | 1 |
| Protecting Copyrighted Material with Unique Identifiers in Large Language Model Training | Code | 0 |
| Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization Correlations | Code | 1 |
| A Unified Framework for Model Editing | Code | 1 |
| DP-RDM: Adapting Diffusion Models to Private Domains Without Fine-Tuning | Code | 0 |
| Consistent Diffusion Meets Tweedie: Training Exact Ambient Diffusion Models with Noisy Data | Code | 2 |
| Tackling Noisy Labels with Network Parameter Additive Decomposition | Code | 0 |
| Self-generated Replay Memories for Continual Neural Machine Translation | Code | 0 |
| Unveiling and Mitigating Memorization in Text-to-image Diffusion Models through Cross Attention | Code | 1 |
| Uncertainty-Aware Pseudo-Label Filtering for Source-Free Unsupervised Domain Adaptation | Code | 0 |
| Meaningful Learning: Enhancing Abstract Reasoning in Large Language Models via Generic Fact Guidance | Code | 1 |
| Soften to Defend: Towards Adversarial Robustness via Self-Guided Label Refinement | | 0 |
| Ethos: Rectifying Language Models in Orthogonal Parameter Space | | 0 |
| A Decade's Battle on Dataset Bias: Are We There Yet? | Code | 2 |
| Beyond Memorization: The Challenge of Random Memory Access in Language Models | Code | 1 |
| Elephants Never Forget: Testing Language Models for Memorization of Tabular Data | Code | 1 |
| Quantifying Contamination in Evaluating Code Generation Capabilities of Language Models | Code | 1 |
| Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs | Code | 0 |
| LLM-Oriented Retrieval Tuner | | 0 |
| SciAssess: Benchmarking LLM Proficiency in Scientific Literature Analysis | Code | 2 |
| ROME: Memorization Insights from Text, Logits and Representation | | 0 |
| Unveiling Privacy, Memorization, and Input Curvature Links | | 0 |
| Learning Associative Memories with Gradient Descent | | 0 |
| Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language Models | Code | 1 |
| Unified View of Grokking, Double Descent and Emergent Abilities: A Perspective from Circuits Competition | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PaLM-540B (few-shot, k=5) | Accuracy | 95.4 | | Unverified |
| 2 | Gopher-280B (few-shot, k=5) | Accuracy | 80 | | Unverified |
| 3 | PaLM-62B (few-shot, k=5) | Accuracy | 77.7 | | Unverified |