SOTAVerified

Memorization

Papers

Showing 401-450 of 1088 papers

| Title | Status | Hype |
|---|---|---|
| Emergent LLM behaviors are observationally equivalent to data leakage | Code | 0 |
| Emergent and Predictable Memorization in Large Language Models | Code | 0 |
| Capacity Matters: a Proof-of-Concept for Transformer Memorization on Real-World Data | Code | 0 |
| Holistic Label Correction for Noisy Multi-Label Classification | Code | 0 |
| DAS3H: Modeling Student Learning and Forgetting for Optimally Scheduling Distributed Practice of Skills | Code | 0 |
| How does Disagreement Help Generalization against Label Corruption? | Code | 0 |
| LOPS: Learning Order Inspired Pseudo-Label Selection for Weakly Supervised Text Classification | Code | 0 |
| Long-Tail Theory under Gaussian Mixtures | Code | 0 |
| Memorization in Attention-only Transformers | Code | 0 |
| Capacity Control is an Effective Memorization Mitigation Mechanism in Text-Conditional Diffusion Models | Code | 0 |
| Leveraging Unlabeled Data to Track Memorization | Code | 0 |
| HumBEL: A Human-in-the-Loop Approach for Evaluating Demographic Factors of Language Models in Human-Machine Conversations | Code | 0 |
| LexiMark: Robust Watermarking via Lexical Substitutions to Enhance Membership Verification of an LLM's Textual Training Data | Code | 0 |
| How to Engage Your Readers? Generating Guiding Questions to Promote Active Reading | Code | 0 |
| Evaluating LLMs for Quotation Attribution in Literary Texts: A Case Study of LLaMa3 | Code | 0 |
| Leave-one-out Distinguishability in Machine Learning | Code | 0 |
| LiDAR-based localization using universal encoding and memory-aware regression | Code | 0 |
| LLMs and Memorization: On Quality and Specificity of Copyright Compliance | Code | 0 |
| Dynamic Named Entity Recognition | Code | 0 |
| Identifying Memorization of Diffusion Models through p-Laplace Analysis | Code | 0 |
| A Good Score Does not Lead to A Good Generative Model | Code | 0 |
| Learning with Open-world Noisy Data via Class-independent Margin in Dual Representation Space | Code | 0 |
| ModelPred: A Framework for Predicting Trained Model from Training Data | Code | 0 |
| A Probabilistic Fluctuation based Membership Inference Attack for Diffusion Models | Code | 0 |
| Dataset distillation for memorized data: Soft labels can leak held-out teacher knowledge | Code | 0 |
| How Spurious Features Are Memorized: Precise Analysis for Random and NTK Features | Code | 0 |
| Learning to Infer Program Sketches | Code | 0 |
| DP-RDM: Adapting Diffusion Models to Private Domains Without Fine-Tuning | Code | 0 |
| Data Watermarking for Sequential Recommender Systems | Code | 0 |
| Tackling Noisy Labels with Network Parameter Additive Decomposition | Code | 0 |
| Broccoli: Sprinkling Lightweight Vocabulary Learning into Everyday Information Diets | Code | 0 |
| AGITB: A Signal-Level Benchmark for Evaluating Artificial General Intelligence | Code | 0 |
| KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server | Code | 0 |
| Jogging the Memory of Unlearned LLMs Through Targeted Relearning Attacks | Code | 0 |
| Iterative Graph Alignment | Code | 0 |
| Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs | Code | 0 |
| Do LLMs Memorize Recommendation Datasets? A Preliminary Study on MovieLens-1M | Code | 0 |
| Do LLMs Dream of Ontologies? | Code | 0 |
| Untrained neural networks can demonstrate memorization-independent abstract reasoning | Code | 0 |
| Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion Models | Code | 0 |
| Beyond Memorization: A Rigorous Evaluation Framework for Medical Knowledge Editing | Code | 0 |
| Investigating Memorization of Conspiracy Theories in Text Generation | Code | 0 |
| MMReason: An Open-Ended Multi-Modal Multi-Step Reasoning Benchmark for MLLMs Toward AGI | Code | 0 |
| Does Pre-trained Language Model Actually Infer Unseen Links in Knowledge Graph Completion? | | 0 |
| Bounds for the smallest eigenvalue of the NTK for arbitrary spherical data of arbitrary dimension | | 0 |
| Does Learning Require Memorization? A Short Tale about a Long Tail | | 0 |
| Does it Really Generalize Well on Unseen Data? Systematic Evaluation of Relational Triple Extraction Methods | | 0 |
| Bounding Information Leakage in Machine Learning | | 0 |
| DNN or k-NN: That is the Generalize vs. Memorize Question | | 0 |
| Distribution Shift Matters for Knowledge Distillation with Webly Collected Images | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PaLM-540B (few-shot, k=5) | Accuracy | 95.4 | | Unverified |
| 2 | Gopher-280B (few-shot, k=5) | Accuracy | 80 | | Unverified |
| 3 | PaLM-62B (few-shot, k=5) | Accuracy | 77.7 | | Unverified |