| Emergent LLM behaviors are observationally equivalent to data leakage | May 26, 2025 | Memorization | Code Available | 0 | 5 |
| Emergent and Predictable Memorization in Large Language Models | Apr 21, 2023 | Memorization | Code Available | 0 | 5 |
| Capacity Matters: a Proof-of-Concept for Transformer Memorization on Real-World Data | Jun 17, 2025 | Memorization | Code Available | 0 | 5 |
| Holistic Label Correction for Noisy Multi-Label Classification | Jan 1, 2023 | Classification, Memorization | Code Available | 0 | 5 |
| DAS3H: Modeling Student Learning and Forgetting for Optimally Scheduling Distributed Practice of Skills | May 14, 2019 | Knowledge Tracing, Memorization | Code Available | 0 | 5 |
| How does Disagreement Help Generalization against Label Corruption? | Jan 14, 2019 | Learning with noisy labels, Memorization | Code Available | 0 | 5 |
| LOPS: Learning Order Inspired Pseudo-Label Selection for Weakly Supervised Text Classification | May 25, 2022 | Memorization, Pseudo Label | Code Available | 0 | 5 |
| Long-Tail Theory under Gaussian Mixtures | Jul 20, 2023 | Memorization | Code Available | 0 | 5 |
| Memorization in Attention-only Transformers | Nov 15, 2024 | Memorization | Code Available | 0 | 5 |
| Capacity Control is an Effective Memorization Mitigation Mechanism in Text-Conditional Diffusion Models | Oct 29, 2024 | Memorization, parameter-efficient fine-tuning | Code Available | 0 | 5 |
| Leveraging Unlabeled Data to Track Memorization | Dec 8, 2022 | Memorization | Code Available | 0 | 5 |
| HumBEL: A Human-in-the-Loop Approach for Evaluating Demographic Factors of Language Models in Human-Machine Conversations | May 23, 2023 | Memorization | Code Available | 0 | 5 |
| LexiMark: Robust Watermarking via Lexical Substitutions to Enhance Membership Verification of an LLM's Textual Training Data | Jun 17, 2025 | Memorization | Code Available | 0 | 5 |
| How to Engage Your Readers? Generating Guiding Questions to Promote Active Reading | Jul 19, 2024 | Articles, Memorization | Code Available | 0 | 5 |
| Evaluating LLMs for Quotation Attribution in Literary Texts: A Case Study of LLaMa3 | Jun 17, 2024 | Memorization | Code Available | 0 | 5 |
| Leave-one-out Distinguishability in Machine Learning | Sep 29, 2023 | Gaussian Processes, Memorization | Code Available | 0 | 5 |
| LiDAR-based localization using universal encoding and memory-aware regression | Aug 1, 2022 | Memorization, regression | Code Available | 0 | 5 |
| LLMs and Memorization: On Quality and Specificity of Copyright Compliance | May 28, 2024 | Hallucination, Memorization | Code Available | 0 | 5 |
| Dynamic Named Entity Recognition | Feb 16, 2023 | Entity Typing, Memorization | Code Available | 0 | 5 |
| Identifying Memorization of Diffusion Models through p-Laplace Analysis | May 13, 2025 | Memorization | Code Available | 0 | 5 |
| A Good Score Does not Lead to A Good Generative Model | Jan 10, 2024 | Density Estimation, Memorization | Code Available | 0 | 5 |
| Learning with Open-world Noisy Data via Class-independent Margin in Dual Representation Space | Jan 19, 2025 | Contrastive Learning, Learning with noisy labels | Code Available | 0 | 5 |
| ModelPred: A Framework for Predicting Trained Model from Training Data | Nov 24, 2021 | Data Valuation, Memorization | Code Available | 0 | 5 |
| A Probabilistic Fluctuation based Membership Inference Attack for Diffusion Models | Aug 23, 2023 | Inference Attack, Membership Inference Attack | Code Available | 0 | 5 |
| Dataset distillation for memorized data: Soft labels can leak held-out teacher knowledge | Jun 17, 2025 | Dataset Distillation, Memorization | Code Available | 0 | 5 |
| How Spurious Features Are Memorized: Precise Analysis for Random and NTK Features | May 20, 2023 | Learning Theory, Memorization | Code Available | 0 | 5 |
| Learning to Infer Program Sketches | Feb 17, 2019 | Memorization, Program Synthesis | Code Available | 0 | 5 |
| DP-RDM: Adapting Diffusion Models to Private Domains Without Fine-Tuning | Mar 21, 2024 | Memorization, Retrieval | Code Available | 0 | 5 |
| Data Watermarking for Sequential Recommender Systems | Nov 20, 2024 | Memorization, Recommendation Systems | Code Available | 0 | 5 |
| Tackling Noisy Labels with Network Parameter Additive Decomposition | Mar 20, 2024 | Memorization | Code Available | 0 | 5 |
| Broccoli: Sprinkling Lightweight Vocabulary Learning into Everyday Information Diets | Apr 16, 2021 | Language Acquisition, Memorization | Code Available | 0 | 5 |
| AGITB: A Signal-Level Benchmark for Evaluating Artificial General Intelligence | Apr 6, 2025 | Memorization, Response Generation | Code Available | 0 | 5 |
| KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server | Oct 8, 2024 | Federated Learning, Knowledge Distillation | Code Available | 0 | 5 |
| Jogging the Memory of Unlearned LLMs Through Targeted Relearning Attacks | Jun 19, 2024 | Articles, Machine Unlearning | Code Available | 0 | 5 |
| Iterative Graph Alignment | Aug 29, 2024 | Diversity, Memorization | Code Available | 0 | 5 |
| Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs | Mar 28, 2024 | counterfactual, Memorization | Code Available | 0 | 5 |
| Do LLMs Memorize Recommendation Datasets? A Preliminary Study on MovieLens-1M | May 15, 2025 | Benchmarking, Memorization | Code Available | 0 | 5 |
| Do LLMs Dream of Ontologies? | Jan 26, 2024 | Memorization | Code Available | 0 | 5 |
| Untrained neural networks can demonstrate memorization-independent abstract reasoning | Jul 25, 2024 | Memorization, Visual Reasoning | Code Available | 0 | 5 |
| Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion Models | Jul 22, 2024 | Data Augmentation, Memorization | Code Available | 0 | 5 |
| Beyond Memorization: A Rigorous Evaluation Framework for Medical Knowledge Editing | Jun 4, 2025 | knowledge editing, Memorization | Code Available | 0 | 5 |
| Investigating Memorization of Conspiracy Theories in Text Generation | Jan 2, 2021 | Hallucination, Memorization | Code Available | 0 | 5 |
| MMReason: An Open-Ended Multi-Modal Multi-Step Reasoning Benchmark for MLLMs Toward AGI | Jun 30, 2025 | Memorization | Code Available | 0 | 5 |
| Does Pre-trained Language Model Actually Infer Unseen Links in Knowledge Graph Completion? | Nov 15, 2023 | Knowledge Graph Completion, Knowledge Graphs | Unverified | 0 | 0 |
| Bounds for the smallest eigenvalue of the NTK for arbitrary spherical data of arbitrary dimension | May 23, 2024 | Memorization | Unverified | 0 | 0 |
| Does Learning Require Memorization? A Short Tale about a Long Tail | Jun 12, 2019 | Memorization, Model Compression | Unverified | 0 | 0 |
| Does it Really Generalize Well on Unseen Data? Systematic Evaluation of Relational Triple Extraction Methods | Jul 1, 2022 | Knowledge Graphs, Memorization | Unverified | 0 | 0 |
| Bounding Information Leakage in Machine Learning | May 9, 2021 | Attribute, BIG-bench Machine Learning | Unverified | 0 | 0 |
| DNN or k-NN: That is the Generalize vs. Memorize Question | May 17, 2018 | Memorization | Unverified | 0 | 0 |
| Distribution Shift Matters for Knowledge Distillation with Webly Collected Images | Jul 21, 2023 | Contrastive Learning, Data-free Knowledge Distillation | Unverified | 0 | 0 |