| Towards Data Contamination Detection for Modern Large Language Models: Limitations, Inconsistencies, and Oracle Challenges | Sep 16, 2024 | Memorization | Code Available | 0 |
| Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs | Mar 5, 2024 | Memorization | Code Available | 0 |
| ODIM: Outlier Detection via Likelihood of Under-Fitted Generative Models | Jan 11, 2023 | Memorization, Outlier Detection | Code Available | 0 |
| Dynamic Named Entity Recognition | Feb 16, 2023 | Entity Typing, Memorization | Code Available | 0 |
| Olive Oil is Made of Olives, Baby Oil is Made for Babies: Interpreting Noun Compounds using Paraphrases in a Neural Model | Mar 21, 2018 | Memorization, Relation | Code Available | 0 |
| Affective Medical Estimation and Decision Making via Visualized Learning and Deep Learning | May 9, 2022 | Decision Making, Memorization | Code Available | 0 |
| DP-RDM: Adapting Diffusion Models to Private Domains Without Fine-Tuning | Mar 21, 2024 | Memorization, Retrieval | Code Available | 0 |
| Tackling Noisy Labels with Network Parameter Additive Decomposition | Mar 20, 2024 | Memorization | Code Available | 0 |
| Do LLMs Memorize Recommendation Datasets? A Preliminary Study on MovieLens-1M | May 15, 2025 | Benchmarking, Memorization | Code Available | 0 |
| Do LLMs Dream of Ontologies? | Jan 26, 2024 | Memorization | Code Available | 0 |
| A Causal View of Entity Bias in (Large) Language Models | May 24, 2023 | Machine Reading Comprehension, Memorization | Code Available | 0 |
| Robust Data Watermarking in Language Models by Injecting Fictitious Knowledge | Mar 6, 2025 | Continual Pretraining, Memorization | Code Available | 0 |
| What they do when in doubt: a study of inductive biases in seq2seq learners | Jun 26, 2020 | Memorization | Code Available | 0 |
| Does fine-tuning GPT-3 with the OpenAI API leak personally-identifiable information? | Jul 31, 2023 | Memorization | Code Available | 0 |
| On Memorization in Probabilistic Deep Generative Models | Jun 6, 2021 | Density Estimation, Memorization | Code Available | 0 |
| On Memorization in Probabilistic Deep Generative Models | Dec 1, 2021 | Density Estimation, Memorization | Code Available | 0 |
| Robust Generalization and Safe Query-Specialization in Counterfactual Learning to Rank | Feb 11, 2021 | counterfactual, Learning-To-Rank | Code Available | 0 |
| Teach2Eval: An Indirect Evaluation Method for LLM by Judging How It Teaches | May 18, 2025 | Fairness, Memorization | Code Available | 0 |
| ChronoSense: Exploring Temporal Understanding in Large Language Models with Time Intervals of Events | Jan 6, 2025 | Memorization, Natural Language Understanding | Code Available | 0 |
| On the Generalization and Causal Explanation in Self-Supervised Learning | Oct 1, 2024 | Memorization, Self-Supervised Learning | Code Available | 0 |
| ALIGNet: Partial-Shape Agnostic Alignment via Unsupervised Learning | Apr 23, 2018 | Memorization | Code Available | 0 |
| Causal Cartographer: From Mapping to Reasoning Over Counterfactual Worlds | May 20, 2025 | Causal Inference, counterfactual | Code Available | 0 |
| Towards More Realistic Extraction Attacks: An Adversarial Perspective | Jul 2, 2024 | Memorization | Code Available | 0 |
| Rotational Unit of Memory | Oct 26, 2017 | Language Modeling, Language Modelling | Code Available | 0 |
| The (ab)use of Open Source Code to Train Large Language Models | Feb 27, 2023 | Memorization | Code Available | 0 |
| On the Over-Memorization During Natural, Robust and Catastrophic Overfitting | Oct 13, 2023 | Memorization | Code Available | 0 |
| R.R.: Unveiling LLM Training Privacy through Recollection and Ranking | Feb 18, 2025 | Memorization | Code Available | 0 |
| On the Privacy Effect of Data Enhancement via the Lens of Memorization | Aug 17, 2022 | Adversarial Robustness, Data Augmentation | Code Available | 0 |
| On the privacy-utility trade-off in differentially private hierarchical text classification | Mar 4, 2021 | General Classification, Inference Attack | Code Available | 0 |
| Salvage Reusable Samples from Noisy Data for Robust Learning | Aug 6, 2020 | Memorization | Code Available | 0 |
| Memorization of Named Entities in Fine-tuned BERT Models | Dec 7, 2022 | Memorization, Privacy Preserving | Code Available | 0 |
| CASCADE Your Datasets for Cross-Mode Knowledge Retrieval of Language Models | Apr 2, 2025 | Memorization, Retrieval | Code Available | 0 |
| Ancient Chinese Word Segmentation and Part-of-Speech Tagging Using Distant Supervision | Mar 3, 2023 | Chinese Word Segmentation, Memorization | Code Available | 0 |
| Capacity Matters: a Proof-of-Concept for Transformer Memorization on Real-World Data | Jun 17, 2025 | Memorization | Code Available | 0 |
| On Training Sample Memorization: Lessons from Benchmarking Generative Modeling with a Large-scale Competition | Jun 6, 2021 | Benchmarking, Memorization | Code Available | 0 |
| Towards Robust Learning with Different Label Noise Distributions | Dec 18, 2019 | Memorization, Representation Learning | Code Available | 0 |
| Optimal Kronecker-Sum Approximation of Real Time Recurrent Learning | Feb 11, 2019 | Memorization | Code Available | 0 |
| The Devil is in the Prompts: De-Identification Traces Enhance Memorization Risks in Synthetic Chest X-Ray Generation | Feb 11, 2025 | Benchmarking, De-identification | Code Available | 0 |