| Factored Agents: Decoupling In-Context Learning and Memorization for Robust Tool Use | Mar 29, 2025 | In-Context Learning, Language Modeling | Unverified | 0 |
| The Reasoning-Memorization Interplay in Language Models Is Mediated by a Single Direction | Mar 29, 2025 | Answer Generation, Memorization | Unverified | 0 |
| SUV: Scalable Large Language Model Copyright Compliance with Regularized Selective Unlearning | Mar 29, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Malicious and Unintentional Disclosure Risks in Large Language Models for Code Generation | Mar 27, 2025 | Code Generation, Language Modeling | Unverified | 0 |
| Quantifying the Ease of Reproducing Training Data in Unconditional Diffusion Models | Mar 25, 2025 | Memorization | Unverified | 0 |
| PALATE: Peculiar Application of the Law of Total Expectation to Enhance the Evaluation of Deep Generative Models | Mar 24, 2025 | Computational Efficiency, Image Generation | Code Available | 0 |
| Exploring the Hidden Reasoning Process of Large Language Models by Misleading Them | Mar 20, 2025 | Math, Memorization | Unverified | 0 |
| BLIA: Detect model memorization in binary classification model through passive Label Inference attack | Mar 17, 2025 | Binary Classification, Inference Attack | Unverified | 0 |
| Empirical Privacy Variance | Mar 16, 2025 | Memorization | Unverified | 0 |
| PrivacyScalpel: Enhancing LLM Privacy via Interpretable Feature Intervention with Sparse Autoencoders | Mar 14, 2025 | Memorization, Privacy Preserving | Unverified | 0 |
| DynaCode: A Dynamic Complexity-Aware Code Benchmark for Evaluating Large Language Models in Code Generation | Mar 13, 2025 | Code Generation, mbpp | Unverified | 0 |
| Trustworthy Machine Learning via Memorization and the Granular Long-Tail: A Survey on Interactions, Tradeoffs, and Beyond | Mar 10, 2025 | Attribute, Fairness | Unverified | 0 |
| Pre-Training Meta-Rule Selection Policy for Visual Generative Abductive Learning | Mar 9, 2025 | Memorization | Code Available | 0 |
| Privacy Auditing of Large Language Models | Mar 9, 2025 | Memorization | Unverified | 0 |
| Mitigating Memorization in LLMs using Activation Steering | Mar 8, 2025 | Memorization, Privacy Preserving | Unverified | 0 |
| CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation | Mar 7, 2025 | Image Comprehension, Memorization | Unverified | 0 |
| Robust Data Watermarking in Language Models by Injecting Fictitious Knowledge | Mar 6, 2025 | Continual Pretraining, Memorization | Code Available | 0 |
| Dynamic-KGQA: A Scalable Framework for Generating Adaptive Question Answering Datasets | Mar 6, 2025 | Benchmarking, Dataset Generation | Unverified | 0 |
| Privacy-Preserving Fair Synthetic Tabular Data | Mar 4, 2025 | Fairness, Memorization | Unverified | 0 |
| Memorize or Generalize? Evaluating LLM Code Generation with Evolved Questions | Mar 4, 2025 | Code Generation, Data Augmentation | Unverified | 0 |
| Superficial Self-Improved Reasoners Benefit from Model Merging | Mar 3, 2025 | Memorization | Unverified | 0 |
| Watch Out Your Album! On the Inadvertent Privacy Memorization in Multi-Modal Large Language Models | Mar 3, 2025 | Memorization, Question Answering | Code Available | 0 |
| Asynchronous Personalized Federated Learning through Global Memorization | Mar 1, 2025 | Federated Learning, Memorization | Unverified | 0 |
| SolidMark: Evaluating Image Memorization in Generative Models | Mar 1, 2025 | Memorization | Code Available | 0 |
| Holistic Audit Dataset Generation for LLM Unlearning via Knowledge Graph Traversal and Redundancy Removal | Feb 26, 2025 | Dataset Generation, Knowledge Graphs | Unverified | 0 |
| On the Interpolation Effect of Score Smoothing | Feb 26, 2025 | Denoising, Memorization | Unverified | 0 |
| IGDA: Interactive Graph Discovery through Large Language Model Agents | Feb 24, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Reasoning with Latent Thoughts: On the Power of Looped Transformers | Feb 24, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| RELICT: A Replica Detection Framework for Medical Image Generation | Feb 24, 2025 | Image Generation, Medical Image Generation | Code Available | 0 |
| On the Dichotomy Between Privacy and Traceability in ℓ_p Stochastic Convex Optimization | Feb 24, 2025 | LEMMA, Memorization | Unverified | 0 |
| Swallowing the Poison Pills: Insights from Vulnerability Disparity Among LLMs | Feb 23, 2025 | Data Poisoning, Diagnostic | Unverified | 0 |
| Interrogating LLM design under a fair learning doctrine | Feb 22, 2025 | Memorization | Unverified | 0 |
| Generative AI Training and Copyright Law | Feb 21, 2025 | Memorization | Unverified | 0 |
| Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training | Feb 21, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| CopyJudge: Automated Copyright Infringement Identification and Mitigation in Text-to-Image Diffusion Models | Feb 21, 2025 | Memorization | Unverified | 0 |
| LIFT: Improving Long Context Understanding of Large Language Models through Long Input Fine-Tuning | Feb 20, 2025 | In-Context Learning, Long-Context Understanding | Unverified | 0 |
| Obliviate: Efficient Unmemorization for Protecting Intellectual Property in Large Language Models | Feb 20, 2025 | HellaSwag, Memorization | Unverified | 0 |
| Quantifying Memorization and Retriever Performance in Retrieval-Augmented Vision-Language Models | Feb 19, 2025 | Memorization, Question Answering | Unverified | 0 |
| None of the Others: a General Technique to Distinguish Reasoning from Memorization in Multiple-Choice LLM Evaluation Benchmarks | Feb 18, 2025 | Math, Memorization | Unverified | 0 |
| R.R.: Unveiling LLM Training Privacy through Recollection and Ranking | Feb 18, 2025 | Memorization | Code Available | 0 |
| Pruning as a Defense: Reducing Memorization in Large Language Models | Feb 18, 2025 | Memorization | Unverified | 0 |
| Rethinking Benign Overfitting in Two-Layer Neural Networks | Feb 17, 2025 | Memorization | Unverified | 0 |
| Continual Learning Should Move Beyond Incremental Classification | Feb 17, 2025 | Classification, Continual Learning | Unverified | 0 |
| Logarithmic Width Suffices for Robust Memorization | Feb 16, 2025 | Memorization | Unverified | 0 |
| Retrieval-augmented Encoders for Extreme Multi-label Text Classification | Feb 15, 2025 | Extreme Multi-Label Classification, Memorization | Unverified | 0 |
| The Vendiscope: An Algorithmic Microscope For Data Collections | Feb 15, 2025 | Diversity, Formation Energy | Unverified | 0 |
| Diffusing DeBias: a Recipe for Turning a Bug into a Feature | Feb 13, 2025 | Memorization | Unverified | 0 |
| The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding | Feb 13, 2025 | In-Context Learning, Memorization | Code Available | 0 |
| Redistribute Ensemble Training for Mitigating Memorization in Diffusion Models | Feb 13, 2025 | Image Generation, Memorization | Code Available | 0 |
| Democratizing AI: Open-source Scalable LLM Training on GPU-based Supercomputers | Feb 12, 2025 | Blocking, GPU | Unverified | 0 |