| Swallowing the Poison Pills: Insights from Vulnerability Disparity Among LLMs | Feb 23, 2025 | Data PoisoningDiagnostic | —Unverified | 0 |
| Synthetic Data Can Mislead Evaluations: Membership Inference as Machine Text Detection | Jan 20, 2025 | MemorizationText Detection | —Unverified | 0 |
| Synthetic Dataset Generation for Privacy-Preserving Machine Learning | Oct 6, 2022 | Dataset Generationimage-classification | —Unverified | 0 |
| T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts | Dec 5, 2024 | BenchmarkingImage Generation | —Unverified | 0 |
| Taking a Big Step: Large Learning Rates in Denoising Score Matching Prevent Memorization | Feb 5, 2025 | DenoisingMemorization | —Unverified | 0 |
| Targeted Attack on GPT-Neo for the SATML Language Model Data Extraction Challenge | Feb 13, 2023 | Inference AttackLanguage Modeling | —Unverified | 0 |
| Test-time regression: a unifying framework for designing sequence models with associative memory | Jan 21, 2025 | Memorizationregression | —Unverified | 0 |
| Text Encoders Lack Knowledge: Leveraging Generative LLMs for Domain-Specific Semantic Textual Similarity | Sep 12, 2023 | MemorizationSemantic Similarity | —Unverified | 0 |
| The Creative Frontier of Generative AI: Managing the Novelty-Usefulness Tradeoff | Jun 6, 2023 | MemorizationTransfer Learning | —Unverified | 0 |
| The Curious Case of Benign Memorization | Oct 25, 2022 | Data AugmentationMemorization | —Unverified | 0 |