| Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs | Mar 28, 2024 | counterfactualMemorization | CodeCode Available | 0 |
| Associative Long Short-Term Memory | Feb 9, 2016 | MemorizationRetrieval | CodeCode Available | 0 |
| KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server | Oct 8, 2024 | Federated LearningKnowledge Distillation | CodeCode Available | 0 |
| Next-token prediction capacity: general upper bounds and a lower bound for transformers | May 22, 2024 | DecoderMemorization | CodeCode Available | 0 |
| Out-of-Distribution Detection based on In-Distribution Data Patterns Memorization with Modern Hopfield Energy | Jan 21, 2023 | Computational EfficiencyMemorization | CodeCode Available | 0 |
| Overparameterized Neural Networks Implement Associative Memory | Sep 26, 2019 | MemorizationRetrieval | CodeCode Available | 0 |
| Schema-Guided Paradigm for Zero-Shot Dialog | Jun 13, 2021 | MemorizationTransfer Learning | CodeCode Available | 0 |
| OWL: Probing Cross-Lingual Recall of Memorized Texts via World Literature | May 28, 2025 | Memorization | CodeCode Available | 0 |
| PALATE: Peculiar Application of the Law of Total Expectation to Enhance the Evaluation of Deep Generative Models | Mar 24, 2025 | Computational EfficiencyImage Generation | CodeCode Available | 0 |
| Distributed Associative Memory Network with Memory Refreshing Loss | Jul 21, 2020 | MemorizationQuestion Answering | CodeCode Available | 0 |