| Koala: An Index for Quantifying Overlaps with Pre-training Corpora | Mar 26, 2023 | Memorization | —Unverified | 0 |
| Capabilities of GPT-4 on Medical Challenge Problems | Mar 20, 2023 | counterfactualMemorization | CodeCode Available | 1 |
| Memorization Capacity of Neural Networks with Conditional Computation | Mar 20, 2023 | Memorization | —Unverified | 0 |
| Query2doc: Query Expansion with Large Language Models | Mar 14, 2023 | MemorizationRetrieval | —Unverified | 0 |
| Learning the Finer Things: Bayesian Structure Learning at the Instantiation Level | Mar 8, 2023 | Memorization | —Unverified | 0 |
| Where We Are and What We're Looking At: Query Based Worldwide Image Geo-localization Using Hierarchies and Scenes | Mar 7, 2023 | geo-localizationImage-Based Localization | —Unverified | 0 |
| Ancient Chinese Word Segmentation and Part-of-Speech Tagging Using Distant Supervision | Mar 3, 2023 | Chinese Word SegmentationMemorization | CodeCode Available | 0 |
| Semiparametric Language Models Are Scalable Continual Learners | Mar 2, 2023 | Continual LearningLanguage Modeling | —Unverified | 0 |
| The (ab)use of Open Source Code to Train Large Language Models | Feb 27, 2023 | Memorization | CodeCode Available | 0 |
| Data-Copying in Generative Models: A Formal Framework | Feb 25, 2023 | Memorization | —Unverified | 0 |