| Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning | Apr 26, 2023 | Memorization, Self-Supervised Learning | Code Available | 1 |
| Do Language Models Plagiarize? | Mar 15, 2022 | Language Modelling, Memorization | Code Available | 1 |
| Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics | Oct 28, 2024 | Arithmetic Reasoning, Math | Code Available | 1 |
| Generalization in diffusion models arises from geometry-adaptive harmonic representations | Oct 4, 2023 | Denoising, Image Denoising | Code Available | 1 |
| Generative Evaluation of Complex Reasoning in Large Language Models | Apr 3, 2025 | Benchmarking, Memorization | Code Available | 1 |
| Generative Modeling of Weights: Generalization or Memorization? | Jun 9, 2025 | Memorization, Video Generation | Code Available | 1 |
| Do We Need Zero Training Loss After Achieving Zero Training Error? | Feb 20, 2020 | Memorization | Code Available | 1 |
| CodeJudge-Eval: Can Large Language Models be Good Judges in Code Understanding? | Aug 20, 2024 | Code Generation, Memorization | Code Available | 1 |
| Data Contamination Can Cross Language Barriers | Jun 19, 2024 | Memorization | Code Available | 1 |
| DEPN: Detecting and Editing Privacy Neurons in Pretrained Language Models | Oct 31, 2023 | Memorization, Model Editing | Code Available | 1 |
| How Do Large Language Models Acquire Factual Knowledge During Pretraining? | Jun 17, 2024 | Memorization | Code Available | 1 |
| DIAGNOSIS: Detecting Unauthorized Data Usages in Text-to-image Diffusion Models | Jul 6, 2023 | Memorization | Code Available | 1 |
| AlleNoise: large-scale text classification benchmark dataset with real-world label noise | Jun 24, 2024 | Classification, Learning with noisy labels | Code Available | 1 |
| Improving Generalization by Controlling Label-Noise Information in Neural Network Weights | Feb 19, 2020 | Data Augmentation, Generalization Bounds | Code Available | 1 |
| SoK: Membership Inference Attacks on LLMs are Rushing Nowhere (and How to Fix It) | Jun 25, 2024 | Benchmarking, Experimental Design | Code Available | 1 |
| Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Utilization | Jun 27, 2024 | Memorization | Code Available | 1 |
| DISC: Learning From Noisy Labels via Dynamic Instance-Specific Selection and Correction | Jan 1, 2023 | image-classification, Image Classification | Code Available | 1 |
| Large Scale Knowledge Washing | May 26, 2024 | Decoder, Memorization | Code Available | 1 |
| Co-teaching: Robust Training of Deep Neural Networks with Extremely Noisy Labels | Apr 18, 2018 | Image Classification, Learning with noisy labels | Code Available | 1 |
| Brain tumor segmentation using synthetic MR images -- A comparison of GANs and diffusion models | Jun 5, 2023 | Brain Tumor Segmentation, Ethics | Code Available | 1 |
| Learning to Generate Novel Scene Compositions from Single Images and Videos | May 12, 2021 | Diversity, Memorization | Code Available | 1 |
| Consensual Collaborative Training And Knowledge Distillation Based Facial Expression Recognition Under Noisy Annotations | Jul 10, 2021 | Facial Expression Recognition, Facial Expression Recognition (FER) | Code Available | 1 |
| Cousins Of The Vendi Score: A Family Of Similarity-Based Diversity Metrics For Science And Machine Learning | Oct 19, 2023 | Diversity, Memorization | Code Available | 1 |
| Do PLMs Know and Understand Ontological Knowledge? | Sep 12, 2023 | Logical Reasoning, Memorization | Code Available | 1 |
| Continual Variational Autoencoder Learning via Online Cooperative Memorization | Jul 20, 2022 | Continual Learning, Diversity | Code Available | 1 |