| CodeJudge-Eval: Can Large Language Models be Good Judges in Code Understanding? | Aug 20, 2024 | Code Generation, Memorization | Code Available | 1 |
| Towards Robust and Parameter-Efficient Knowledge Unlearning for LLMs | Aug 13, 2024 | Machine Unlearning, Memorization | Code Available | 1 |
| MemBench: Memorized Image Trigger Prompt Dataset for Diffusion Models | Jul 24, 2024 | Image Generation, Memorization | Code Available | 1 |
| Deciphering the Factors Influencing the Efficacy of Chain-of-Thought: Probability, Memorization, and Noisy Reasoning | Jul 1, 2024 | Memorization | Code Available | 1 |
| Advancing Cross-domain Discriminability in Continual Learning of Vision-Language Models | Jun 27, 2024 | Continual Learning, Incremental Learning | Code Available | 1 |
| Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Utilization | Jun 27, 2024 | Memorization | Code Available | 1 |
| Sonnet or Not, Bot? Poetry Evaluation for Large Models and Datasets | Jun 27, 2024 | Form, Memorization | Code Available | 1 |
| Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon | Jun 25, 2024 | Memorization | Code Available | 1 |
| SoK: Membership Inference Attacks on LLMs are Rushing Nowhere (and How to Fix It) | Jun 25, 2024 | Benchmarking, Experimental Design | Code Available | 1 |
| AlleNoise: large-scale text classification benchmark dataset with real-world label noise | Jun 24, 2024 | Classification, Learning with noisy labels | Code Available | 1 |