| R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning | May 22, 2025 | MemorizationRAG | CodeCode Available | 4 |
| Sudoku-Bench: Evaluating creative reasoning with Sudoku variants | May 22, 2025 | DiversityLogical Reasoning | CodeCode Available | 0 |
| Pre-training Large Memory Language Models with Internal and External Knowledge | May 21, 2025 | Memorization | CodeCode Available | 1 |
| Shared Path: Unraveling Memorization in Multilingual LLMs through Language Similarities | May 21, 2025 | MemorizationMultilingual NLP | —Unverified | 0 |
| Protoknowledge Shapes Behaviour of LLMs in Downstream Tasks: Memorization and Generalization with Knowledge Graphs | May 21, 2025 | Knowledge GraphsMemorization | —Unverified | 0 |
| SifterNet: A Generalized and Model-Agnostic Trigger Purification Approach | May 20, 2025 | Memorization | —Unverified | 0 |
| Through a Compressed Lens: Investigating the Impact of Quantization on LLM Explainability and Interpretability | May 20, 2025 | counterfactualMemorization | —Unverified | 0 |
| Causal Cartographer: From Mapping to Reasoning Over Counterfactual Worlds | May 20, 2025 | Causal Inferencecounterfactual | CodeCode Available | 0 |
| Fragments to Facts: Partial-Information Fragment Inference from LLMs | May 20, 2025 | Memorization | CodeCode Available | 0 |
| Positional Fragility in LLMs: How Offset Effects Reshape Our Understanding of Memorization Risks | May 19, 2025 | AttributeMemorization | —Unverified | 0 |
| Extracting memorized pieces of (copyrighted) books from open-weight language models | May 18, 2025 | Memorization | —Unverified | 0 |
| Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection | May 18, 2025 | MemorizationWorld Knowledge | CodeCode Available | 0 |
| Teach2Eval: An Indirect Evaluation Method for LLM by Judging How It Teaches | May 18, 2025 | FairnessMemorization | CodeCode Available | 0 |
| PANORAMA: A synthetic PII-laced dataset for studying sensitive data memorization in LLMs | May 18, 2025 | ArticlesAttribute | CodeCode Available | 0 |
| Is Grokking a Computational Glass Relaxation? | May 16, 2025 | Memorization | —Unverified | 0 |
| Illusion or Algorithm? Investigating Memorization, Emergence, and Symbolic Processing in In-Context Learning | May 16, 2025 | In-Context LearningMemorization | CodeCode Available | 0 |
| Do LLMs Memorize Recommendation Datasets? A Preliminary Study on MovieLens-1M | May 15, 2025 | BenchmarkingMemorization | CodeCode Available | 0 |
| Memorization-Compression Cycles Improve Generalization | May 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Identifying Memorization of Diffusion Models through p-Laplace Analysis | May 13, 2025 | Memorization | CodeCode Available | 0 |
| Enfoque Odychess: Un método dialéctico, constructivista y adaptativo para la enseñanza del ajedrez con inteligencias artificiales generativas | May 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| OBLIVIATE: Robust and Practical Machine Unlearning for Large Language Models | May 7, 2025 | Machine UnlearningMemorization | —Unverified | 0 |
| A new membership inference attack that spots memorization in generative and predictive models: Loss-Based with Reference Model algorithm (LBRM) | May 6, 2025 | ImputationInference Attack | —Unverified | 0 |
| Resolving Memorization in Empirical Diffusion Model for Manifold Data in High-Dimensional Spaces | May 5, 2025 | Memorization | —Unverified | 0 |
| Memorization or Interpolation ? Detecting LLM Memorization through Input Perturbation Analysis | May 5, 2025 | ArticlesHumanEval | —Unverified | 0 |
| Identifying Legal Holdings with LLMs: A Systematic Study of Performance, Scale, and Memorization | May 4, 2025 | Memorization | CodeCode Available | 0 |
| Wide & Deep Learning for Node Classification | May 4, 2025 | ClassificationDeep Learning | CodeCode Available | 0 |
| Seeking to Collide: Online Safety-Critical Scenario Generation for Autonomous Driving with Retrieval Augmented Large Language Models | May 2, 2025 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| EnronQA: Towards Personalized RAG over Private Documents | May 1, 2025 | BenchmarkingMemorization | —Unverified | 0 |
| Memorization and Knowledge Injection in Gated LLMs | Apr 30, 2025 | Continual LearningMemorization | CodeCode Available | 0 |
| Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers | Apr 29, 2025 | Data AugmentationKnowledge Graphs | —Unverified | 0 |
| Enhancing Privacy-Utility Trade-offs to Mitigate Memorization in Diffusion Models | Apr 25, 2025 | Memorization | —Unverified | 0 |
| The Memorization Problem: Can We Trust LLMs' Economic Forecasts? | Apr 20, 2025 | Memorization | —Unverified | 0 |
| A mean teacher algorithm for unlearning of language models | Apr 18, 2025 | Continual LearningLanguage Modeling | CodeCode Available | 0 |
| Memorization: A Close Look at Books | Apr 17, 2025 | Memorization | —Unverified | 0 |
| It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization | Apr 17, 2025 | AllLanguage Modeling | —Unverified | 0 |
| Memorization vs. Reasoning: Updating LLMs with New Knowledge | Apr 16, 2025 | Memorization | —Unverified | 0 |
| Replicating ReLM Results: Validating Large Language Models with ReLM | Apr 16, 2025 | Memorization | —Unverified | 0 |
| LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models | Apr 14, 2025 | Equation DiscoveryMemorization | CodeCode Available | 2 |
| Large Language Models Could Be Rote Learners | Apr 11, 2025 | MemorizationMMLU | —Unverified | 0 |
| The Method for Storing Patterns in Neural Networks-Memorization and Recall of QR code Patterns- | Apr 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| An introduction to memory competitions, records and techniques | Apr 9, 2025 | Memorization | —Unverified | 0 |
| Memory-Modular Classification: Learning to Generalize with Memory Replacement | Apr 8, 2025 | Classificationimage-classification | CodeCode Available | 0 |
| AGITB: A Signal-Level Benchmark for Evaluating Artificial General Intelligence | Apr 6, 2025 | MemorizationResponse Generation | CodeCode Available | 0 |
| Do Larger Language Models Imply Better Reasoning? A Pretraining Scaling Law for Reasoning | Apr 4, 2025 | Knowledge GraphsMemorization | —Unverified | 0 |
| Generative Evaluation of Complex Reasoning in Large Language Models | Apr 3, 2025 | BenchmarkingMemorization | CodeCode Available | 1 |
| When Reasoning Meets Compression: Benchmarking Compressed Large Reasoning Models on Complex Reasoning Tasks | Apr 2, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| GMAI-VL-R1: Harnessing Reinforcement Learning for Multimodal Medical Reasoning | Apr 2, 2025 | Decision MakingDiagnostic | CodeCode Available | 1 |
| CASCADE Your Datasets for Cross-Mode Knowledge Retrieval of Language Models | Apr 2, 2025 | MemorizationRetrieval | CodeCode Available | 0 |
| Few-Shot Generation of Brain Tumors for Secure and Fair Data Sharing | Mar 31, 2025 | Brain Tumor SegmentationData Augmentation | —Unverified | 0 |
| COSMO: Combination of Selective Memorization for Low-cost Vision-and-Language Navigation | Mar 31, 2025 | MemorizationVision and Language Navigation | —Unverified | 0 |