SOTAVerified

Memorization

Papers

Showing 51-100 of 1088 papers

Title | Status | Hype
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning | Code | 4
Sudoku-Bench: Evaluating creative reasoning with Sudoku variants | Code | 0
Pre-training Large Memory Language Models with Internal and External Knowledge | Code | 1
Shared Path: Unraveling Memorization in Multilingual LLMs through Language Similarities |  | 0
Protoknowledge Shapes Behaviour of LLMs in Downstream Tasks: Memorization and Generalization with Knowledge Graphs |  | 0
SifterNet: A Generalized and Model-Agnostic Trigger Purification Approach |  | 0
Through a Compressed Lens: Investigating the Impact of Quantization on LLM Explainability and Interpretability |  | 0
Causal Cartographer: From Mapping to Reasoning Over Counterfactual Worlds | Code | 0
Fragments to Facts: Partial-Information Fragment Inference from LLMs | Code | 0
Positional Fragility in LLMs: How Offset Effects Reshape Our Understanding of Memorization Risks |  | 0
Extracting memorized pieces of (copyrighted) books from open-weight language models |  | 0
Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection | Code | 0
Teach2Eval: An Indirect Evaluation Method for LLM by Judging How It Teaches | Code | 0
PANORAMA: A synthetic PII-laced dataset for studying sensitive data memorization in LLMs | Code | 0
Is Grokking a Computational Glass Relaxation? |  | 0
Illusion or Algorithm? Investigating Memorization, Emergence, and Symbolic Processing in In-Context Learning | Code | 0
Do LLMs Memorize Recommendation Datasets? A Preliminary Study on MovieLens-1M | Code | 0
Memorization-Compression Cycles Improve Generalization |  | 0
Identifying Memorization of Diffusion Models through p-Laplace Analysis | Code | 0
Enfoque Odychess: Un método dialéctico, constructivista y adaptativo para la enseñanza del ajedrez con inteligencias artificiales generativas |  | 0
OBLIVIATE: Robust and Practical Machine Unlearning for Large Language Models |  | 0
A new membership inference attack that spots memorization in generative and predictive models: Loss-Based with Reference Model algorithm (LBRM) |  | 0
Resolving Memorization in Empirical Diffusion Model for Manifold Data in High-Dimensional Spaces |  | 0
Memorization or Interpolation ? Detecting LLM Memorization through Input Perturbation Analysis |  | 0
Identifying Legal Holdings with LLMs: A Systematic Study of Performance, Scale, and Memorization | Code | 0
Wide & Deep Learning for Node Classification | Code | 0
Seeking to Collide: Online Safety-Critical Scenario Generation for Autonomous Driving with Retrieval Augmented Large Language Models |  | 0
EnronQA: Towards Personalized RAG over Private Documents |  | 0
Memorization and Knowledge Injection in Gated LLMs | Code | 0
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers |  | 0
Enhancing Privacy-Utility Trade-offs to Mitigate Memorization in Diffusion Models |  | 0
The Memorization Problem: Can We Trust LLMs' Economic Forecasts? |  | 0
A mean teacher algorithm for unlearning of language models | Code | 0
Memorization: A Close Look at Books |  | 0
It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization |  | 0
Memorization vs. Reasoning: Updating LLMs with New Knowledge |  | 0
Replicating ReLM Results: Validating Large Language Models with ReLM |  | 0
LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models | Code | 2
Large Language Models Could Be Rote Learners |  | 0
The Method for Storing Patterns in Neural Networks-Memorization and Recall of QR code Patterns- |  | 0
An introduction to memory competitions, records and techniques |  | 0
Memory-Modular Classification: Learning to Generalize with Memory Replacement | Code | 0
AGITB: A Signal-Level Benchmark for Evaluating Artificial General Intelligence | Code | 0
Do Larger Language Models Imply Better Reasoning? A Pretraining Scaling Law for Reasoning |  | 0
Generative Evaluation of Complex Reasoning in Large Language Models | Code | 1
When Reasoning Meets Compression: Benchmarking Compressed Large Reasoning Models on Complex Reasoning Tasks |  | 0
GMAI-VL-R1: Harnessing Reinforcement Learning for Multimodal Medical Reasoning | Code | 1
CASCADE Your Datasets for Cross-Mode Knowledge Retrieval of Language Models | Code | 0
Few-Shot Generation of Brain Tumors for Secure and Fair Data Sharing |  | 0
COSMO: Combination of Selective Memorization for Low-cost Vision-and-Language Navigation |  | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PaLM-540B (few-shot, k=5) | Accuracy | 95.4 |  | Unverified
2 | Gopher-280B (few-shot, k=5) | Accuracy | 80 |  | Unverified
3 | PaLM-62B (few-shot, k=5) | Accuracy | 77.7 |  | Unverified