SOTAVerified

Memorization

Papers

Showing 251–275 of 1088 papers

Title | Status | Hype
Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory | | 0
Does Learning Require Memorization? A Short Tale about a Long Tail | | 0
Does Pre-trained Language Model Actually Infer Unseen Links in Knowledge Graph Completion? | | 0
Does Your Neural Code Completion Model Use My Code? A Membership Inference Approach | | 0
Exploring prompts to elicit memorization in masked language model-based named entity recognition | | 0
Exposing the Achilles' Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning | | 0
Extracting Training Data from Document-Based VQA Models | | 0
Fault-Diagnosing SLAM for Varying Scale Change Detection | | 0
Déjà Vu: an empirical evaluation of the memorization properties of ConvNets | | 0
Constructive Universal Approximation and Finite Sample Memorization by Narrow Deep ReLU Networks | | 0
Deep Learning is Provably Robust to Symmetric Label Noise | | 0
An Efficient Method of Training Small Models for Regression Problems with Knowledge Distillation | | 0
Tight Bounds on the Smallest Eigenvalue of the Neural Tangent Kernel for Deep ReLU Networks | | 0
Decoupling Gating from Linearity | | 0
Decoding Generalization from Memorization in Deep Neural Networks | | 0
Between Randomness and Arbitrariness: Some Lessons for Reliable Machine Learning at Scale | | 0
Exploring Memorization and Copyright Violation in Frontier LLMs: A Study of the New York Times v. OpenAI 2023 Lawsuit | | 0
Better Generalization with Semantic IDs: A Case Study in Ranking for Recommendations | | 0
Weak and Strong Gradient Directions: Explaining Memorization, Generalization, and Hardness of Examples at Scale | | 0
An associative memory model with very high memory rate: Image storage by sequential addition learning | | 0
Better Generalization with On-the-fly Dataset Denoising | | 0
Exploring Fact Memorization and Style Imitation in LLMs Using QLoRA: An Experimental Study and Quality Assessment Methods | | 0
Data Isotopes for Data Provenance in DNNs | | 0
Is Grokking a Computational Glass Relaxation? | | 0
Data-Copying in Generative Models: A Formal Framework | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PaLM-540B (few-shot, k=5) | Accuracy | 95.4 | | Unverified
2 | Gopher-280B (few-shot, k=5) | Accuracy | 80 | | Unverified
3 | PaLM-62B (few-shot, k=5) | Accuracy | 77.7 | | Unverified