SOTAVerified

Memorization

Papers

Showing 251–300 of 1088 papers

| Title | Status | Hype |
|---|---|---|
| Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory | | 0 |
| Does Learning Require Memorization? A Short Tale about a Long Tail | | 0 |
| Does Pre-trained Language Model Actually Infer Unseen Links in Knowledge Graph Completion? | | 0 |
| Bounds for the smallest eigenvalue of the NTK for arbitrary spherical data of arbitrary dimension | | 0 |
| A solvable generative model with a linear, one-step denoiser | | 0 |
| Generalizability of Memorization Neural Networks | | 0 |
| Generative artificial intelligence in ophthalmology: multimodal retinal images for the diagnosis of Alzheimer's disease with convolutional neural networks | | 0 |
| Déjà Vu: an empirical evaluation of the memorization properties of ConvNets | | 0 |
| Constructive Universal Approximation and Finite Sample Memorization by Narrow Deep ReLU Networks | | 0 |
| Deep Learning is Provably Robust to Symmetric Label Noise | | 0 |
| An Efficient Method of Training Small Models for Regression Problems with Knowledge Distillation | | 0 |
| Tight Bounds on the Smallest Eigenvalue of the Neural Tangent Kernel for Deep ReLU Networks | | 0 |
| Decoupling Gating from Linearity | | 0 |
| Decoding Generalization from Memorization in Deep Neural Networks | | 0 |
| Between Randomness and Arbitrariness: Some Lessons for Reliable Machine Learning at Scale | | 0 |
| Better Generalization with Semantic IDs: A Case Study in Ranking for Recommendations | | 0 |
| Small ReLU networks are powerful memorizers: a tight analysis of memorization capacity | | 0 |
| An associative memory model with very high memory rate: Image storage by sequential addition learning | | 0 |
| Better Generalization with On-the-fly Dataset Denoising | | 0 |
| FLM-101B: An Open LLM and How to Train It with $100K Budget | | 0 |
| Data Isotopes for Data Provenance in DNNs | | 0 |
| Is Grokking a Computational Glass Relaxation? | | 0 |
| Data-Copying in Generative Models: A Formal Framework | | 0 |
| An Analysis for Reasoning Bias of Language Models with Small Initialization | | 0 |
| Data Contamination: From Memorization to Exploitation | | 0 |
| Analyzing the Neural Tangent Kernel of Periodically Activated Coordinate Networks | | 0 |
| Bayesian Perspective on Memorization and Reconstruction | | 0 |
| Customizing Large Language Model Generation Style using Parameter-Efficient Finetuning | | 0 |
| Memorization Capacity for Additive Fine-Tuning with Small ReLU Networks | | 0 |
| CSGNN: Conquering Noisy Node labels via Dynamic Class-wise Selection | | 0 |
| A Wearable Social Interaction Aid for Children with Autism | | 0 |
| Few-Shot Generation of Brain Tumors for Secure and Fair Data Sharing | | 0 |
| Few-Shot Table-to-Text Generation with Prompt Planning and Knowledge Memorization | | 0 |
| CrossSplit: Mitigating Label Noise Memorization through Data Splitting | | 0 |
| Avoiding Generative Model Writer's Block With Embedding Nudging | | 0 |
| Cross-Domain Generalization Through Memorization: A Study of Nearest Neighbors in Neural Duplicate Question Detection | | 0 |
| Critical Data Size of Language Models from a Grokking Perspective | | 0 |
| Analysis of the Memorization and Generalization Capabilities of AI Agents: Are Continual Learners Robust? | | 0 |
| Counterfactual Memorization in Neural Language Models | | 0 |
| ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks | | 0 |
| Counterfactual Influence as a Distributional Quantity | | 0 |
| A Multi-Perspective Analysis of Memorization in Large Language Models | | 0 |
| AdaCap: Adaptive Capacity control for Feed-Forward Neural Networks | | 0 |
| Federated Nearest Neighbor Machine Translation | | 0 |
| FFNB: Forgetting-Free Neural Blocks for Deep Continual Visual Learning | | 0 |
| COSMO: Combination of Selective Memorization for Low-cost Vision-and-Language Navigation | | 0 |
| Autoencoder-based Initialization for Recurrent Neural Networks with a Linear Memory | | 0 |
| A Corrective View of Neural Networks: Representation, Memorization and Learning | | 0 |
| Extreme Image Transformations Facilitate Robust Latent Object Representations | | 0 |
| CopyrightShield: Spatial Similarity Guided Backdoor Defense against Copyright Infringement in Diffusion Models | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PaLM-540B (few-shot, k=5) | Accuracy | 95.4 | | Unverified |
| 2 | Gopher-280B (few-shot, k=5) | Accuracy | 80 | | Unverified |
| 3 | PaLM-62B (few-shot, k=5) | Accuracy | 77.7 | | Unverified |