SOTAVerified

Memorization

Papers

Showing 351400 of 1088 papers

TitleStatusHype
Extreme Image Transformations Facilitate Robust Latent Object Representations0
Extracting Unlearned Information from LLMs with Activation Steering0
Small ReLU networks are powerful memorizers: a tight analysis of memorization capacity0
FLM-101B: An Open LLM and How to Train It with $100K Budget0
Composing Global Optimizers to Reasoning Tasks via Algebraic Objects in Neural Nets0
Improving Meta-learning for Low-resource Text Classification and Generation via Memory Imitation0
Integrating Functionalities To A System Via Autoencoder Hippocampus Network0
Extracting Training Data from Unconditional Diffusion Models0
Extracting Training Data from Document-Based VQA Models0
Combining Recurrent, Convolutional, and Continuous-time Models with Linear State Space Layers0
Attention Beats Concatenation for Conditioning Neural Fields0
Investigating CNNs' Learning Representation under label noise0
Extracting memorized pieces of (copyrighted) books from open-weight language models0
FUNU: Boosting Machine Unlearning Efficiency by Filtering Unnecessary Unlearning0
Future Prediction Can be a Strong Evidence of Good History Representation in Partially Observable Environments0
Generalizability of Memorization Neural Networks0
FINED: Feed Instance-Wise Information Need with Essential and Disentangled Parametric Knowledge from the Past0
Combining Recurrent, Convolutional, and Continuous-time Models with Linear State-Space Layers0
Assessing Generalization in TD methods for Deep Reinforcement Learning0
Expressive Power of ReLU and Step Networks under Floating-Point Operations0
Exposing the Achilles' Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning0
Combating Label Noise With A General Surrogate Model For Sample Selection0
Generalization vs. Memorization in the Presence of Statistical Biases in Transformers0
Generalization v.s. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data0
Exploring the Hidden Reasoning Process of Large Language Models by Misleading Them0
Generation or Replication: Auscultating Audio Latent Diffusion Models0
Co-matching: Combating Noisy Labels by Augmentation Anchoring0
Generative AI Training and Copyright Law0
Generative artificial intelligence in ophthalmology: multimodal retinal images for the diagnosis of Alzheimer's disease with convolutional neural networks0
A Spline Theory of Deep Learning0
Exploring prompts to elicit memorization in masked language model-based named entity recognition0
Collectionless Artificial Intelligence0
Get Confused Cautiously: Textual Sequence Memorization Erasure with Selective Entropy Maximization0
A Multi-Perspective Analysis of Memorization in Large Language Models0
A Simple Model of Inference Scaling Laws0
GP-MoLFormer: A Foundation Model For Molecular Generation0
Exploring Memorization in Fine-tuned Language Models0
Collaborative Learning in General Graphs with Limited Memorization: Complexity, Learnability, and Reliability0
Context-Aware Membership Inference Attacks against Pre-trained Large Language Models0
Graph Neural Networks Provably Benefit from Structural Information: A Feature Learning Perspective0
Exploring Memorization and Copyright Violation in Frontier LLMs: A Study of the New York Times v. OpenAI 2023 Lawsuit0
Grokking as Compression: A Nonlinear Complexity Perspective0
Cross-Domain Generalization Through Memorization: A Study of Nearest Neighbors in Neural Duplicate Question Detection0
Exploring Local Memorization in Diffusion Models via Bright Ending Attention0
Coherence and Diversity through Noise: Self-Supervised Paraphrase Generation via Structure-Aware Denoising0
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers0
Exploring Fact Memorization and Style Imitation in LLMs Using QLoRA: An Experimental Study and Quality Assessment Methods0
Weak and Strong Gradient Directions: Explaining Memorization, Generalization, and Hardness of Examples at Scale0
Codex Hacks HackerRank: Memorization Issues and a Framework for Code Synthesis Evaluation0
A Simple Approach to the Noisy Label Problem Through the Gambler's Loss0
Show:102550
← PrevPage 8 of 22Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM-540B (few-shot, k=5)Accuracy95.4Unverified
2Gopher-280B (few-shot, k=5)Accuracy80Unverified
3PaLM-62B (few-shot, k=5)Accuracy77.7Unverified