SOTAVerified

Memorization

Papers

Showing 351–400 of 1088 papers

Title | Status | Hype
A Comparative Study of Reservoir Computing for Temporal Signal Processing | | 0
In-Context Learning Strategies Emerge Rationally | | 0
Small ReLU networks are powerful memorizers: a tight analysis of memorization capacity | | 0
FLM-101B: An Open LLM and How to Train It with $100K Budget | | 0
Investigating CNNs' Learning Representation under label noise | | 0
Extracting Training Data from Unconditional Diffusion Models | | 0
Extracting Training Data from Document-Based VQA Models | | 0
Combining Recurrent, Convolutional, and Continuous-time Models with Linear State Space Layers | | 0
Extracting memorized pieces of (copyrighted) books from open-weight language models | | 0
FINED: Feed Instance-Wise Information Need with Essential and Disentangled Parametric Knowledge from the Past | | 0
Attention Beats Concatenation for Conditioning Neural Fields | | 0
Gradient-Based Word Substitution for Obstinate Adversarial Examples Generation in Language Models | | 0
Assessing Generalization in TD methods for Deep Reinforcement Learning | | 0
FUNU: Boosting Machine Unlearning Efficiency by Filtering Unnecessary Unlearning | | 0
Future Prediction Can be a Strong Evidence of Good History Representation in Partially Observable Environments | | 0
Generalizability of Memorization Neural Networks | | 0
A Spline Theory of Deep Learning | | 0
Expressive Power of ReLU and Step Networks under Floating-Point Operations | | 0
Exposing the Achilles' Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning | | 0
Combating Label Noise With A General Surrogate Model For Sample Selection | | 0
Exploring the Hidden Reasoning Process of Large Language Models by Misleading Them | | 0
Co-matching: Combating Noisy Labels by Augmentation Anchoring | | 0
Generalization vs. Memorization in the Presence of Statistical Biases in Transformers | | 0
Generalization v.s. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data | | 0
Adaptive Pre-training Data Detection for Large Language Models via Surprising Tokens | | 0
Generation or Replication: Auscultating Audio Latent Diffusion Models | | 0
Exploring prompts to elicit memorization in masked language model-based named entity recognition | | 0
Generative AI Training and Copyright Law | | 0
Generative artificial intelligence in ophthalmology: multimodal retinal images for the diagnosis of Alzheimer's disease with convolutional neural networks | | 0
Collectionless Artificial Intelligence | | 0
A Simple Model of Inference Scaling Laws | | 0
Exploring Memorization in Fine-tuned Language Models | | 0
Get Confused Cautiously: Textual Sequence Memorization Erasure with Selective Entropy Maximization | | 0
A Multi-Perspective Analysis of Memorization in Large Language Models | | 0
Collaborative Learning in General Graphs with Limited Memorization: Complexity, Learnability, and Reliability | | 0
GP-MoLFormer: A Foundation Model For Molecular Generation | | 0
Context-Aware Membership Inference Attacks against Pre-trained Large Language Models | | 0
Exploring Memorization and Copyright Violation in Frontier LLMs: A Study of the New York Times v. OpenAI 2023 Lawsuit | | 0
Exploring Local Memorization in Diffusion Models via Bright Ending Attention | | 0
Graph Neural Networks Provably Benefit from Structural Information: A Feature Learning Perspective | | 0
Coherence and Diversity through Noise: Self-Supervised Paraphrase Generation via Structure-Aware Denoising | | 0
Exploring Fact Memorization and Style Imitation in LLMs Using QLoRA: An Experimental Study and Quality Assessment Methods | | 0
Cross-Domain Generalization Through Memorization: A Study of Nearest Neighbors in Neural Duplicate Question Detection | | 0
Weak and Strong Gradient Directions: Explaining Memorization, Generalization, and Hardness of Examples at Scale | | 0
Codex Hacks HackerRank: Memorization Issues and a Framework for Code Synthesis Evaluation | | 0
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers | | 0
A Simple Approach to the Noisy Label Problem Through the Gambler's Loss | | 0
Improving Model Generalization: A Chinese Named Entity Recognition Case Study | | 0
Improving word alignment for low resource languages using English monolingual SRL | | 0
Investigating Data Memorization in 3D Latent Diffusion Models for Medical Image Synthesis | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PaLM-540B (few-shot, k=5) | Accuracy | 95.4 | | Unverified
2 | Gopher-280B (few-shot, k=5) | Accuracy | 80 | | Unverified
3 | PaLM-62B (few-shot, k=5) | Accuracy | 77.7 | | Unverified