SOTAVerified

Memorization

Papers

Showing 551600 of 1088 papers

TitleStatusHype
Least Squares Maximum and Weighted Generalization-Memorization Machines0
Response: Emergent analogical reasoning in large language modelsCode0
Quantifying and Analyzing Entity-level Memorization in Large Language Models0
Large language models converge toward human-like concept organization0
Black-box Unsupervised Domain Adaptation with Bi-directional Atkinson-Shiffrin MemoryCode0
Continuous Reinforcement Learning-based Dynamic Difficulty Adjustment in a Visual Working Memory Game0
A Probabilistic Fluctuation based Membership Inference Attack for Diffusion ModelsCode0
Smoothness Similarity Regularization for Few-Shot GAN Adaptation0
MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain ConversationCode1
U-Turn Diffusion0
LLaMA-E: Empowering E-commerce Authoring with Object-Interleaved Instruction Following0
SimplyRetrieve: A Private and Lightweight Retrieval-Centric Generative AI ToolCode2
3D-EX : A Unified Dataset of Definitions and Dictionary ExamplesCode0
Excitatory/Inhibitory Balance Emerges as a Key Factor for RBN Performance, Overriding Attractor Dynamics0
Training Data Protection with Compositional Diffusion Models0
Arithmetic with Language Models: from Memorization to Computation0
Understanding Activation Patterns in Artificial Neural Networks by Exploring Stochastic Processes0
Does fine-tuning GPT-3 with the OpenAI API leak personally-identifiable information?Code0
Image Synthesis under Limited Data: A Survey and TaxonomyCode1
Are Transformers with One Layer Self-Attention Using Low-Rank Weight Matrices Universal Approximators?0
Gradient-Based Word Substitution for Obstinate Adversarial Examples Generation in Language Models0
Distribution Shift Matters for Knowledge Distillation with Webly Collected Images0
Long-Tail Theory under Gaussian MixturesCode0
What can we learn from Data Leakage and Unlearning for Law?0
Can Neural Network Memorization Be Localized?Code1
An Empirical Study of Pre-trained Model Selection for Out-of-Distribution Generalization and CalibrationCode0
Towards Model-Size Agnostic, Compute-Free, Memorization-based Inference of Deep Learning0
Drive Like a Human: Rethinking Autonomous Driving with Large Language ModelsCode2
In-context Autoencoder for Context Compression in a Large Language ModelCode1
Memorization Through the Lens of Curvature of Loss Function Around Samples0
Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confidence EstimationCode1
DIAGNOSIS: Detecting Unauthorized Data Usages in Text-to-image Diffusion ModelsCode1
Investigating Data Memorization in 3D Latent Diffusion Models for Medical Image Synthesis0
Pseudo-Bag Mixup Augmentation for Multiple Instance Learning-Based Whole Slide Image Classification0
On information captured by neural networks: connections with memorization and generalizationCode1
A Preference-aware Meta-optimization Framework for Personalized Vehicle Energy Consumption EstimationCode1
Graph Neural Networks Provably Benefit from Structural Information: A Feature Learning Perspective0
Predicting Grokking Long Before it Happens: A look into the loss landscape of models which grok0
Understanding quantum machine learning also requires rethinking generalizationCode1
Why are state-space models more expressive than n-gram models?0
MILD: Modeling the Instance Learning Dynamics for Learning with Noisy LabelsCode0
Unsupervised Text Embedding Space Generation Using Generative Adversarial Networks for Text SynthesisCode0
Improving Generalization in Meta-Learning via Meta-Gradient AugmentationCode0
Better Generalization with Semantic IDs: A Case Study in Ranking for Recommendations0
Can Forward Gradient Match Backpropagation?Code1
Understanding the Effect of the Long Tail on Neural Network Compression0
Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion modelsCode2
Exploring Model Dynamics for Accumulative Poisoning DiscoveryCode0
The Creative Frontier of Generative AI: Managing the Novelty-Usefulness Tradeoff0
Computation with Sequences in a Model of the Brain0
Show:102550
← PrevPage 12 of 22Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM-540B (few-shot, k=5)Accuracy95.4Unverified
2Gopher-280B (few-shot, k=5)Accuracy80Unverified
3PaLM-62B (few-shot, k=5)Accuracy77.7Unverified