SOTAVerified

Memorization

Papers

Showing 301350 of 1088 papers

TitleStatusHype
Memorization of Named Entities in Fine-tuned BERT ModelsCode0
Memorization in Deep Neural Networks: Does the Loss Function matter?Code0
Associative Long Short-Term MemoryCode0
Memorization in Attention-only TransformersCode0
Memorization vs. Generalization: Quantifying Data Leakage in NLP Performance EvaluationCode0
Memorization Capacity of Multi-Head Attention in TransformersCode0
Memorization and Regularization in Generative Diffusion ModelsCode0
Memorization-Dilation: Modeling Neural Collapse Under Label NoiseCode0
Memorization and Generalization in Neural Code Intelligence ModelsCode0
Composing Global Optimizers to Reasoning Tasks via Algebraic Objects in Neural NetsCode0
Memorization and Knowledge Injection in Gated LLMsCode0
Memorization With Neural Nets: Going Beyond the Worst CaseCode0
Combining Recurrent, Convolutional, and Continuous-time Models with Linear State-Space LayersCode0
MemControl: Mitigating Memorization in Diffusion Models via Automated Parameter SelectionCode0
Long-Tail Theory under Gaussian MixturesCode0
LOPS: Learning Order Inspired Pseudo-Label Selection for Weakly Supervised Text ClassificationCode0
Exploring Model Dynamics for Accumulative Poisoning DiscoveryCode0
A Closer Look at Memorization in Deep NetworksCode0
LiDAR-based localization using universal encoding and memory-aware regressionCode0
Leave-one-out Distinguishability in Machine LearningCode0
Leveraging Unlabeled Data to Track MemorizationCode0
ALIGNet: Partial-Shape Agnostic Alignment via Unsupervised LearningCode0
Learning with Open-world Noisy Data via Class-independent Margin in Dual Representation SpaceCode0
LexiMark: Robust Watermarking via Lexical Substitutions to Enhance Membership Verification of an LLM's Textual Training DataCode0
LLMs and Memorization: On Quality and Specificity of Copyright ComplianceCode0
Memory-Modular Classification: Learning to Generalize with Memory ReplacementCode0
Excess Capacity and Backdoor PoisoningCode0
Learning to Infer Program SketchesCode0
Evaluating and Explaining Natural Language Generation with GenXCode0
Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?Code0
ModelPred: A Framework for Predicting Trained Model from Training DataCode0
Classifier-Free Guidance inside the Attraction Basin May Cause MemorizationCode0
ChronoSense: Exploring Temporal Understanding in Large Language Models with Time Intervals of EventsCode0
KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from ServerCode0
Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats in Customized Large Language ModelsCode0
Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNsCode0
Extreme Memorization via Scale of InitializationCode0
Assessing Hidden Risks of LLMs: An Empirical Study on Robustness, Consistency, and CredibilityCode0
Enhancing elusive clues in knowledge learning by contrasting attention of language modelsCode0
Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMsCode0
Conditionally Strongly Log-Concave Generative ModelsCode0
FedDiv: Collaborative Noise Filtering for Federated Learning with Noisy LabelsCode0
Untrained neural networks can demonstrate memorization-independent abstract reasoningCode0
Enhanced Membership Inference Attacks against Machine Learning ModelsCode0
Causal Cartographer: From Mapping to Reasoning Over Counterfactual WorldsCode0
Fewer Errors, but More Stereotypes? The Effect of Model Size on Gender BiasCode0
Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion ModelsCode0
In-context Learning in Presence of Spurious CorrelationsCode0
Improving the Gating Mechanism of Recurrent Neural NetworksCode0
Improving Generalization in Meta-Learning via Meta-Gradient AugmentationCode0
Show:102550
← PrevPage 7 of 22Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM-540B (few-shot, k=5)Accuracy95.4Unverified
2Gopher-280B (few-shot, k=5)Accuracy80Unverified
3PaLM-62B (few-shot, k=5)Accuracy77.7Unverified