SOTAVerified

Memorization

Papers

Showing 551600 of 1088 papers

TitleStatusHype
Thinking Tokens for Language Modeling0
Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory0
An Inversion-based Measure of Memorization for Diffusion ModelsCode0
To Each (Textual Sequence) Its Own: Improving Memorized-Data Unlearning in Large Language Models0
Exploring prompts to elicit memorization in masked language model-based named entity recognition0
Mothman at SemEval-2024 Task 9: An Iterative System for Chain-of-Thought Prompt Optimization0
Report on the AAPM Grand Challenge on deep generative modeling for learning medical image statistics0
Post-hoc and manifold explanations analysis of facial expression data based on deep learningCode0
Quantifying Memorization and Detecting Training Data of Pre-trained Language Models using Japanese Newspaper0
Rethinking LLM Memorization through the Lens of Adversarial Compression0
Does Your Neural Code Completion Model Use My Code? A Membership Inference Approach0
Reliable Model Watermarking: Defending Against Theft without Compromising on Evasion0
The Positivity of the Neural Tangent Kernel0
The Fault in our Stars: Quality Assessment of Code Generation Benchmarks0
AI Knowledge and Reasoning: Emulating Expert Creativity in Scientific Research0
GP-MoLFormer: A Foundation Model For Molecular Generation0
Towards Better Generalization in Open-Domain Question Answering by Mitigating Context Memorization0
What Can Transformer Learn with Varying Depth? Case Studies on Sequence Learning Tasks0
SoK: A Review of Differentially Private Linear Models For High-Dimensional Data0
Towards Memorization-Free Diffusion Models0
Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNsCode0
Protecting Copyrighted Material with Unique Identifiers in Large Language Model TrainingCode0
DP-RDM: Adapting Diffusion Models to Private Domains Without Fine-TuningCode0
Tackling Noisy Labels with Network Parameter Additive DecompositionCode0
Self-generated Replay Memories for Continual Neural Machine TranslationCode0
Uncertainty-Aware Pseudo-Label Filtering for Source-Free Unsupervised Domain AdaptationCode0
Soften to Defend: Towards Adversarial Robustness via Self-Guided Label Refinement0
Ethos: Rectifying Language Models in Orthogonal Parameter Space0
Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMsCode0
LLM-Oriented Retrieval Tuner0
ROME: Memorization Insights from Text, Logits and Representation0
Unveiling Privacy, Memorization, and Input Curvature Links0
Learning Associative Memories with Gradient Descent0
Unified View of Grokking, Double Descent and Emergent Abilities: A Perspective from Circuits Competition0
Retrieval Helps or Hurts? A Deeper Dive into the Efficacy of Retrieval Augmentation to Language ModelsCode0
Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?Code0
Amplifying Training Data Exposure through Fine-Tuning with Pseudo-Labeled MembershipsCode0
Towards Uncovering How Large Language Model Works: An Explainability Perspective0
Neural Information Organizing and Processing -- Neural Machines0
Information Complexity of Stochastic Convex Optimization: Applications to Generalization and Memorization0
Social Evolution of Published Text and The Emergence of Artificial Intelligence Through Large Language Models and The Problem of Toxicity and Bias0
Future Prediction Can be a Strong Evidence of Good History Representation in Partially Observable Environments0
Wasserstein proximal operators describe score-based generative models and resolve memorization0
Selective Forgetting: Advancing Machine Unlearning Techniques and Evaluation in Language Models0
Revisiting Early-Learning Regularization When Federated Learning Meets Noisy Labels0
Analyzing the Neural Tangent Kernel of Periodically Activated Coordinate Networks0
EMN: Brain-inspired Elastic Memory Network for Quick Domain Adaptive Feature Mapping0
Human-Centered Privacy Research in the Age of Large Language Models0
Déjà Vu Memorization in Vision-Language Models0
Unconditional Latent Diffusion Models Memorize Patient Imaging Data: Implications for Openly Sharing Synthetic DataCode0
Show:102550
← PrevPage 12 of 22Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM-540B (few-shot, k=5)Accuracy95.4Unverified
2Gopher-280B (few-shot, k=5)Accuracy80Unverified
3PaLM-62B (few-shot, k=5)Accuracy77.7Unverified