SOTAVerified

Memorization

Papers

Showing 101150 of 1088 papers

TitleStatusHype
To grok or not to grok: Disentangling generalization and memorization on corrupted algorithmic datasetsCode1
Cousins Of The Vendi Score: A Family Of Similarity-Based Diversity Metrics For Science And Machine LearningCode1
Beyond Memorization: Violating Privacy Via Inference with Large Language ModelsCode1
The Emergence of Reproducibility and Generalizability in Diffusion ModelsCode1
On Memorization in Diffusion ModelsCode1
Generalization in diffusion models arises from geometry-adaptive harmonic representationsCode1
AutomaTikZ: Text-Guided Synthesis of Scientific Vector Graphics with TikZCode1
Do PLMs Know and Understand Ontological Knowledge?Code1
MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain ConversationCode1
Image Synthesis under Limited Data: A Survey and TaxonomyCode1
Can Neural Network Memorization Be Localized?Code1
In-context Autoencoder for Context Compression in a Large Language ModelCode1
Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confidence EstimationCode1
DIAGNOSIS: Detecting Unauthorized Data Usages in Text-to-image Diffusion ModelsCode1
On information captured by neural networks: connections with memorization and generalizationCode1
A Preference-aware Meta-optimization Framework for Personalized Vehicle Energy Consumption EstimationCode1
Understanding quantum machine learning also requires rethinking generalizationCode1
Can Forward Gradient Match Backpropagation?Code1
Brain tumor segmentation using synthetic MR images -- A comparison of GANs and diffusion modelsCode1
Large Language Models Are Not Strong Abstract ReasonersCode1
Understanding and Mitigating Copying in Diffusion ModelsCode1
Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-Intensive TasksCode1
To Copy Rather Than Memorize: A Vertical Learning Paradigm for Knowledge Graph CompletionCode1
Sources of Hallucination by Large Language Models on Inference TasksCode1
Memorization for Good: Encryption with Autoregressive Language ModelsCode1
Bot or Human? Detecting ChatGPT Imposters with A Single QuestionCode1
Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4Code1
Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised LearningCode1
Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion ModelsCode1
C-SFDA: A Curriculum Learning Aided Self-Training Framework for Efficient Source Free Domain AdaptationCode1
Capabilities of GPT-4 on Medical Challenge ProblemsCode1
Progress measures for grokking via mechanistic interpretabilityCode1
DISC: Learning From Noisy Labels via Dynamic Instance-Specific Selection and CorrectionCode1
When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric MemoriesCode1
Execution-Based Evaluation for Open-Domain Code GenerationCode1
ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language ModelsCode1
Validating Large Language Models with ReLMCode1
Multi-center anatomical segmentation with heterogeneous labels via landmark-based modelsCode1
Robust Training of Graph Neural Networks via Noise GovernanceCode1
MemoNet: Memorizing All Cross Features' Representations Efficiently via Multi-Hash Codebook Network for CTR PredictionCode1
FreGAN: Exploiting Frequency Components for Training GANs under Limited DataCode1
Understanding Transformer Memorization Recall Through IdiomsCode1
Visual Localization via Few-Shot Scene Region ClassificationCode1
Towards Better Evaluation for Dynamic Link PredictionCode1
Continual Variational Autoencoder Learning via Online Cooperative MemorizationCode1
Large Loss Matters in Weakly Supervised Multi-Label ClassificationCode1
Contrastive Learning with Boosted MemorizationCode1
Memorization in NLP Fine-tuning MethodsCode1
Are Large Pre-Trained Language Models Leaking Your Personal Information?Code1
Towards Understanding Grokking: An Effective Theory of Representation LearningCode1
Show:102550
← PrevPage 3 of 22Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM-540B (few-shot, k=5)Accuracy95.4Unverified
2Gopher-280B (few-shot, k=5)Accuracy80Unverified
3PaLM-62B (few-shot, k=5)Accuracy77.7Unverified