SOTAVerified

Memorization

Papers

Showing 601-650 of 1088 papers

Title | Status | Hype
Brain tumor segmentation using synthetic MR images -- A comparison of GANs and diffusion models | Code | 1
Memorization Capacity of Multi-Head Attention in Transformers | Code | 0
Towards Understanding Clean Generalization and Robust Overfitting in Adversarial Training | - | 0
Understanding and Mitigating Copying in Diffusion Models | Code | 1
Conditionally Strongly Log-Concave Generative Models | Code | 0
Large Language Models Are Not Strong Abstract Reasoners | Code | 1
Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-Intensive Tasks | Code | 1
Training Data Extraction From Pre-trained Language Models: A Survey | - | 0
On Influence Functions, Classification Influence, Relative Influence, Memorization and Generalization | - | 0
Semantic-Enhanced Differentiable Search Index Inspired by Learning Strategies | - | 0
A Causal View of Entity Bias in (Large) Language Models | Code | 0
Sources of Hallucination by Large Language Models on Inference Tasks | Code | 1
Mitigating Label Noise through Data Ambiguation | Code | 0
Continual Dialogue State Tracking via Example-Guided Question Answering | Code | 0
HumBEL: A Human-in-the-Loop Approach for Evaluating Demographic Factors of Language Models in Human-Machine Conversations | Code | 0
To Copy Rather Than Memorize: A Vertical Learning Paradigm for Knowledge Graph Completion | Code | 1
Can ChatGPT Defend its Belief in Truth? Evaluating LLM Reasoning via Debate | - | 0
How Spurious Features Are Memorized: Precise Analysis for Random and NTK Features | Code | 0
Memorization for Good: Encryption with Autoregressive Language Models | Code | 1
Assessing Hidden Risks of LLMs: An Empirical Study on Robustness, Consistency, and Credibility | Code | 0
A Memory Model for Question Answering from Streaming Data Supported by Rehearsal and Anticipation of Coreference Information | - | 0
Bot or Human? Detecting ChatGPT Imposters with A Single Question | Code | 1
PreCog: Exploring the Relation between Memorization and Performance in Pre-trained Language Models | - | 0
Surveying Generative AI's Economic Expectations | - | 0
Mitigating Approximate Memorization in Language Models via Dissimilarity Learned Policy | - | 0
When Newer is Not Better: Does Deep Learning Really Benefit Recommendation From Implicit Feedback? | - | 0
Redundancy and Concept Analysis for Code-trained Language Models | - | 0
Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4 | Code | 1
Causal Reasoning and Large Language Models: Opening a New Frontier for Causality | Code | 2
Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning | Code | 1
Hopfield model with planted patterns: a teacher-student self-supervised learning model | - | 0
Emergent and Predictable Memorization in Large Language Models | - | 0
Why Does ChatGPT Fall Short in Providing Truthful Answers? | - | 0
An Evaluation on Large Language Model Outputs: Discourse and Memorization | - | 0
Transition Propagation Graph Neural Networks for Temporal Networks | Code | 0
When do you need Chain-of-Thought Prompting for ChatGPT? | - | 0
Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling | Code | 6
Per-Example Gradient Regularization Improves Learning Signals from Noisy Data | - | 0
C-SFDA: A Curriculum Learning Aided Self-Training Framework for Efficient Source Free Domain Adaptation | Code | 1
Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models | Code | 1
Koala: An Index for Quantifying Overlaps with Pre-training Corpora | - | 0
Capabilities of GPT-4 on Medical Challenge Problems | Code | 1
Memorization Capacity of Neural Networks with Conditional Computation | - | 0
Query2doc: Query Expansion with Large Language Models | - | 0
Learning the Finer Things: Bayesian Structure Learning at the Instantiation Level | - | 0
Where We Are and What We're Looking At: Query Based Worldwide Image Geo-localization Using Hierarchies and Scenes | - | 0
Ancient Chinese Word Segmentation and Part-of-Speech Tagging Using Distant Supervision | Code | 0
Semiparametric Language Models Are Scalable Continual Learners | - | 0
The (ab)use of Open Source Code to Train Large Language Models | Code | 0
Data-Copying in Generative Models: A Formal Framework | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PaLM-540B (few-shot, k=5) | Accuracy | 95.4 | - | Unverified
2 | Gopher-280B (few-shot, k=5) | Accuracy | 80 | - | Unverified
3 | PaLM-62B (few-shot, k=5) | Accuracy | 77.7 | - | Unverified