SOTAVerified

Memorization

Papers

Showing 701750 of 1088 papers

TitleStatusHype
The Creative Frontier of Generative AI: Managing the Novelty-Usefulness Tradeoff0
Memorization Capacity of Multi-Head Attention in TransformersCode0
Towards Understanding Clean Generalization and Robust Overfitting in Adversarial Training0
Conditionally Strongly Log-Concave Generative ModelsCode0
On Influence Functions, Classification Influence, Relative Influence, Memorization and Generalization0
Training Data Extraction From Pre-trained Language Models: A Survey0
A Causal View of Entity Bias in (Large) Language ModelsCode0
Semantic-Enhanced Differentiable Search Index Inspired by Learning Strategies0
Mitigating Label Noise through Data AmbiguationCode0
HumBEL: A Human-in-the-Loop Approach for Evaluating Demographic Factors of Language Models in Human-Machine ConversationsCode0
Continual Dialogue State Tracking via Example-Guided Question AnsweringCode0
Can ChatGPT Defend its Belief in Truth? Evaluating LLM Reasoning via Debate0
How Spurious Features Are Memorized: Precise Analysis for Random and NTK FeaturesCode0
Assessing Hidden Risks of LLMs: An Empirical Study on Robustness, Consistency, and CredibilityCode0
A Memory Model for Question Answering from Streaming Data Supported by Rehearsal and Anticipation of Coreference Information0
PreCog: Exploring the Relation between Memorization and Performance in Pre-trained Language Models0
Surveying Generative AI's Economic Expectations0
When Newer is Not Better: Does Deep Learning Really Benefit Recommendation From Implicit Feedback?0
Mitigating Approximate Memorization in Language Models via Dissimilarity Learned Policy0
Redundancy and Concept Analysis for Code-trained Language Models0
Hopfield model with planted patterns: a teacher-student self-supervised learning model0
Emergent and Predictable Memorization in Large Language Models0
Why Does ChatGPT Fall Short in Providing Truthful Answers?0
An Evaluation on Large Language Model Outputs: Discourse and Memorization0
Transition Propagation Graph Neural Networks for Temporal NetworksCode0
When do you need Chain-of-Thought Prompting for ChatGPT?0
Per-Example Gradient Regularization Improves Learning Signals from Noisy Data0
Koala: An Index for Quantifying Overlaps with Pre-training Corpora0
Memorization Capacity of Neural Networks with Conditional Computation0
Query2doc: Query Expansion with Large Language Models0
Learning the Finer Things: Bayesian Structure Learning at the Instantiation Level0
Where We Are and What We're Looking At: Query Based Worldwide Image Geo-localization Using Hierarchies and Scenes0
Ancient Chinese Word Segmentation and Part-of-Speech Tagging Using Distant SupervisionCode0
Semiparametric Language Models Are Scalable Continual Learners0
The (ab)use of Open Source Code to Train Large Language ModelsCode0
Data-Copying in Generative Models: A Formal Framework0
What makes a language easy to deep-learn? Deep neural networks and humans similarly benefit from compositional structureCode0
Federated Nearest Neighbor Machine Translation0
Dynamic Named Entity RecognitionCode0
Targeted Attack on GPT-Neo for the SATML Language Model Data Extraction Challenge0
Few-Shot Table-to-Text Generation with Prompt Planning and Knowledge Memorization0
Coherence and Diversity through Noise: Self-Supervised Paraphrase Generation via Structure-Aware Denoising0
ResMem: Learn what you can and memorize the rest0
Sharp Lower Bounds on Interpolation by Deep ReLU Neural Networks at Irregularly Spaced Data0
Validation of machine learning based scenario generators0
Out-of-Distribution Detection based on In-Distribution Data Patterns Memorization with Modern Hopfield EnergyCode0
ODIM: Outlier Detection via Likelihood of Under-Fitted Generative ModelsCode0
OT-Filter: An Optimal Transport Filter for Learning With Noisy Labels0
Prototypical Mixing and Retrieval-Based Refinement for Label Noise-Resistant Image RetrievalCode0
Holistic Label Correction for Noisy Multi-Label ClassificationCode0
Show:102550
← PrevPage 15 of 22Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM-540B (few-shot, k=5)Accuracy95.4Unverified
2Gopher-280B (few-shot, k=5)Accuracy80Unverified
3PaLM-62B (few-shot, k=5)Accuracy77.7Unverified