SOTAVerified

Memorization

Papers

Showing 301350 of 1088 papers

TitleStatusHype
Information-Theoretic Progress Measures reveal Grokking is an Emergent Phase Transition0
Understanding the Local Geometry of Generative Model Manifolds0
Towards Robust and Parameter-Efficient Knowledge Unlearning for LLMsCode1
Get Confused Cautiously: Textual Sequence Memorization Erasure with Selective Entropy Maximization0
Quantifying the Corpus Bias Problem in Automatic Music Transcription SystemsCode0
MaxMind: A Memory Loop Network to Enhance Software Productivity based on Large Language Models0
Diffusion Model Meets Non-Exemplar Class-Incremental Learning and Beyond0
Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats in Customized Large Language ModelsCode0
Human foraging strategies flexibly adapt to resource distribution and time constraints0
Memorization Capacity for Additive Fine-Tuning with Small ReLU Networks0
Detecting, Explaining, and Mitigating Memorization in Diffusion ModelsCode2
Adaptive Pre-training Data Detection for Large Language Models via Surprising Tokens0
Embedding Space Selection for Detecting Memorization and Fingerprinting in Generative Models0
Strong Copyright Protection for Language Models via Adaptive Model Fusion0
LLMs' Understanding of Natural Language Revealed0
Towards Clean-Label Backdoor Attacks in the Physical WorldCode0
Graph Memory Learning: Imitating Lifelong Remembering and Forgetting of Brain Networks0
Untrained neural networks can demonstrate memorization-independent abstract reasoningCode0
Demystifying Verbatim Memorization in Large Language ModelsCode0
MemBench: Memorized Image Trigger Prompt Dataset for Diffusion ModelsCode1
Empirical Capacity Model for Self-Attention Neural Networks0
Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion ModelsCode0
Knowledge Mechanisms in Large Language Models: A Survey and Perspective0
Generalization v.s. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data0
How to Engage Your Readers? Generating Guiding Questions to Promote Active ReadingCode0
Building an Ethical and Trustworthy Biomedical AI Ecosystem for the Translational and Clinical Integration of Foundational Models0
Training Foundation Models as Data Compression: On Information, Model Weights and Copyright Law0
Unveiling Structural Memorization: Structural Membership Inference Attack for Text-to-Image Diffusion Models0
Learning Camouflaged Object Detection from Noisy Pseudo Label0
Bootstrapped Pre-training with Dynamic Identifier Prediction for Generative Retrieval0
Counting in Small Transformers: The Delicate Interplay between Attention and Feed-Forward LayersCode0
Mix-CPT: A Domain Adaptation Framework via Decoupling Knowledge Learning and Format Alignment0
Extracting Training Data from Document-Based VQA Models0
MUSE: Machine Unlearning Six-Way Evaluation for Language ModelsCode4
UnSeenTimeQA: Time-Sensitive Question-Answering Beyond LLMs' Memorization0
PII-Compass: Guiding LLM training data extraction prompts towards the target PII via grounding0
Towards More Realistic Extraction Attacks: An Adversarial PerspectiveCode0
Deciphering the Factors Influencing the Efficacy of Chain-of-Thought: Probability, Memorization, and Noisy ReasoningCode1
We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning?Code2
Detection and Measurement of Syntactic Templates in Generated Text0
Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge UtilizationCode1
Advancing Cross-domain Discriminability in Continual Learning of Vision-Language ModelsCode1
Sonnet or Not, Bot? Poetry Evaluation for Large Models and DatasetsCode1
Generative artificial intelligence in ophthalmology: multimodal retinal images for the diagnosis of Alzheimer's disease with convolutional neural networks0
Enhancing Data Privacy in Large Language Models through Private Association Editing0
SoK: Membership Inference Attacks on LLMs are Rushing Nowhere (and How to Fix It)Code1
Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted PhenomenonCode1
AlleNoise: large-scale text classification benchmark dataset with real-world label noiseCode1
modeLing: A Novel Dataset for Testing Linguistic Reasoning in Language Models0
Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction0
Show:102550
← PrevPage 7 of 22Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM-540B (few-shot, k=5)Accuracy95.4Unverified
2Gopher-280B (few-shot, k=5)Accuracy80Unverified
3PaLM-62B (few-shot, k=5)Accuracy77.7Unverified