SOTAVerified

Memorization

Papers

Showing 301-350 of 1088 papers

Title | Status | Hype
Autoencoder-based Initialization for Recurrent Neural Networks with a Linear Memory |  | 0
A Corrective View of Neural Networks: Representation, Memorization and Learning |  | 0
CopyrightShield: Spatial Similarity Guided Backdoor Defense against Copyright Infringement in Diffusion Models |  | 0
CopyJudge: Automated Copyright Infringement Identification and Mitigation in Text-to-Image Diffusion Models |  | 0
Audio Tagging by Cross Filtering Noisy Labels |  | 0
A Memory Model for Question Answering from Streaming Data Supported by Rehearsal and Anticipation of Coreference Information |  | 0
Historical Test-time Prompt Tuning for Vision Foundation Models |  | 0
Continuous Reinforcement Learning-based Dynamic Difficulty Adjustment in a Visual Working Memory Game |  | 0
A Mathematical Framework for Learning Probability Distributions |  | 0
Attention Beats Concatenation for Conditioning Neural Fields |  | 0
Continual Memory: Can We Reason After Long-Term Memorization? |  | 0
Grokking in Linear Estimators -- A Solvable Model that Groks without Understanding |  | 0
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers |  | 0
Continual Learning Should Move Beyond Incremental Classification |  | 0
Asynchronous Personalized Federated Learning through Global Memorization |  | 0
Context-based Virtual Adversarial Training for Text Classification with Noisy Labels |  | 0
Data-centric NLP Backdoor Defense from the Lens of Memorization |  | 0
FFNB: Forgetting-Free Neural Blocks for Deep Continual Visual Learning |  | 0
Few-Shot Table-to-Text Generation with Prompt Planning and Knowledge Memorization |  | 0
ALTBI: Constructing Improved Outlier Detection Models via Optimization of Inlier-Memorization Effect |  | 0
Confidence Adaptive Regularization for Deep Learning with Noisy Labels |  | 0
GP-MoLFormer: A Foundation Model For Molecular Generation |  | 0
Graph Memory Learning: Imitating Lifelong Remembering and Forgetting of Brain Networks |  | 0
Geometry of Neural Network Loss Surfaces via Random Matrix Theory |  | 0
Computation with Sequences in a Model of the Brain |  | 0
Get Confused Cautiously: Textual Sequence Memorization Erasure with Selective Entropy Maximization |  | 0
Compositional Generalization Requires More Than Disentangled Representations |  | 0
Extreme Image Transformations Facilitate Robust Latent Object Representations |  | 0
Extracting Unlearned Information from LLMs with Activation Steering |  | 0
A Comparative Study of Reservoir Computing for Temporal Signal Processing |  | 0
Graph Neural Networks Provably Benefit from Structural Information: A Feature Learning Perspective |  | 0
Extracting Training Data from Unconditional Diffusion Models |  | 0
Extracting Training Data from Document-Based VQA Models |  | 0
Combining Recurrent, Convolutional, and Continuous-time Models with Linear State Space Layers |  | 0
Extracting memorized pieces of (copyrighted) books from open-weight language models |  | 0
FINED: Feed Instance-Wise Information Need with Essential and Disentangled Parametric Knowledge from the Past |  | 0
Assessing Generalization in TD methods for Deep Reinforcement Learning |  | 0
Factored Agents: Decoupling In-Context Learning and Memorization for Robust Tool Use |  | 0
A Spline Theory of Deep Learning |  | 0
Fast Parametric Learning with Activation Memorization |  | 0
Fault-Diagnosing SLAM for Varying Scale Change Detection |  | 0
Assessing Intelligence in Artificial Neural Networks |  | 0
Expressive Power of ReLU and Step Networks under Floating-Point Operations |  | 0
Federated Nearest Neighbor Machine Translation |  | 0
Exposing the Achilles' Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning |  | 0
Few-Shot Generation of Brain Tumors for Secure and Fair Data Sharing |  | 0
Combating Label Noise With A General Surrogate Model For Sample Selection |  | 0
Exploring the Hidden Reasoning Process of Large Language Models by Misleading Them |  | 0
Co-matching: Combating Noisy Labels by Augmentation Anchoring |  | 0
Adaptive Pre-training Data Detection for Large Language Models via Surprising Tokens |  | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PaLM-540B (few-shot, k=5) | Accuracy | 95.4 |  | Unverified
2 | Gopher-280B (few-shot, k=5) | Accuracy | 80 |  | Unverified
3 | PaLM-62B (few-shot, k=5) | Accuracy | 77.7 |  | Unverified