SOTAVerified

Memorization

Papers

Showing 871–880 of 1088 papers

| Title | Status | Hype |
| --- | --- | --- |
| Graph Memory Learning: Imitating Lifelong Remembering and Forgetting of Brain Networks | | 0 |
| Graph Neural Networks Provably Benefit from Structural Information: A Feature Learning Perspective | | 0 |
| Grokking as Compression: A Nonlinear Complexity Perspective | | 0 |
| Grokking in Linear Estimators -- A Solvable Model that Groks without Understanding | | 0 |
| Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers | | 0 |
| Historical Test-time Prompt Tuning for Vision Foundation Models | | 0 |
| Holistic Audit Dataset Generation for LLM Unlearning via Knowledge Graph Traversal and Redundancy Removal | | 0 |
| Hopfield model with planted patterns: a teacher-student self-supervised learning model | | 0 |
| How BPE Affects Memorization in Transformers | | 0 |
| How Do the Architecture and Optimizer Affect Representation Learning? On the Training Dynamics of Representations in Deep Neural Networks | | 0 |

Benchmark Results

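| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PaLM-540B (few-shot, k=5) | Accuracy | 95.4 | | Unverified |
| 2 | Gopher-280B (few-shot, k=5) | Accuracy | 80 | | Unverified |
| 3 | PaLM-62B (few-shot, k=5) | Accuracy | 77.7 | | Unverified |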