SOTAVerified

Memorization

Papers

Showing 951–1000 of 1088 papers

| Title | Status | Hype |
| --- | --- | --- |
| Weak and Strong Gradient Directions: Explaining Memorization, Generalization, and Hardness of Examples at Scale | | 0 |
| State-of-the-Art Augmented NLP Transformer models for direct and single-step retrosynthesis | Code | 1 |
| An Efficient Method of Training Small Models for Regression Problems with Knowledge Distillation | | 0 |
| Do We Need Zero Training Loss After Achieving Zero Training Error? | Code | 1 |
| Improving Generalization by Controlling Label-Noise Information in Neural Network Weights | Code | 1 |
| Learning Not to Learn in the Presence of Noisy Labels | | 0 |
| Self-Attentive Associative Memory | Code | 1 |
| A Corrective View of Neural Networks: Representation, Memorization and Learning | | 0 |
| Encoding-based Memory Modules for Recurrent Neural Networks | | 0 |
| EgoMap: Projective mapping and structured egocentric memory for Deep RL | | 0 |
| Towards GAN Benchmarks Which Require Generalization | | 0 |
| Online Memorization of Random Firing Sequences by a Recurrent Neural Network | | 0 |
| Searching to Exploit Memorization Effect in Learning with Noisy Labels | | 0 |
| Learning Human Postural Control with Hierarchical Acquisition Functions | | 0 |
| The Labeling Distribution Matrix (LDM): A Tool for Estimating Machine Learning Algorithm Capacity | | 0 |
| Variational Recurrent Models for Solving Partially Observable Control Tasks | Code | 0 |
| Towards Robust Learning with Different Label Noise Distributions | Code | 0 |
| Meta-Learning without Memorization | Code | 0 |
| Semantic Mask for Transformer based End-to-End Speech Recognition | Code | 0 |
| Neural Networks Learning and Memorization with (almost) no Over-Parameterization | | 0 |
| Neural Network Memorization Dissection | | 0 |
| Searching to Exploit Memorization Effect in Learning from Corrupted Labels | Code | 0 |
| Generalization through Memorization: Nearest Neighbor Language Models | Code | 1 |
| On the Unintended Social Bias of Training Language Generation Models with Data from Local Media | | 0 |
| Robust Training with Ensemble Consensus | | 0 |
| Improving the Gating Mechanism of Recurrent Neural Networks | Code | 0 |
| Overparameterized Neural Networks Implement Associative Memory | Code | 0 |
| A Simple Approach to the Noisy Label Problem Through the Gambler's Loss | | 0 |
| On the Unintended Social Bias of Training Language Generation Models with News Articles | | 0 |
| Overparameterized Neural Networks Can Implement Associative Memory | | 0 |
| The Sooner The Better: Investigating Structure of Early Winning Lottery Tickets | | 0 |
| Assessing Generalization in TD methods for Deep Reinforcement Learning | | 0 |
| Autoencoder-based Initialization for Recurrent Neural Networks with a Linear Memory | | 0 |
| Fault-Diagnosing SLAM for Varying Scale Change Detection | | 0 |
| Span Selection Pre-training for Question Answering | Code | 0 |
| Learning sparsity in reservoir computing through a novel bio-inspired algorithm | | 0 |
| Circuit-Based Intrinsic Methods to Detect Overfitting | | 0 |
| Stolen Memories: Leveraging Model Memorization for Calibrated White-Box Membership Inference | | 0 |
| On the Role of Geometry in Geo-Localization | | 0 |
| Does Learning Require Memorization? A Short Tale about a Long Tail | | 0 |
| Decoupling Gating from Linearity | | 0 |
| Stable Rank Normalization for Improved Generalization in Neural Networks and GANs | | 0 |
| Suppressing Model Overfitting for Image Super-Resolution Networks | | 0 |
| Deep ReLU Networks Have Surprisingly Few Activation Patterns | | 0 |
| DAS3H: Modeling Student Learning and Forgetting for Optimally Scheduling Distributed Practice of Skills | Code | 0 |
| Lifelong Sequential Modeling with Personalized Memorization for User Response Prediction | Code | 1 |
| Better Generalization with On-the-fly Dataset Denoising | | 0 |
| Downsampling leads to Image Memorization in Convolutional Autoencoders | | 0 |
| Investigating CNNs' Learning Representation under label noise | | 0 |
| Long-Term Vehicle Localization by Recursive Knowledge Distillation | | 0 |

Benchmark Results

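| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PaLM-540B (few-shot, k=5) | Accuracy | 95.4 | | Unverified |
| 2 | Gopher-280B (few-shot, k=5) | Accuracy | 80 | | Unverified |
| 3 | PaLM-62B (few-shot, k=5) | Accuracy | 77.7 | | Unverified |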