SOTAVerified

Memorization

Papers

Showing 801850 of 1088 papers

TitleStatusHype
MoPe: Model Perturbation-based Privacy Attacks on Language Models0
Most Ligand-Based Classification Benchmarks Reward Memorization Rather than Generalization0
Mothman at SemEval-2024 Task 9: An Iterative System for Chain-of-Thought Prompt Optimization0
MSR: Making Self-supervised learning Robust to Aggressive Augmentations0
Multi-Sample Online Learning for Probabilistic Spiking Neural Networks0
Navigating the Latent Space Dynamics of Neural Models0
Network size and size of the weights in memorization with two-layers neural networks0
Network size and weights size for memorization with two-layers neural networks0
Neural Information Organizing and Processing -- Neural Machines0
Neural Network Memorization Dissection0
Neural Networks Learning and Memorization with (almost) no Over-Parameterization0
NEVLP: Noise-Robust Framework for Efficient Vision-Language Pre-training0
NLIP: Noise-robust Language-Image Pre-training0
NN-grams: Unifying neural network and n-gram language models for Speech Recognition0
Noisy Correspondence Learning with Self-Reinforcing Errors Mitigation0
None of the Others: a General Technique to Distinguish Reasoning from Memorization in Multiple-Choice LLM Evaluation Benchmarks0
Nonlinear random matrix theory for deep learning0
Not All Knowledge Is Created Equal: Mutual Distillation of Confident Knowledge0
Not-So-CLEVR: Visual Relations Strain Feedforward Neural Networks0
Obliviate: Efficient Unmemorization for Protecting Intellectual Property in Large Language Models0
OBLIVIATE: Robust and Practical Machine Unlearning for Large Language Models0
Olive Oil is Made Olives, Baby Oil is Made Babies: Interpreting Noun Compounds Using Paraphrases in a Neural Model0
On Influence Functions, Classification Influence, Relative Influence, Memorization and Generalization0
Online Learning via Memory: Retrieval-Augmented Detector Adaptation0
Online Memorization of Random Firing Sequences by a Recurrent Neural Network0
On Memorization and Privacy Risks of Sharpness Aware Minimization0
On Memorization of Large Language Models in Logical Reasoning0
On Retrieval Augmentation and the Limitations of Language Model Training0
On the Dichotomy Between Privacy and Traceability in _p Stochastic Convex Optimization0
On the Generalization Mystery in Deep Learning0
On the geometry of generalization and memorization in deep neural networks0
On the Interpolation Effect of Score Smoothing0
On the Memorization Properties of Contrastive Learning0
On the Optimal Memorization Power of ReLU Neural Networks0
On the Planning, Search, and Memorization Capabilities of Large Language Models0
On the Reasoning Capacity of AI Models and How to Quantify It0
On the Robustness of Monte Carlo Dropout Trained with Noisy Labels0
On the Role of Geometry in Geo-Localization0
On the Unintended Social Bias of Training Language Generation Models with Data from Local Media0
On the Unintended Social Bias of Training Language Generation Models with News Articles0
Towards Uncovering How Large Language Model Works: An Explainability Perspective0
Optimal Memorization Capacity of Transformers0
Optimizing Human Learning0
OT-Filter: An Optimal Transport Filter for Learning With Noisy Labels0
Overparameterized Neural Networks Can Implement Associative Memory0
Parameter Compression of Recurrent Neural Networks and Degradation of Short-term Memory0
Parsimonious Inference0
Patch-Based Diffusion Models Beat Whole-Image Models for Mismatched Distribution Inverse Problems0
PDFed: Privacy-Preserving and Decentralized Asynchronous Federated Learning for Diffusion Models0
Perceptron Theory Can Predict the Accuracy of Neural Networks0
Show:102550
← PrevPage 17 of 22Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM-540B (few-shot, k=5)Accuracy95.4Unverified
2Gopher-280B (few-shot, k=5)Accuracy80Unverified
3PaLM-62B (few-shot, k=5)Accuracy77.7Unverified