SOTAVerified

Memorization

Papers

Showing 151200 of 1088 papers

TitleStatusHype
The Vendiscope: An Algorithmic Microscope For Data Collections0
Diffusing DeBias: a Recipe for Turning a Bug into a Feature0
Redistribute Ensemble Training for Mitigating Memorization in Diffusion ModelsCode0
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept UnderstandingCode0
Democratizing AI: Open-source Scalable LLM Training on GPU-based Supercomputers0
Captured by Captions: On Memorization and its Mitigation in CLIP Models0
The Devil is in the Prompts: De-Identification Traces Enhance Memorization Risks in Synthetic Chest X-Ray GenerationCode0
MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations0
Mitigating Sensitive Information Leakage in LLMs4Code through Machine Unlearning0
Mitigating Unintended Memorization with LoRA in Federated Learning for LLMsCode1
A Lightweight Method to Disrupt Memorized Sequences in LLM0
Taking a Big Step: Large Learning Rates in Denoising Score Matching Prevent Memorization0
LIMO: Less is More for ReasoningCode5
An Analysis for Reasoning Bias of Language Models with Small Initialization0
TReMu: Towards Neuro-Symbolic Temporal Reasoning for LLM-Agents with Memory in Multi-Session Dialogues0
Integrating LMM Planners and 3D Skill Policies for Generalizable Manipulation0
Compositional Generalization Requires More Than Disentangled Representations0
Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction0
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training0
FUNU: Boosting Machine Unlearning Efficiency by Filtering Unnecessary Unlearning0
Memorization and Regularization in Generative Diffusion ModelsCode0
Decoding Generalization from Memorization in Deep Neural Networks0
RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation0
On the Reasoning Capacity of AI Models and How to Quantify It0
Test-time regression: a unifying framework for designing sequence models with associative memory0
Synthetic Data Can Mislead Evaluations: Membership Inference as Machine Text Detection0
Learning with Open-world Noisy Data via Class-independent Margin in Dual Representation SpaceCode0
Enhancing Generalization in Chain of Thought Reasoning for Smaller Models0
CSTA: Spatial-Temporal Causal Adaptive Learning for Exemplar-Free Video Class-Incremental LearningCode0
Modeling Neural Networks with Privacy Using Neural Stochastic Differential Equations0
Analyzing Memorization in Large Language Models through the Lens of Model AttributionCode0
ChronoSense: Exploring Temporal Understanding in Large Language Models with Time Intervals of EventsCode0
Knowledge Memorization and Rumination for Pre-trained Model-based Class-Incremental Learning0
Variance-Based Membership Inference Attacks Against Large-Scale Image Captioning Models0
Representation in large language models0
Uncovering Memorization Effect in the Presence of Spurious Correlations0
VideoChat-Flash: Hierarchical Compression for Long-Context Video ModelingCode4
Attributing Culture-Conditioned Generations to Pretraining CorporaCode0
Elucidating Flow Matching ODE Dynamics with Respect to Data Geometries0
The Impact of Input Order Bias on Large Language Models for Software Fault Localization0
Beyond Gradient Averaging in Parallel Optimization: Improved Robustness through Gradient Agreement FilteringCode1
Think or Remember? Detecting and Directing LLMs Towards Memorization or Generalization0
Learning to Generate Gradients for Test-Time Adaptation via Test-Time Training LayersCode1
Accessing the topological properties of human brain functional sub-circuits in Echo State Networks0
Memorization Over Reasoning? Exposing and Mitigating Verbatim Memorization in Large Language Models' Character Understanding Evaluation0
Knowledge Boundary of Large Language Models: A Survey0
The Impact of Generalization Techniques on the Interplay Among Privacy, Utility, and Fairness in Image Classification0
Understanding and Mitigating Memorization in Diffusion Models for Tabular Data0
Too Big to Fool: Resisting Deception in Language Models0
The Complexity Dynamics of GrokkingCode1
Show:102550
← PrevPage 4 of 22Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM-540B (few-shot, k=5)Accuracy95.4Unverified
2Gopher-280B (few-shot, k=5)Accuracy80Unverified
3PaLM-62B (few-shot, k=5)Accuracy77.7Unverified