SOTAVerified

Memorization

Papers

Showing 351–375 of 1088 papers

Title | Status | Hype
Captured by Captions: On Memorization and its Mitigation in CLIP Models | - | 0
The Devil is in the Prompts: De-Identification Traces Enhance Memorization Risks in Synthetic Chest X-Ray Generation | Code | 0
MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations | - | 0
Mitigating Sensitive Information Leakage in LLMs4Code through Machine Unlearning | - | 0
A Lightweight Method to Disrupt Memorized Sequences in LLM | - | 0
An Analysis for Reasoning Bias of Language Models with Small Initialization | - | 0
Taking a Big Step: Large Learning Rates in Denoising Score Matching Prevent Memorization | - | 0
TReMu: Towards Neuro-Symbolic Temporal Reasoning for LLM-Agents with Memory in Multi-Session Dialogues | - | 0
Integrating LMM Planners and 3D Skill Policies for Generalizable Manipulation | - | 0
Compositional Generalization Requires More Than Disentangled Representations | - | 0
Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction | - | 0
FUNU: Boosting Machine Unlearning Efficiency by Filtering Unnecessary Unlearning | - | 0
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training | - | 0
Memorization and Regularization in Generative Diffusion Models | Code | 0
Decoding Generalization from Memorization in Deep Neural Networks | - | 0
On the Reasoning Capacity of AI Models and How to Quantify It | - | 0
RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation | - | 0
Test-time regression: a unifying framework for designing sequence models with associative memory | - | 0
Synthetic Data Can Mislead Evaluations: Membership Inference as Machine Text Detection | - | 0
Learning with Open-world Noisy Data via Class-independent Margin in Dual Representation Space | Code | 0
Enhancing Generalization in Chain of Thought Reasoning for Smaller Models | - | 0
CSTA: Spatial-Temporal Causal Adaptive Learning for Exemplar-Free Video Class-Incremental Learning | Code | 0
Modeling Neural Networks with Privacy Using Neural Stochastic Differential Equations | - | 0
Analyzing Memorization in Large Language Models through the Lens of Model Attribution | Code | 0
ChronoSense: Exploring Temporal Understanding in Large Language Models with Time Intervals of Events | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PaLM-540B (few-shot, k=5) | Accuracy | 95.4 | - | Unverified
2 | Gopher-280B (few-shot, k=5) | Accuracy | 80 | - | Unverified
3 | PaLM-62B (few-shot, k=5) | Accuracy | 77.7 | - | Unverified