SOTAVerified

Memorization

Papers

Showing 10511075 of 1088 papers

TitleStatusHype
Towards Data Contamination Detection for Modern Large Language Models: Limitations, Inconsistencies, and Oracle ChallengesCode0
Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMsCode0
ODIM: Outlier Detection via Likelihood of Under-Fitted Generative ModelsCode0
Dynamic Named Entity RecognitionCode0
Olive Oil is Made of Olives, Baby Oil is Made for Babies: Interpreting Noun Compounds using Paraphrases in a Neural ModelCode0
Affective Medical Estimation and Decision Making via Visualized Learning and Deep LearningCode0
DP-RDM: Adapting Diffusion Models to Private Domains Without Fine-TuningCode0
Tackling Noisy Labels with Network Parameter Additive DecompositionCode0
Do LLMs Memorize Recommendation Datasets? A Preliminary Study on MovieLens-1MCode0
Do LLMs Dream of Ontologies?Code0
A Causal View of Entity Bias in (Large) Language ModelsCode0
Robust Data Watermarking in Language Models by Injecting Fictitious KnowledgeCode0
What they do when in doubt: a study of inductive biases in seq2seq learnersCode0
Does fine-tuning GPT-3 with the OpenAI API leak personally-identifiable information?Code0
On Memorization in Probabilistic Deep Generative ModelsCode0
On Memorization in Probabilistic Deep Generative ModelsCode0
Robust Generalization and Safe Query-Specialization in Counterfactual Learning to RankCode0
Teach2Eval: An Indirect Evaluation Method for LLM by Judging How It TeachesCode0
ChronoSense: Exploring Temporal Understanding in Large Language Models with Time Intervals of EventsCode0
On the Generalization and Causal Explanation in Self-Supervised LearningCode0
ALIGNet: Partial-Shape Agnostic Alignment via Unsupervised LearningCode0
Causal Cartographer: From Mapping to Reasoning Over Counterfactual WorldsCode0
Towards More Realistic Extraction Attacks: An Adversarial PerspectiveCode0
Rotational Unit of MemoryCode0
The (ab)use of Open Source Code to Train Large Language ModelsCode0
Show:102550
← PrevPage 43 of 44Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM-540B (few-shot, k=5)Accuracy95.4Unverified
2Gopher-280B (few-shot, k=5)Accuracy80Unverified
3PaLM-62B (few-shot, k=5)Accuracy77.7Unverified