SOTAVerified

Memorization

Papers

Showing 91100 of 1088 papers

TitleStatusHype
Quantifying Contamination in Evaluating Code Generation Capabilities of Language ModelsCode1
Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language ModelsCode1
Copyright Traps for Large Language ModelsCode1
The Mirrored Influence Hypothesis: Efficient Data Influence Estimation by Harnessing Forward PassesCode1
Erasing Undesirable Influence in Diffusion ModelsCode1
RoleEval: A Bilingual Role Evaluation Benchmark for Large Language ModelsCode1
Negative Pre-aware for Noisy Cross-modal MatchingCode1
Sparse Low-rank Adaptation of Pre-trained Language ModelsCode1
Data Contamination Quiz: A Tool to Detect and Estimate Contamination in Large Language ModelsCode1
DEPN: Detecting and Editing Privacy Neurons in Pretrained Language ModelsCode1
Show:102550
← PrevPage 10 of 109Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM-540B (few-shot, k=5)Accuracy95.4Unverified
2Gopher-280B (few-shot, k=5)Accuracy80Unverified
3PaLM-62B (few-shot, k=5)Accuracy77.7Unverified