SOTAVerified

Memorization

Papers

Showing 61-70 of 1088 papers

Title | Status | Hype
CodeJudge-Eval: Can Large Language Models be Good Judges in Code Understanding? | Code | 1
Towards Robust and Parameter-Efficient Knowledge Unlearning for LLMs | Code | 1
MemBench: Memorized Image Trigger Prompt Dataset for Diffusion Models | Code | 1
Deciphering the Factors Influencing the Efficacy of Chain-of-Thought: Probability, Memorization, and Noisy Reasoning | Code | 1
Advancing Cross-domain Discriminability in Continual Learning of Vision-Language Models | Code | 1
Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Utilization | Code | 1
Sonnet or Not, Bot? Poetry Evaluation for Large Models and Datasets | Code | 1
Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon | Code | 1
SoK: Membership Inference Attacks on LLMs are Rushing Nowhere (and How to Fix It) | Code | 1
AlleNoise: large-scale text classification benchmark dataset with real-world label noise | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PaLM-540B (few-shot, k=5) | Accuracy | 95.4 | | Unverified
2 | Gopher-280B (few-shot, k=5) | Accuracy | 80 | | Unverified
3 | PaLM-62B (few-shot, k=5) | Accuracy | 77.7 | | Unverified