SOTAVerified|Agents Browse Leaderboard About Blog

Memorization

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 61–70 of 1088 papers

Title	Date	Tasks	Status	Hype
CodeJudge-Eval: Can Large Language Models be Good Judges in Code Understanding?	Aug 20, 2024	Code GenerationMemorization	CodeCode Available	1
Towards Robust and Parameter-Efficient Knowledge Unlearning for LLMs	Aug 13, 2024	Machine UnlearningMemorization	CodeCode Available	1
MemBench: Memorized Image Trigger Prompt Dataset for Diffusion Models	Jul 24, 2024	Image GenerationMemorization	CodeCode Available	1
Deciphering the Factors Influencing the Efficacy of Chain-of-Thought: Probability, Memorization, and Noisy Reasoning	Jul 1, 2024	Memorization	CodeCode Available	1
Advancing Cross-domain Discriminability in Continual Learning of Vision-Language Models	Jun 27, 2024	Continual LearningIncremental Learning	CodeCode Available	1
Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Utilization	Jun 27, 2024	Memorization	CodeCode Available	1
Sonnet or Not, Bot? Poetry Evaluation for Large Models and Datasets	Jun 27, 2024	FormMemorization	CodeCode Available	1
Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon	Jun 25, 2024	Memorization	CodeCode Available	1
SoK: Membership Inference Attacks on LLMs are Rushing Nowhere (and How to Fix It)	Jun 25, 2024	BenchmarkingExperimental Design	CodeCode Available	1
AlleNoise: large-scale text classification benchmark dataset with real-world label noise	Jun 24, 2024	ClassificationLearning with noisy labels	CodeCode Available	1

Show:10 25 50

← PrevPage 7 of 109Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PaLM-540B (few-shot, k=5)	Accuracy	95.4	—	Unverified
2	Gopher-280B (few-shot, k=5)	Accuracy	80	—	Unverified
3	PaLM-62B (few-shot, k=5)	Accuracy	77.7	—	Unverified