SOTAVerified

Memorization

Papers

Showing 2650 of 1088 papers

TitleStatusHype
A Decade's Battle on Dataset Bias: Are We There Yet?Code2
LawBench: Benchmarking Legal Knowledge of Large Language ModelsCode2
Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMsCode2
Learning explanations that are hard to varyCode2
DS-1000: A Natural and Reliable Benchmark for Data Science Code GenerationCode2
Detecting, Explaining, and Mitigating Memorization in Diffusion ModelsCode2
Decoupling Knowledge from Memorization: Retrieval-augmented Prompt LearningCode2
Drive Like a Human: Rethinking Autonomous Driving with Large Language ModelsCode2
Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion modelsCode2
SimplyRetrieve: A Private and Lightweight Retrieval-Centric Generative AI ToolCode2
Data Contamination Quiz: A Tool to Detect and Estimate Contamination in Large Language ModelsCode1
Data Unlearning in Diffusion ModelsCode1
Antipodes of Label Differential Privacy: PATE and ALIBICode1
Advancing Cross-domain Discriminability in Continual Learning of Vision-Language ModelsCode1
Data Contamination Can Cross Language BarriersCode1
DAT: Training Deep Networks Robust To Label-Noise by Matching the Feature DistributionsCode1
Cousins Of The Vendi Score: A Family Of Similarity-Based Diversity Metrics For Science And Machine LearningCode1
C-SFDA: A Curriculum Learning Aided Self-Training Framework for Efficient Source Free Domain AdaptationCode1
Copyright Traps for Large Language ModelsCode1
Contrastive Learning with Boosted MemorizationCode1
A Preference-aware Meta-optimization Framework for Personalized Vehicle Energy Consumption EstimationCode1
Contrast to Divide: Self-Supervised Pre-Training for Learning with Noisy LabelsCode1
Co-teaching: Robust Training of Deep Neural Networks with Extremely Noisy LabelsCode1
Adaptive Early-Learning Correction for Segmentation from Noisy AnnotationsCode1
DASH: Warm-Starting Neural Network Training in Stationary Settings without Loss of PlasticityCode1
Show:102550
← PrevPage 2 of 44Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM-540B (few-shot, k=5)Accuracy95.4Unverified
2Gopher-280B (few-shot, k=5)Accuracy80Unverified
3PaLM-62B (few-shot, k=5)Accuracy77.7Unverified