SOTAVerified

Memorization

Papers

Showing 101150 of 1088 papers

TitleStatusHype
Do We Need Zero Training Loss After Achieving Zero Training Error?Code1
Data Contamination Can Cross Language BarriersCode1
Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language ModelsCode1
Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization CorrelationsCode1
Data Contamination Quiz: A Tool to Detect and Estimate Contamination in Large Language ModelsCode1
Hyperspectral Image Super-Resolution with Spectral Mixup and Heterogeneous DatasetsCode1
Grounding Consistency: Distilling Spatial Common Sense for Precise Visual Relationship DetectionCode1
Beyond Gradient Averaging in Parallel Optimization: Improved Robustness through Gradient Agreement FilteringCode1
Eicient Non-Sampling Factorization Machines for Optimal Context-Aware RecommendationCode1
Beyond Memorization: The Challenge of Random Memory Access in Language ModelsCode1
Beyond Memorization: Violating Privacy Via Inference with Large Language ModelsCode1
Advancing Cross-domain Discriminability in Continual Learning of Vision-Language ModelsCode1
Cousins Of The Vendi Score: A Family Of Similarity-Based Diversity Metrics For Science And Machine LearningCode1
C-SFDA: A Curriculum Learning Aided Self-Training Framework for Efficient Source Free Domain AdaptationCode1
DASH: Warm-Starting Neural Network Training in Stationary Settings without Loss of PlasticityCode1
Memorization for Good: Encryption with Autoregressive Language ModelsCode1
Memorization Precedes Generation: Learning Unsupervised GANs with Memory NetworksCode1
MEOW: MEMOry Supervised LLM Unlearning Via Inverted FactsCode1
Co-teaching: Robust Training of Deep Neural Networks with Extremely Noisy LabelsCode1
Erasing Undesirable Influence in Diffusion ModelsCode1
Multi-center anatomical segmentation with heterogeneous labels via landmark-based modelsCode1
Euler State Networks: Non-dissipative Reservoir ComputingCode1
Graph Convolutional Memory using Topological PriorsCode1
Bot or Human? Detecting ChatGPT Imposters with A Single QuestionCode1
Antipodes of Label Differential Privacy: PATE and ALIBICode1
Execution-Based Evaluation for Open-Domain Code GenerationCode1
How does Transformer Learn Implicit Reasoning?Code1
Image Synthesis under Limited Data: A Survey and TaxonomyCode1
Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language ModelsCode1
Continual Variational Autoencoder Learning via Online Cooperative MemorizationCode1
Generalization through Memorization: Nearest Neighbor Language ModelsCode1
Contrastive Learning with Boosted MemorizationCode1
FreGAN: Exploiting Frequency Components for Training GANs under Limited DataCode1
Can Forward Gradient Match Backpropagation?Code1
Continual Memorization of Factoids in Large Language ModelsCode1
Generative Evaluation of Complex Reasoning in Large Language ModelsCode1
ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language ModelsCode1
Can Language Models Follow Multiple Turns of Entangled Instructions?Code1
Can LLM Graph Reasoning Generalize beyond Pattern Memorization?Code1
Can Neural Network Memorization Be Localized?Code1
Capabilities of GPT-4 on Medical Challenge ProblemsCode1
Generalization in diffusion models arises from geometry-adaptive harmonic representationsCode1
Are Large Pre-Trained Language Models Leaking Your Personal Information?Code1
Copyright Traps for Large Language ModelsCode1
A Preference-aware Meta-optimization Framework for Personalized Vehicle Energy Consumption EstimationCode1
GMAI-VL-R1: Harnessing Reinforcement Learning for Multimodal Medical ReasoningCode1
FineZip : Pushing the Limits of Large Language Models for Practical Lossless Text CompressionCode1
Few-Shot Single-View 3-D Object Reconstruction with Compositional PriorsCode1
Consensual Collaborative Training And Knowledge Distillation Based Facial Expression Recognition Under Noisy AnnotationsCode1
Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion ModelsCode1
Show:102550
← PrevPage 3 of 22Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM-540B (few-shot, k=5)Accuracy95.4Unverified
2Gopher-280B (few-shot, k=5)Accuracy80Unverified
3PaLM-62B (few-shot, k=5)Accuracy77.7Unverified