SOTAVerified

Memorization

Papers

Showing 326350 of 1088 papers

TitleStatusHype
Building an Ethical and Trustworthy Biomedical AI Ecosystem for the Translational and Clinical Integration of Foundational Models0
Training Foundation Models as Data Compression: On Information, Model Weights and Copyright Law0
Unveiling Structural Memorization: Structural Membership Inference Attack for Text-to-Image Diffusion Models0
Learning Camouflaged Object Detection from Noisy Pseudo Label0
Bootstrapped Pre-training with Dynamic Identifier Prediction for Generative Retrieval0
Counting in Small Transformers: The Delicate Interplay between Attention and Feed-Forward LayersCode0
Mix-CPT: A Domain Adaptation Framework via Decoupling Knowledge Learning and Format Alignment0
Extracting Training Data from Document-Based VQA Models0
MUSE: Machine Unlearning Six-Way Evaluation for Language ModelsCode4
UnSeenTimeQA: Time-Sensitive Question-Answering Beyond LLMs' Memorization0
PII-Compass: Guiding LLM training data extraction prompts towards the target PII via grounding0
Towards More Realistic Extraction Attacks: An Adversarial PerspectiveCode0
Deciphering the Factors Influencing the Efficacy of Chain-of-Thought: Probability, Memorization, and Noisy ReasoningCode1
We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning?Code2
Detection and Measurement of Syntactic Templates in Generated Text0
Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge UtilizationCode1
Advancing Cross-domain Discriminability in Continual Learning of Vision-Language ModelsCode1
Sonnet or Not, Bot? Poetry Evaluation for Large Models and DatasetsCode1
Generative artificial intelligence in ophthalmology: multimodal retinal images for the diagnosis of Alzheimer's disease with convolutional neural networks0
Enhancing Data Privacy in Large Language Models through Private Association Editing0
SoK: Membership Inference Attacks on LLMs are Rushing Nowhere (and How to Fix It)Code1
Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted PhenomenonCode1
AlleNoise: large-scale text classification benchmark dataset with real-world label noiseCode1
modeLing: A Novel Dataset for Testing Linguistic Reasoning in Language Models0
Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction0
Show:102550
← PrevPage 14 of 44Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM-540B (few-shot, k=5)Accuracy95.4Unverified
2Gopher-280B (few-shot, k=5)Accuracy80Unverified
3PaLM-62B (few-shot, k=5)Accuracy77.7Unverified