SOTAVerified

Memorization

Papers

Showing 726750 of 1088 papers

TitleStatusHype
Automating Pharmacovigilance Evidence Generation: Using Large Language Models to Produce Context-Aware SQL0
Bridging the Imitation Gap by Adaptive Insubordination0
Can ChatGPT Defend its Belief in Truth? Evaluating LLM Reasoning via Debate0
Can Label-Noise Transition Matrix Help to Improve Sample Selection and Label Correction?0
Captured by Captions: On Memorization and its Mitigation in CLIP Models0
Catastrophic Fisher Explosion: Early Phase Fisher Matrix Impacts Generalization0
Catching the Long Tail in Deep Neural Networks0
Anonymity Unveiled: A Practical Framework for Auditing Data Use in Deep Learning Models0
Challenges in Procedural Multimodal Machine Comprehension:A Novel Way To Benchmark0
Circuit-Based Intrinsic Methods to Detect Overfitting0
Closed-Book Training to Improve Summarization Encoder Memory0
CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation0
CNTN: Cyclic Noise-tolerant Network for Gait Recognition0
Codex Hacks HackerRank: Memorization Issues and a Framework for Code Synthesis Evaluation0
Coherence and Diversity through Noise: Self-Supervised Paraphrase Generation via Structure-Aware Denoising0
Collaborative Learning in General Graphs with Limited Memorization: Complexity, Learnability, and Reliability0
Collectionless Artificial Intelligence0
Co-matching: Combating Noisy Labels by Augmentation Anchoring0
Combating Label Noise With A General Surrogate Model For Sample Selection0
Combining Recurrent, Convolutional, and Continuous-time Models with Linear State-Space Layers0
Combining Recurrent, Convolutional, and Continuous-time Models with Linear State Space Layers0
Composing Global Optimizers to Reasoning Tasks via Algebraic Objects in Neural Nets0
Compositional Generalization Requires More Than Disentangled Representations0
Computation with Sequences in a Model of the Brain0
Confidence Adaptive Regularization for Deep Learning with Noisy Labels0
Show:102550
← PrevPage 30 of 44Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM-540B (few-shot, k=5)Accuracy95.4Unverified
2Gopher-280B (few-shot, k=5)Accuracy80Unverified
3PaLM-62B (few-shot, k=5)Accuracy77.7Unverified