| A Decade's Battle on Dataset Bias: Are We There Yet? | Mar 13, 2024 | Memorization | CodeCode Available | 2 | 5 |
| SimplyRetrieve: A Private and Lightweight Retrieval-Centric Generative AI Tool | Aug 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Learning explanations that are hard to vary | Sep 1, 2020 | Memorization | CodeCode Available | 2 | 5 |
| HeuriGym: An Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial Optimization | Jun 9, 2025 | Combinatorial OptimizationMemorization | CodeCode Available | 2 | 5 |
| Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models | Jun 7, 2023 | DiversityImage Generation | CodeCode Available | 2 | 5 |
| Detecting, Explaining, and Mitigating Memorization in Diffusion Models | Jul 31, 2024 | Image GenerationMemorization | CodeCode Available | 2 | 5 |
| Decoupling Knowledge from Memorization: Retrieval-augmented Prompt Learning | May 29, 2022 | Few-Shot Text ClassificationMemorization | CodeCode Available | 2 | 5 |
| Drive Like a Human: Rethinking Autonomous Driving with Large Language Models | Jul 14, 2023 | Autonomous DrivingCommon Sense Reasoning | CodeCode Available | 2 | 5 |
| DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation | Nov 18, 2022 | Code GenerationMemorization | CodeCode Available | 2 | 5 |
| LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models | Apr 14, 2025 | Equation DiscoveryMemorization | CodeCode Available | 2 | 5 |
| Data Contamination Quiz: A Tool to Detect and Estimate Contamination in Large Language Models | Nov 10, 2023 | GSM8KMemorization | CodeCode Available | 1 | 5 |
| Data Unlearning in Diffusion Models | Mar 2, 2025 | Machine UnlearningMemorization | CodeCode Available | 1 | 5 |
| Beyond Gradient Averaging in Parallel Optimization: Improved Robustness through Gradient Agreement Filtering | Dec 24, 2024 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| Advancing Cross-domain Discriminability in Continual Learning of Vision-Language Models | Jun 27, 2024 | Continual LearningIncremental Learning | CodeCode Available | 1 | 5 |
| Data Contamination Can Cross Language Barriers | Jun 19, 2024 | Memorization | CodeCode Available | 1 | 5 |
| DAT: Training Deep Networks Robust To Label-Noise by Matching the Feature Distributions | Jun 19, 2021 | Learning with noisy labelsMemorization | CodeCode Available | 1 | 5 |
| Cousins Of The Vendi Score: A Family Of Similarity-Based Diversity Metrics For Science And Machine Learning | Oct 19, 2023 | DiversityMemorization | CodeCode Available | 1 | 5 |
| C-SFDA: A Curriculum Learning Aided Self-Training Framework for Efficient Source Free Domain Adaptation | Mar 30, 2023 | Domain AdaptationMemorization | CodeCode Available | 1 | 5 |
| Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization Correlations | Mar 21, 2024 | BenchmarkingMemorization | CodeCode Available | 1 | 5 |
| AutomaTikZ: Text-Guided Synthesis of Scientific Vector Graphics with TikZ | Sep 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Zero-Shot Compositional Policy Learning via Language Grounding | Apr 15, 2020 | DescriptiveDomain Adaptation | CodeCode Available | 1 | 5 |
| Co-teaching: Robust Training of Deep Neural Networks with Extremely Noisy Labels | Apr 18, 2018 | Image ClassificationLearning with noisy labels | CodeCode Available | 1 | 5 |
| DASH: Warm-Starting Neural Network Training in Stationary Settings without Loss of Plasticity | Oct 30, 2024 | Memorization | CodeCode Available | 1 | 5 |
| Adaptive Early-Learning Correction for Segmentation from Noisy Annotations | Oct 7, 2021 | ClassificationMedical Image Segmentation | CodeCode Available | 1 | 5 |
| Deciphering the Factors Influencing the Efficacy of Chain-of-Thought: Probability, Memorization, and Noisy Reasoning | Jul 1, 2024 | Memorization | CodeCode Available | 1 | 5 |