
Dataset Distillation

Dataset distillation is the task of synthesizing a small dataset such that models trained on it achieve high performance on the original large dataset. A dataset distillation algorithm takes as input a large real dataset (the training set) and outputs a small synthetic distilled dataset. The distilled dataset is evaluated by training models on it and testing them on a separate real dataset (the validation/test set). A good distilled dataset is useful not only for understanding the original data, but also for applications such as continual learning, privacy-preserving learning, and neural architecture search.
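To make the input/output contract concrete, below is a minimal sketch in PyTorch of the evaluation protocol described above: a placeholder distill step produces a small synthetic set, and a fresh model trained only on that set is scored on the real test set. Everything here (the names distill and evaluate, the MNIST-shaped tensors, the small MLP) is an illustrative assumption, not any specific paper's method; real algorithms differ in how the synthetic data is optimized (e.g., gradient matching or trajectory matching).

```python
# Sketch of the dataset-distillation evaluation protocol (assumed setup:
# MNIST-shaped inputs, 10 classes, a small MLP). Hypothetical names throughout.
import torch
import torch.nn as nn
import torch.nn.functional as F

def distill(train_set, images_per_class=10, num_classes=10):
    """Placeholder: return a small learnable synthetic dataset.

    A real algorithm would optimize x_syn against train_set so that
    training on the synthetic data mimics training on the real data.
    """
    x_syn = torch.randn(num_classes * images_per_class, 1, 28, 28, requires_grad=True)
    y_syn = torch.arange(num_classes).repeat_interleave(images_per_class)
    # ... optimization of x_syn against train_set would go here ...
    return x_syn.detach(), y_syn

def evaluate(x_syn, y_syn, test_loader, epochs=100):
    """Train a fresh model only on the distilled data, then test on real data."""
    model = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 128),
                          nn.ReLU(), nn.Linear(128, 10))
    opt = torch.optim.SGD(model.parameters(), lr=0.01)
    for _ in range(epochs):
        opt.zero_grad()
        F.cross_entropy(model(x_syn), y_syn).backward()
        opt.step()
    # Test accuracy of the distilled-data-trained model on the real test set.
    correct = total = 0
    with torch.no_grad():
        for x, y in test_loader:
            correct += (model(x).argmax(dim=1) == y).sum().item()
            total += y.numel()
    return correct / total
```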

Papers

Showing 91–100 of 216 papers

Title | Status | Hype
Finding Stable Subnetworks at Initialization with Dataset Distillation | | 0
Robust Dataset Distillation by Matching Adversarial Trajectories | | 0
Understanding Dataset Distillation via Spectral Filtering | | 0
Secure Federated Data Distillation | | 0
Does Training with Synthetic Data Truly Protect Privacy? | Code | 0
The Evolution of Dataset Distillation: Toward Scalable and Generalizable Solutions | | 0
Trust-Aware Diversion for Data-Effective Distillation | | 0
Dark Distillation: Backdooring Distilled Datasets without Accessing Raw Data | | 0
TD3: Tucker Decomposition Based Dataset Distillation Method for Sequential Recommendation | Code | 0
Knowledge Hierarchy Guided Biological-Medical Dataset Distillation for Domain LLM Training | | 0

Leaderboard

No leaderboard results yet.