
Dataset Distillation

Dataset distillation is the task of synthesizing a small dataset such that models trained on it achieve high performance on the original large dataset. A dataset distillation algorithm takes as input a large real dataset to be distilled (training set) and outputs a small synthetic distilled dataset, which is evaluated by training models on the distilled dataset and testing them on a separate real dataset (validation/test set). A good small distilled dataset is not only useful for dataset understanding, but also has various downstream applications (e.g., continual learning, privacy, and neural architecture search).
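To make the evaluation protocol concrete, here is a minimal, illustrative PyTorch sketch: train a freshly initialized model on the small synthetic set, then measure its accuracy on the real held-out test set. The toy two-layer classifier, the hyperparameters, and the random placeholder tensors are all assumptions made for this example, not the method of any paper listed below (real evaluations typically use small ConvNets on data distilled from, e.g., CIFAR-10).

```python
# Minimal sketch of the dataset distillation evaluation protocol:
# train on the distilled set, test on the real test set.
import torch
import torch.nn as nn
import torch.nn.functional as F

def evaluate_distilled(syn_images, syn_labels, test_images, test_labels,
                       epochs=300, lr=0.01):
    """Train a randomly initialized model on the distilled set and
    report its accuracy on the real test set."""
    model = nn.Sequential(  # toy classifier; assumption, not a standard eval net
        nn.Flatten(),
        nn.Linear(syn_images[0].numel(), 128),
        nn.ReLU(),
        nn.Linear(128, int(syn_labels.max()) + 1),
    )
    opt = torch.optim.SGD(model.parameters(), lr=lr, momentum=0.9)
    for _ in range(epochs):
        # full-batch training is feasible because the distilled set is tiny
        opt.zero_grad()
        loss = F.cross_entropy(model(syn_images), syn_labels)
        loss.backward()
        opt.step()
    with torch.no_grad():
        preds = model(test_images).argmax(dim=1)
        return (preds == test_labels).float().mean().item()

if __name__ == "__main__":
    # Random placeholder data (10 classes, 1 distilled image per class);
    # a real run would load an actual distilled set and real test split.
    syn_x, syn_y = torch.randn(10, 3, 32, 32), torch.arange(10)
    test_x, test_y = torch.randn(1000, 3, 32, 32), torch.randint(0, 10, (1000,))
    print(f"test accuracy: {evaluate_distilled(syn_x, syn_y, test_x, test_y):.3f}")
```

Because the model is retrained from scratch at evaluation time, the distilled images themselves (not any particular set of trained weights) are what carry the information from the original dataset.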

Papers

Showing 31–40 of 216 papers

Title | Status | Hype
DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation | Code | 1
Dataset Factorization for Condensation | Code | 1
DataDAM: Efficient Dataset Distillation with Attention Matching | Code | 1
DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation | Code | 1
Distilling Datasets Into Less Than One Image | Code | 1
Are Large-scale Soft Labels Necessary for Large-scale Dataset Distillation? | Code | 1
Can pre-trained models assist in dataset distillation? | Code | 1
DiM: Distilling Dataset into Generative Model | Code | 1
CaO_2: Rectifying Inconsistencies in Diffusion-Based Dataset Distillation | Code | 1
Embarrassingly Simple Dataset Distillation | Code | 1
Page 4 of 22

No leaderboard results yet.