
Dataset Distillation

Dataset distillation is the task of synthesizing a small dataset such that models trained on it achieve high performance on the original large dataset. A dataset distillation algorithm takes as input a large real dataset (the training set) and outputs a small synthetic distilled dataset, which is evaluated by training models on the distilled dataset and testing them on a separate real dataset (the validation/test set). A good small distilled dataset is not only useful for understanding the original data, but also has various applications (e.g., continual learning, privacy, and neural architecture search).
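The description above implies a two-stage protocol: an outer distillation stage that optimizes the synthetic examples, and an evaluation stage that trains a fresh model on them and measures accuracy on real held-out data. Below is a minimal PyTorch sketch of the evaluation stage only; the linear model, input shapes, hyperparameters, and names such as `evaluate_distilled` are illustrative assumptions for this page, not the method of any particular paper listed here.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Illustrative distilled set: 10 synthetic images per class for 10 classes,
# CIFAR-10-sized inputs. In an actual distillation method, x_syn would be
# the learnable output of the outer optimization (hence requires_grad=True).
num_classes, ipc = 10, 10
x_syn = torch.randn(num_classes * ipc, 3, 32, 32, requires_grad=True)
y_syn = torch.arange(num_classes).repeat_interleave(ipc)

def evaluate_distilled(x_syn, y_syn, test_loader, epochs=50):
    """Train a fresh model on the distilled data, then test on real data."""
    model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, num_classes))
    opt = torch.optim.SGD(model.parameters(), lr=0.01)
    for _ in range(epochs):
        opt.zero_grad()
        # Detach: the evaluation stage treats the synthetic data as fixed.
        F.cross_entropy(model(x_syn.detach()), y_syn).backward()
        opt.step()
    model.eval()
    correct = total = 0
    with torch.no_grad():
        for x, y in test_loader:  # real validation/test set
            correct += (model(x).argmax(1) == y).sum().item()
            total += y.numel()
    return correct / total
```

Because the trained model's test accuracy depends on random initialization, reported evaluations typically average this procedure over several freshly initialized models.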

Papers

Showing 191–200 of 216 papers (page 20 of 22)

Title | Status | Hype
----- | ------ | ----
Does Training with Synthetic Data Truly Protect Privacy? | Code | 0
Risk of Text Backdoor Attacks Under Dataset Distillation | Code | 0
Diversity-Driven Synthesis: Enhancing Dataset Distillation through Directed Weight Adjustment | Code | 0
Towards Adversarially Robust Dataset Distillation by Curvature Regularization | Code | 0
Neural Spectral Decomposition for Dataset Distillation | Code | 0
Distributional Dataset Distillation with Subtask Decomposition | Code | 0
Going Beyond Feature Similarity: Effective Dataset Distillation based on Class-Aware Conditional Mutual Information | Code | 0
Distill the Best, Ignore the Rest: Improving Dataset Distillation with Loss-Value-Based Pruning | Code | 0
UniDetox: Universal Detoxification of Large Language Models via Dataset Distillation | Code | 0
Dataset Distillation via Adversarial Prediction Matching | Code | 0

Leaderboard

No leaderboard results yet.